COMPUTATIONAL 

GEOMETRY 


19951023  140 


Stony  Brook,  New  York 
June  6-8,  1994 

Sponsored  by  the  ACM  Special  Interest  Groups 
for  Graphics  and  Algorithms  and  Computation  Theory 


P  R  E  SS 


ri-v 


THIS 


REPORT  DOCUMENTATION  PAGE 


form  Appro**^ 

0MB  NO  0704-CtBB 


Sa«i»  jO*  wq  e  - •n«T  TVM  ANO  OATES  COVtUIO 


,r»  -•■-■"'fl  ITJ'-TI  .‘L.7-'^  .’..  Bu.ar«  «t.m.«  o-  «"»»  f^^’Z'r 

■tacm. 

oc  iosei 


1.  AGENCY  USt  ONLY  (LtM  bl*nk>  I  i-  WO«T  0*^* 

I  Jun  95 

I  4.  TITLE  ANO  SUITITLi 

Symposium  on  Computational  Geometry 

ft.  author(S) 

Joseph  Mitchell 

['?.  MMOHMINC  0«0ANI2ATi0H  MAM«»»  *^<0  *OOHlSS(t>» 

State  University  of  New  York  at.  Stony  Brook 
Stony  Brook,  NY  11794-3600 

IT  sroHS0l»IM(i/M0NlT0«IM6  AOIHCT  BHO  AOOPHSMliT 

U.S.  Amy  Research  Office 

LsMrch*Ttuigle  Park,  NC  27709-2211 


3.  repost  TY«  ANO  OATIS  COVmtu 

F-fnal  9  May  94-8 _ 

S.  FUNOINC  NUMIIRS 


DAAH04-94-G-0225 


7  perfosming  OaCANUAnui* 
aiSORT  numiir 


10.  SRONSbaiNG  MOelTOaiNO 

A6INCY  aiRORT  NUMSI* 


ARO  33022. 1-MA-CF 


11.  sumCMtHTASt  noTR  findtnES  contained  in  this  report  are  those  of  the 

The  views,  opinions  and/or  f  ®  .  official  Department  of  the  Army 

author(s)  and  should  not  be  constmed  as  *  ^  ^  otYitr  documentation. 

...Itlon.  policy,  or  d.cl.l.n.  .o  by  ot  . .  ^ 


['l2«.  WSTHItUTION  /  AVAItAMJTY  STATtMIMT 

Approved  for  public  release;  distribution  unlimited, 
tu.  aSSTIUCT  (MPMimumiOOwerM 


This  proceedings  contains  the  42  contributed  papers  which  were  selected  from  a  total  of 
113  submitted  extended  abstracts.  The  program  committee  met  m  Arlington,  Virginia,  on 
January  21-22,  1994.  The  selection  was  based  on  the  quality  and  originality  of  the  results 
and  their  relevance  to  computational  geometry.  The  papers  generally  represent  preliminary 
reports  of  ongoing  research  and  it  is  expected  that  most  of  them  will  eventually  appear  m 
more  polished  form  in  refereed  journals. 


U.  SUSJICr  TERMS 


17.  SICURtTT  OASSIFICATioir 
OR  RiRORT 
UNCLASSIFIED 

NSN  ;S4(M)t-2aO-SSOO 


IS.  SICURITY  OASSWICATIOM 

or  THIS  rAOi 

UNCLASSIFIED 


[  is.  SICURITY  CLASSirtCATioir 

or  asstract 
UNCLASSIFIED 


15.  numhR  or  rAet» 


IS.  Mta  cool 

I'O.  UMtTATION  or  AR5TM«.i  j 

UL 


StanOArd  rofw  2SS  (Rt»  2*89) 
•*  ‘"H  iw  2»->« 

M<02 


PROCEEDINGS  OF  THE  TENTH  ANNtTAL  SYMPOSIUM  ON 


COMPUTATIONAL 

GEOMETRY 


By . . . . 

Distribution/ 

Availability  Codes 


Stony  Brook,  New  York 
June  6-8,  1994 


Sponsored  by  the  ACM  Special  Interest  Groups 
for  Graphics  and  Algorithms  and  Computation  Theory 


Avail  a  no  /  or 
Special 


A-l 


□  □ 


The  Association  for  Computing  Machinery 
1515  Broadway 
New  York,  N.Y.  10036 


Copyright  ®  1994  by  the  Association  for  Computing  Machinery,  Inc.  Copying  without  fee 
is  permitted  provided  that  the  copies  are  not  made  or  distributed  for  direct  commercial 
advantage, and  credit  to  the  source  is  given.  Abstracting  with  credit  is  permitted.  For  other 
copying  of  articles  that  carry  a  code  at  the  bottom  of  the  first  page,  copying  is  permitted 
provided  that  the  per-copy  fee  indicated  in  the  code  is  paid  through  the  Copyright  Clearance 
Center,  222  Rosewood  Drive,  Danvers,  MA  01923.  For  permission  to  republish  write  to: 
Director  of  Publications,  Association  for  Computing  Machinery.  To  copy  otherwise  or 
republish,  requires  a  fee  and/or  specific  permission. 


ACM  ISBN:  089791-648-4 
Additional  copies  may  be  ordered  prepaid  from: 


ACM  Order  Department 
P.O.  Box  12114 
Church  Street  Station 
New  York,  N.Y.  10257 


Phone:  1-800-342-6626 
(U.S.A.  and  Canada) 
1-212-626-0500 
(All  other  countries) 

Fax:  1-212-944-1318 
E-mail:  acmpubs@acm.org 


ACM  Order  Number:  429940 


Printed  in  the  U.S.A. 


Foreword 


This  volume  contains  papers  presented  at  the  Tenth  Annual  Symposium  on  Computa¬ 
tional  Geometry,  held  June  6-8,  1994,  in  Stony  Brook,  New  York.  The  Symposium  was 
sponsored  by  the  Special  Interest  Groups  for  Graphics  (SIGGRAPH)  and  for  Algorithms 
and  Computation  Theory  (SIGACT)  of  the  Association  for  Computing  Machinery  and  was 
partially  funded  with  the  generous  support  of  the  agencies  shown  below. 

This  proceedings  contains  the  42  contributed  papers  which  were  selected  from  a  total  of 
113  submitted  extended  abstracts.  The  program  committee  met  in  Arlington,  Virginia,  on 
January  21-22,  1994.  The  selection  was  based  on  the  quality  and  originality  of  the  results 
and  their  relevance  to  computational  geometry.  The  papers  generally  represent  preliminary 
reports  of  ongoing  research  and  it  is  expected  that  most  of  them  will  eventually  appear  in 
more  polished  form  in  refereed  journals.  The  program  committee  wishes  to  thank  all  people 
that  helped  us  in  refereeing  the  papers. 

This  year,  for  the  second  time,  a  video  review  of  computational  geometry  was  created. 
Descriptions  of  the  accepted  videos  appear  in  these  proceedings.  For  more  details  see  the 
introduction  on  page  381. 

The  program  committee  wishes  to  thank  all  the  authors  who  submitted  abstracts  in  response 
to  the  call  for  papers. 
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Vertical  Decompositions  for  Triangles  in  3-Space* 


Mark  de  Berg^  Leonidas  J.  Guibas*  Dan  Halperin^ 


Abstract 

We  prove  that,  for  any  constant  £  >  0,  the  complexity 
of  the  vertical  decomposition  of  a  set  of  n  triangles  in 
three-dimensional  space  is  A'),  where  K  is  the 

complexity  of  the  arrangement  of  the  triangles.  For  a 
single  cell  the  complexity  of  the  vertical  decomposition 
is  shown  to  be  0(n^+^).  These  bounds  are  almost  tight 
in  the  worst  case. 

We  also  give  a  deterministic  output-sensitive  algo¬ 
rithm  for  computing  the  vertical  decomposition  that 
runs  in  0(n^  logn  -f  F  logn)  time,  where  V  is  the  com¬ 
plexity  of  the  decomposition.  The  algorithm  is  reason¬ 
ably  simple  (in  particular,  it  tries  to  perform  as  much  of 
the  computation  in  two-dimensional  spaces  as  possible) 
and  thus  is  a  good  candidate  for  efficient  implementa¬ 
tions. 

1  Introduction 

The  study  of  arrangements  plays  a  fundamental  role  in 
geometric  computing.  An  arrangement  is  the  partition 
of  a  Euclidean  space  into  cells,  as  induced  by  a  collection 
of  possibly  highly  inter-penetrating  objects.  A  surpris¬ 
ing  number  of  seemingly  unrelated  geometric  problems 

’Mark  de  Berg  was  supported  by  the  Dutch  Organization  for 
Scientific  Research  (N.W.O.),  and  by  ESPRIT  Basic  Research  Ac¬ 
tion  No.  7141  (project  ALCOM  IhAlgorithms  and  Complexity). 
Leonidas  Guibas  was  supported  by  NSF  grant  CCR-9215219,  by 
a  grant  from  the  Stanford  SIMA  Consortium,  and  by  grants  from 
the  Digital  Equipment,  Mitsubishi,  and  Toshiba  Corporations. 
Dan  Halperin  was  supported  by  a  Rothschild  Postdoctoral  Fellow¬ 
ship,  by  a  grant  from  the  Stanford  Integrated  Manufacturing  As¬ 
sociation  (SIMA),  by  NSF/ARPA  Research  Grant  IRI-9306544, 
and  by  NSF  Grant  CCR-9215219. 

^Department  of  Computer  Science,  Utrecht  University, 
P.O.Box  80,089,  3508  TB  Utrecht,  the  Netherlands. 
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boil  down  to  the  study  of  certain  cells  in  such  an  ar¬ 
rangement.  A  famous  example  is  the  motion  planning 
problem  in  robotics.  Here  the  underlying  arrangement 
is  the  arrangement  in  configuration  space  of  the  con¬ 
straint  surfaces  defined  by  the  obstacles  and  the  robot. 
Because  of  these  numerous  applications,  much  research 
has  been  devoted  to  bounding  the  combinatorial  com¬ 
plexity  of  arrangements,  and  of  certain  important  sub¬ 
sets  of  arrangements  such  as  zones  and  single  cells. 

For  most  algorithmic  uses,  however,  a  raw  arrange¬ 
ment  is  an  unwieldy  structure.  The  difficulty  is  that 
cells  in  an  arrangement  can  have  very  complex  topolo¬ 
gies,  so  navigating  around  them  is  difficult.  What  we 
often  want  is  a  further  refinement  of  the  cells  into  pieces, 
such  as  simplices,  that  are  each  homeomorphic  to  a  ball 
and  have  constant  description  complexity.  Ideally,  the 
number  of  cells  after  the  refinement  should  be  propor¬ 
tional  to  the  overall  complexity  of  the  arrangement.  For 
arrangements  of  hyperplanes  the  well-known  bottom 
vertex  triangulation  [12]  meets  this  criterion.  For  more 
general  arrangements  such  refined  decompositions  are 
more  difficult  to  find.  For  example,  for  algebraic  hyper¬ 
surfaces  of  constant  maximum  degree  in  d-dimensional 
space  {d  >  3)  the  best  decomposition  technique  known 
so  far  results  in  cells  [8],  where  /3(n)  is  an 

extremely  slowly  growing  function,^  whereas  the  com¬ 
plexity  of  the  arrangement  itself  is  only  0{n'^). 

In  this  paper  we  study  decompositions  for  arrange¬ 
ments  of  triangles  in  3-dimensional  space.  The  sim¬ 
plest  way  to  decompose  such  an  arrangement  is  to  com¬ 
pute  the  bottom  vertex  triangulation  of  the  arrange¬ 
ment  of  the  planes  containing  the  triangles.  The  result¬ 
ing  decomposition  has  size  0(n^),  which  is  optimal  in 
the  worst  case.  In  many  applications,  however,  the  ac¬ 
tual  complexity  of  the  arrangement  of  triangles  is  much 
smaller.  So  the  challenge  is  to  obtain  a  decomposition 
whose  size  is  sensitive  to  the  complexity  of  the  arrange¬ 
ment  of  the  triangles. 

Such  a  complexity-sensitive  decomposition  was  given 
by  Aronov  and  Sharir  [2]:  their  Slicing  Theorem  states 
that  one  can  decompose  an  arrangement  of  n  triangles 
in  3-space  into  0(n^a(n)  -f-  K)  tetrahedra,  where  K  is 

^To  be  precise:  P(n)  =  2“(”F^  where  c  is  a  constant  depend¬ 
ing  on  the  dimension  and  the  degree  of  the  surfaces.  Here  and 
throughout  the  paper,  a{n)  is  the  extremely  slowly  growing  func¬ 
tional  inverse  of  Ackermann’s  function. 


1 


Figure  1:  The  vertical  wall  for  the  fat  edge  in  the  Slicing 
Theorem  and  in  the  vertical  decomposition. 


the  complexity  of  the  arrangement.  This  result  is  close 
to  optimal:  Q(K)  is  clearly  a  lower  bound  on  any  de¬ 
composition,  and  Chazelle  [5]  shows  that  there  are  ar¬ 
rangements  of  complexity  (9(n)  such  that  any  decompo¬ 
sition  into  convex  cells  has  size  n(n^).  (The  triangles  in 
Chazelle ’s  example  form  the  boundary  of  a  simple  poly¬ 
tope.)  The  Slicing  Theorem  obtains  a  decomposition  by 
adding  vertical  walls  for  each  of  the  triangle  boundary 
edges,  one  after  the  other.  The  wall  of  an  edge  e  is  ob¬ 
tained  by  “flooding”  the  zone  of  e  in  an  arrangement  on 
the  vertical  plane  H(e)  containing  e;  this  arrangement 
is  defined  by  intersections  of  H(e)  with  the  triangles  and 
with  already  added  walls.  See  Figure  1(a);  the  dashed 
segment  in  this  figure  is  a  previously  added  wall.  Af¬ 
ter  adding  the  walls  one  is  left  with  convex  cells  that 
can  easily  be  decomposed  into  tetrahedra.  The  Slic¬ 
ing  Theorem  decomposition  has  the  unpleasant  charac¬ 
teristic  that  it  depends  on  the  order  in  which  triangle 
boundary  edges  are  treated.  Thus  the  tetrahedra  in  the 
decomposition  are  not  defined  “locally”,  and  it  is  not 
canonical  in  the  sense  of  Chazelle  and  Friedman  [10]. 
This  means  that  the  decomposition  cannot  be  used  in 
randomized  incremental  algorithms.  It  also  makes  it 
difficult  to  compute  the  decomposition  efficiently. 

A  decomposition  which  does  not  have  this  problem — 
and  one  which  we  think  is  more  intuitive — is  the  fol¬ 
lowing  [11,  22,  23].  This  decomposition  is  also  obtained 
by  erecting  vertical  walls.  This  time  the  wall  for  edge 
e  simply  consists  of  those  points  in  H{e)  that  can  be 
connected  to  e  with  a  vertical  segment  that  does  not 
cross  any  of  the  triangles  in  T.  See  Figure  1(b).  This 
gives  us  a  first  decomposition  Vi  (T).  Secondly,  walls  are 
erected  from  the  intersection  edges  between  pairs  of  tri¬ 
angles  to  produce  a  finer  decomposition  V2(T).  Observe 
that  the  wall  erected  from  an  edge  is  not  obstructed  by 
other  walls,  so  the  decomposition  does  not  depend  on 
the  order  in  which  the  edges  are  treated.  We  call  this 
decomposition  the  vertical  decomposition  for  T  and  de¬ 
note  it  by  V(T)  =  V2(T).  Note  that  the  cells  in  V2{T) 
need  not  be  convex;  in  fact,  they  need  not  even  be  sim¬ 
ply  connected.  But  the  decomposition  can  easily  be 


refined  into  a  convex  subdivision  V3(T)  where  each  cell 
has  constant  complexity,  without  increasing  the  asymp¬ 
totic  complexity  of  the  subdivision — see  below  for  de¬ 
tails.  We  call  the  refined  subdivision  the  full  vertical 
decomposition^  for  T. 

In  this  paper  we  prove  bounds  on  the  maximum  com¬ 
binatorial  complexity  of  vertical  decompositions,  which 
are  sensitive  to  the  complexity  of  the  arrangement  of 
the  triangles.  More  precisely,  we  show  that,  for  any 
constant  e  >  0,  the  complexity  of  the  vertical  decompo¬ 
sition  of  a  set  T  of  n  triangles  in  3-space  is  +  K), 

where  K  is  the  complexity  of  the  arrangement  A{T) 
induced  by  T.  Our  proof  uses  an  interesting  combina¬ 
tion  of  efficient  hierarchical  cuttings  [6,  21] ,  the  counting 
scheme  used  in  hereditary  segment  trees  [9],  the  Slicing 
Theorem  [2],  and  random  sampling  [13,  19].  Our  proof 
can  be  adapted  to  show  that  the  vertical  decomposi¬ 
tion  of  a  single  cell  in  an  arrangement  of  triangles  has 
0(n^+®)  complexity.  We  also  give  an  example  of  a  set 
T  of  triangles  whose  vertical  decomposition  has  com¬ 
plexity  0(n^a^(n)),  whereas  the  complexity  of  A{T)  is 
only  0(na(n)).  This  shows  that  our  bound  is  close  to 
optimal. 

Secondly,  we  present  a  deterministic  algorithm  for 
constructing  V(T)  which  runs  in  time  O(n^logn  -f- 
Flogn),  where  V  is  the  complexity  of  V(T).  The  al¬ 
gorithm  is  reasonably  simple  (in  particular,  it  tries  to 
perform  as  much  of  the  computation  in  two-dimensional 
spaces  as  possible)  and  thus  is  a  good  candidate  for  ef¬ 
ficient  implementations.  The  algorithm  is  also  inter¬ 
esting  as  it  is  a  three-dimensional  version  of  a  Bentley- 
Ottmann  style  sweep  and  may  be  adaptable  to  compute 
other  partial  or  total  information  about  the  arrange¬ 
ment.  Our  approach  is  related  to  a  space  sweep  al¬ 
gorithm  that  has  recently  been  developed  to  compute 
a  decomposition  of  certain  arrangements  for  motion 
planning  problems  [17],  and  to  the  space  sweep  meth¬ 
ods  used  to  construct  point  location  data  structures  for 
monotone  subdivisions  [14,  25]. 

2  Preliminaries 

Let  T  =  {ti, . . . ,  tn}  be  a  collection  of  n  (possibly  in¬ 
tersecting)  triangles  in  IR®.  Let  A{T)  denote  the  ar¬ 
rangement  induced  by  T,  namely,  the  subdivision  of  3- 
space  into  cells  of  dimensions  0, 1, 2  and  3,  induced  by 
the  triangles  in  T.  We  assume  that  the  triangles  in  T 
are  in  general  position.  (For  details,  see,  e.g.,  [2]).  By 
standard  arguments  (see,  e.g.,  [26])  the  combinatorial 
bound  that  we  derive  in  Section  3  holds  for  degenerate 
arrangements  as  well.  However,  the  algorithm  described 

^Mulmuley  [23]  calls  VsfT)  the  vertical  decomposition,  and  he 
calls  V2(T)  the  cylindrical  decomposition. 
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in  Section  4  will  have  to  undergo  several  technical  ad¬ 
justments  (which  we  do  not  discuss  in  this  paper)  to 
accommodate  for  degenerate  arrangements. 

The  combinatorial  complexity  (or  complexity  in  short) 
of  an  arrangement  A  is  defined  to  be  the  overall  num¬ 
ber  of  cells  of  various  dimensions  in  A\  we  denote  the 
complexity  of  ^  by  |>1|. 

Central  to  the  concept  of  vertical  decompositions  is 
the  following  notion  of  visibility;  two  points  p,q  £ 
(vertically)  see  each  other  with  respect  to  T  if  and  only 
if  the  segment  ^  connecting  them  is  vertical  and  the 
interior  of  pq  does  not  intersect  any  triangle  in  T.  Usu¬ 
ally  the  set  T  is  clear  from  the  context  and  we  just  say 
that  that  p  and  q  see  each  other.  This  definition  is  ex¬ 
tended  to  objects  other  than  points  as  follows;  two  sets 
P,  <5  C  see  each  other  if  and  only  if  there  are  points 
p  £  P,  q  £  Q  that  see  each  other. 

We  define  two  three-dimensional  entities  related  to  a 
3D  curve  7.  Let  if (7)  be  the  vertical  surface  that  is 
the  union  of  all  the  vertical  lines  which  contain  a  point 
of  7.  Let  the  vertical  wall  extended  from  the  3D  curve 
7,  denoted  W(7,r),  be  defined  as  follows;  W{'y,T)  = 
{p  G  ;  p  sees  7}.  In  other  words,  W('f,T)  is  the 
union  of  all  vertical  segments  of  maximal  length  that 
have  a  point  of  7  as  an  endpoint  and  whose  interior 
does  not  intersect  any  triangle  in  T.  Note  that  some  of 
these  segments  can  be  rays. 

3  The  Combinatorial  Bound 

Let  T  =  {ti,...,tn}  be  a  set  of  triangles  in  as 
defined  above.  We  investigate  the  maximum  combina¬ 
torial  complexity  of  V(T)  =  V2(T)  as  a  function  of  n, 
the  number  of  triangles  in  T,  and  K,  the  complexity  of 
A{T).  Note  that  the  complexity  of  V(T)  is  at  least  K. 

We  denote  by  E{T)  the  set  of  segments  in  3-space 
which  are  either  an  edge  of  a  triangle  in  T  or  the  in¬ 
tersection  of  two  triangles  in  T.  We  call  the  segments 
in  E(T)  edges;  when  we  discuss  edges  of  triangles  in 
T  we  will  explicitly  say  triangle  boundary  edges,  and 
when  we  discuss  intersections  between  triangles  we  will 
say  intersection  edges.  For  an  edge  e  £  E{T)  we  define 
its  vertical  wall  to  be  W (e,  T).  Let  W{T)  ;=  {W (e,  T)  : 
e  £  E{T)}  be  the  collection  of  all  the  vertical  walls.  The 
vertical  decomposition  for  T  is  the  subdivision  induced 
by  the  set  T  U  >V(T). 

Let  us  first  consider  the  complexity  of  a  single  wall 
lF(e,T).  Recall  that  H{e)  is  the  vertical  surface  con¬ 
taining  e.  By  definition,  the  part  of  W{e,T)  which  is 
above  e  is  bounded  by  the  lower  envelope  of  the  seg¬ 
ments  which  are  intersections  of  the  other  triangles  in 
T  with  H{e)  and  which  lie  above  e.  See  also  Figure  1(b). 
It  is  well  known  that  the  complexity  of  the  lower  enve¬ 


lope  of  n  segments  in  the  plane  is  0{na(n))  [18].  A 
similar  argument  holds  for  the  part  of  W (e,  T)  below  e. 
Hence,  a  single  wall  has  complexity  0{na{n)).  Since 
there  are  3n  triangle  boundary  edges  we  have 

Observation  3.1  The  total  complexity  of  the  walls 
W(e,T)  for  all  boundary  edges  e  of  the  triangles  in  T 
is  0{n^a{n)). 

Notice  that  the  total  number  of  edges  in  E{T) — and 
thus  the  total  number  of  walls — is  O(n^).  It  follows 
that  the  maximum  complexity  of  V(T)  is  0(n^a(n)), 
as  was  noted  in  [23].  However,  it  is  not  clear  whether 
it  is  possible  that  all  walls  have  0(na(n))  complexity. 
Indeed,  below  we  show  that  this  cannot  happen  when 
A'  is  large.  When  A'  =  0(na(n)),  however,  most  walls 
can  have  large  complexity. 

Theorem  3.1  There  exists  a  set  T  of  n  triangles  in  IR^ 
with  |A(T)|  =  Q(na(n))  and  |V(T)|  =  0(n'^a^(n)). 

Proof:  Let  S'  be  a  set  of  [n/2j  line  segments 

in  the  pz-plane  whose  upper  envelope  has  complexity 
0(na(n))  [28],  and  such  that  S'  lies  completely  below 
the  plane  2  =  0.  Now  extend  each  segment  in  the  x- 
direction  to  obtain  a  set  T'  of  infinitely  long  strips,  that 
is,  let  T'  =  {[-00  ;  co]  X  s  ;  s  G  S').  (The  construction 
can  easily  be  modified  to  use  bounded  triangles  instead 
of  infinitely  long  strips.)  The  upper  envelope  of  T'  con¬ 
sists  of  0(na(n))  lines  that  are  parallel  to  the  x-axis. 
In  the  same  way  we  can  construct  a  set  T"  of  fu/2] 
strips  whose  lower  envelope  consists  of  Q{na{n))  lines 
that  are  parallel  to  the  y-axis,  and  that  lie  completely 
above  the  plane  2  =  0.  For  the  set  T  =  T'  UT"  we  have 
|A(r)|  =  e(na(n)),  and  |V(T)|  =  0(n2a2(n)).  □ 

Next,  we  will  establish  an  upper  bound  on  the  complex¬ 
ity  of  the  vertical  decompositions  of  sets  of  triangles  in 
3-space.  We  already  know  that  the  total  complexity 
of  the  walls  erected  for  the  triangle  boundary  edges  is 
0(n^a(n))  so  now  consider  a  wall  erected  from  an  inter¬ 
section  edge  e.  Let  5(e)  be  the  set  of  segments  which  are 
the  intersections  of  the  other  triangles  with  the  vertical 
surface  H{e). 

As  remarked  before,  the  part  of  W(e,T)  above  e  is 
bounded  by  the  lower  envelope  of  (the  parts  of)  the 
segments  in  5(e)  lying  above  e,  and  the  part  of  W (e,  T) 
below  e  is  bounded  by  the  upper  envelope  of  (the  parts 
of)  the  segments  in  5(e)  lying  below  e.  The  complexity 
of  W(e,T)  is  therefore  linear  in  the  number  of  points  of 
the  following  types;^ 

1.  endpoints  of  a  segment  I  £  5(e)  which  are  con¬ 
tained  in  e 

the  wall  has  constant  complexity  this  may  not  be  true,  but 
the  total  complexity  of  all  constant  complexity  walls  is  0{n3). 
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2.  endpoints  of  a  segment  I  £  S{e)  which  lie  strictly 
above  e  and  are  vertically  visible  from  e 

3.  intersections  between  two  segments  li,l2  G  S{e) 
which  are  vertically  visible  from  e 

The  first  type  of  feature  is  the  intersection  of  the  two 
triangles  that  define  e  and  the  triangle  that  defines  /, 
that  is,  it  is  a  vertex  of  AiT).  Each  vertex  participates 
in  a  constant  number  of  walls  so  the  total  number  of 
type  1  features  over  all  the  walls  is  0{K).  For  type  2 
we  note  that  the  endpoint  of  I  is  the  intersection  of 
a  boundary  edge  e'  of  the  triangle  that  defines  I  with 
H{e).  So  there  is  a  visibility  between  e  and  e',  which 
implies  that  e  defines  a  feature  of  W  (e',  T).  Recall  that 
the  sum  of  the  complexities  of  the  walls  erected  from 
triangle  boundary  edges  is  0(n'^a{n)).  Hence,  the  total 
number  of  type  2  features  is  also  0(n^a(n)).  In  the 
remainder  of  this  section  we  bound  the  total  number  of 
type  3  features. 

The  bipartite  case 

We  first  study  the  following  “bipartite”  version  of  the 
problem.  Let  be  a  fixed  non-vertical  plane,  let  Ti  be 
a  set  of  n  triangles  lying  completely  below  h,  and  let 
T2  be  a  set  of  n  triangles  lying  completely  above  h.  We 
want  to  bound  the  number  of  pairs  cj  £  E(Ti),e2  € 
E{T2)  that  can  see  each  other.  Let  6(Ti,T2)  denote  this 
number,  and  let  b(n)  be  the  maximum  value  of  b{Ti,T2) 
over  all  sets  Ti  and  T2  of  n  triangles  each,  as  defined 
above.  Our  example  which  proves  Theorem  3.1  also 
shows  that  b(n)  —  n(n^a^(n)).  We  now  establish  upper 
bounds  for  6(n).  Recall  that  we  only  need  to  consider 
visibilities  between  intersection  edges,  as  the  remaining 
number  of  visibilities  is  (9(n^a(n)). 

We  say  that  a  planar  curve  7  is  convex  if  and  only 
if  7  is  contained  in  the  boundary  of  its  convex  hull.  In 
other  words,  any  line  intersects  7  at  most  twice.  The 
following  simple  lemma  is  crucial  in  our  upper  bound 
proof. 

Lemma  3.1  Let  j  be  a  convex  curve  in  the  plane  h. 
Then  the  number  of  visibilities  between  segments  in 
E(Ti)  and  7  is  0(n2“*'”^),  for  i  =  1, 2. 

Proof:  Recall  that  1^(7)  is  the  vertical  surface  con¬ 
taining  7.  Define  t  n  iL(7)  and  T*  =  {f*  ;  f  £  Ti}. 
The  curve  7  sees  a  segment  e  £  E{Tx)  if  and  only  if 
er\H('y)  is  a  vertex  of  the  upper  envelope  of  T*.  The  in¬ 
tersection  tiOtj  of  two  triangles  intersects  H{j)  at  most 
twice.  Hence,  the  upper  envelope  of  the  set  {f  fl  H{'y)  : 
t  £  Ti}  has  complexity  A4(n)  =  (9(n2“^"^)  [1].  A  simi¬ 
lar  argument  holds  for  E(T2).  O 


Using  this  lemma  we  can  prove  an  almost  tight  upper 
bound  on  b{n).  A  basic  ingredient  in  the  proof  are  effi¬ 
cient  hierarchical  cuttings  [6,  21],  which  we  define  next. 
Let  H  he  a  set  of  n  hyperplanes  in  IR*^.  A  (l/r)-cutting 
for  H  IS  a  subdivision  of  IR*^  into  disjoint  simplices  such 
that  the  interior  of  each  simplex  is  intersected  by  at 
most  n/r  hyperplanes  in  H.  The  size  of  a  cutting  is 
the  number  of  simplices  it  consists  of.  We  say  that  a 
cutting  E'  C-refines  a  cutting  E  if  every  simplex  of  E'  is 
completely  contained  in  a  single  simplex  of  E  and  every 
simplex  of  E  contains  at  most  C  simplices  of  E'.  Now  let 
C,  p  be  constants  and  r  a  parameter  with  1  <  r  <  u.  A 
sequence  E  =  Eq,  Ei , . . . ,  E^,  of  cuttings  is  called  an  effi¬ 
cient  hierarchical  (\/r)- cutting  (forH)  ifEo  is  the  single 
“simplex”  IR*^,  every  E,-  (1  <  i  <  fc)  is  a  (l/p*)-cutting 
of  size  0{p'‘^)  which  C-refines  E;_i,  and  p^~^  <  r  <  p^. 
Notice  that  the  last  condition  implies  that  k  =  0(logr). 
We  call  the  simplex  in  E,_i  that  contains  a  certain  sim¬ 
plex  s  £  E,-  the  parent  of  s,  denoted  by  parent(s). 

Lemma  3.2  b(n)  —  log  n). 

Proof:  Project  the  triangles  of  Ti  and  T2  vertically 
onto  the  plane  h.  Construct  an  efficient  hierarchical 
(l/6n)-cutting  E  =  Eo,Ei,. .  .,Efc  for  the  fin  lines  con¬ 
taining  the  projected  triangle  edges.  For  a  simplex 
s  £  E,-,  let  Tf-{s)  c  T  he  the  set  of  triangles  in 
Ti  whose  projection  fully  contains  s  but  whose  pro¬ 
jection  does  not  contain  parent(s),  and  let  T^{s)  be 
the  set  of  triangles  whose  projection  intersects  s  but 
does  not  contain  it.  Let  Tffs)  Tff{s)  [jT^{s).  De¬ 
fine  Tf(s),  TA(s),  and  T2(s)  analogously.  Finally,  let 
TC(s)  :=  Tf{s)UTf{s),  TX(s)  :=  (s)  U  (s) 

and  T(s)  ;=  T’i(s)  U  T2{s).  In  the  remainder  we  only 
work  with  “clipped”  triangles,  that  is,  for  each  triangle 
t  £  T(s)  we  only  consider  the  part  that  projects  onto  s. 

Consider  a  visibility  between  intersection  edges  £ 
E{Ti)  and  62  £  E{T2).  Let  t{ei)  and  t'(ei)  be  the  two 
triangles  that  define  e^,  that  is,  e:  =  f(ei)nt'(ei).  Simi¬ 
larly,  let  62  =  t(e2)nt'(e2).  We  denote  the  projection  of 
e,-  onto  h  by  el.  The  basic  observation  behind  our  proof 
is  the  fact  that  there  must  be  a  simplex  s  in  some  cutting 
E,-  such  that  e7ne2  £  s,  t{ei),  t'{ei),  ^(02),  t'{e2)  £  T{s), 
and  at  least  one  of  these  triangles  is  in  T*~(s).  This  im¬ 
plies  that  we  only  have  to  count  for  each  simplex  s  £  E 
the  number  of  visibilities  where  at  least  one  of  the  in¬ 
volved  triangles  is  in  T*“(s).  (A  similar  observation  is 
often  made  when  one  uses  hereditary  segment  trees  [9]: 
only  long-long  and  long-short  intersections  need  to  be 
considered,  not  short-short  intersections.) 

So  let  us  count  the  number  of  such  visibilities  for  a 
given  simplex  s .  Let  Ug  :=  |T(s)|.  Consider  some  trian¬ 
gle  t  £  Ti{s).  We  shall  count  the  number  of  visibilities 
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involving  intersections  of  t  and  triangles  t'  G  Tf{s).  To 
this  end  we  consider  the  intersection  of  t  with  the  upper 
envelope  of  Tf{s)\  an  intersection  between  t  and  a  part 
of  some  t'  e  Tf  {s)  that  is  not  on  the  upper  envelope 
can  never  be  visible  from  above.  The  upper  envelope 
of  Tf{s)  is  on  the  boundary  of  a  convex  polyhedron, 
and  the  projection  of  the  intersection  of  t  with  the  up¬ 
per  envelope  of  Tf{s)  is  contained  in  a  convex  curve. 
Lemma  3.1  now  tells  us  that  the  total  number  of  vis¬ 
ibilities  involving  intersections  of  t  and  some  triangle 
t'  G  Tf(s)  is  0(ns2"(”‘)).  A  similar  argument  holds 
of  course  for  the  triangles  in  T2(s),  so  the  total  number 
of  visibilities  at  s  that  involve  at  least  one  triangle  in 
TC(s)  is  0(n22“("')). 

To  obtain  the  total  number  of  visibilities  we  have  to 
sum  over  all  simplices  s  in  the  hierarchical  cutting.  Each 
triangle  t  G  T{s)  has  an  edge  intersecting  parent(s). 
Hence,  for  a  simplex  s  G  H,'  we  have  <  n/p'~^.  Thus 
the  total  number  of  visibilities  can  be  bounded  as  fol¬ 
lows: 


0(n22<"(”>  logn) 


The  general  case 

Before  we  can  prove  a  bound  on  the  complexity  of  ver¬ 
tical  decompositions  which  is  sensitive  to  the  complex¬ 
ity  of  the  arrangement,  we  need  to  prove  the  following 
worst-case  bound. 

Lemma  3.3  The  complexity  of  the  vertical  decompo¬ 
sition  of  a  set  of  n  triangles  in  is  0{n^). 

Proof:  The  proof  is  basically  the  same  as  the  proof 
of  Theorem  3.2  below,  with  the  “intersection-sensitive” 
cuttings  of  Lemma  3.4  replaced  by  Chazelle’s  hierarchi¬ 
cal  cuttings. 

We  need  one  more  ingredient  before  we  can  prove  the 
main  theorem  of  this  section. 

Lemma  3.4  Let  T  be  a  set  of  n  triangles  in  with 
|yl(T)|  =  A',  and  let  r  be  a  parameter  with  1  <  r  <  n. 
There  exists  an  0{logr /r)-cutting  of  size  0{r'^a{r)  -\- 
Kr^ /n^)  for  T. 


simplex  will  be  intersected  by  0(nlogr/r)  triangles  in 
T  (with  high  probability).  Finally,  note  that  the  ex¬ 
pected  complexity  of  A{R)  is  0(r^  -|-  Kr^/n^).  □ 


Theorem  3.2  Let  T  be  a  set  of  n  triangles  in  with 
|«4{T)|  =  K.  Then,  for  any  constant  s  >  0,  the  complex¬ 
ity  of  the  vertical  decomposition  for  T  is  0{ri^'^^  -t-  K). 

Proof:  Define  f{n,  m)  to  be  the  maximum  number  of 
visibilities  between  edges  in  E{T)  over  all  sets  T  of  n 
triangles  where  m  is  the  number  of  triple  intersections 
of  the  triangles  in  T.  We  shall  prove  that,  for  any  £  >  0, 
/(n,m)  =  +  m). 

Fix  a  large  enough  constant  r  =  r(£).  We  distinguish 
two  cases. 

crise  (i):  m  >  n^/r 

Because  of  Lemma  3.3  and  r  is  a  constant  f{n,m)  = 
-b  m). 

case  (ii):  m  <  n^/r 

Because  m  <  n^/r.  Lemma  3.4  implies  that  there  exists 
a  (ci  logr/r)-cutting  H  for  T  of  size  C2r^a(r),  for  some 
constants  ci,C2.  Let  T(s)  C  T  denote  the  subset  of 
triangles  that  intersect  the  interior  of  a  simplex  s  G  5. 
There  are  two  types  of  visibilities  between  intersection 
edges  of  the  triangles  in  T:  visibilities  such  that  there  is 
a  set  T(s)  that  contains  all  four  triangles  involved,  and 
visibilities  where  there  is  no  such  set  T{s).  The  first  type 
of  visibilities  is  counted  recursively.  For  visibilities  of 
the  second  type  we  observe  that  the  segment  connecting 
the  two  intersection  edges  must  intersect  a  facet  of  some 
simplex  s  G  H.  By  Lemma  3.2  the  number  of  such 
visibilities  for  a  fixed  facet  is  at  most  C3n^2“^”Hogn, 
for  some  constant  C3. 

Define  n*  |T(s)|  and  define  m*  to  be  the  num¬ 
ber  of  triple  intersections  in  A{T{s)).  Notice  that 
Us  <  CiJilogr/r  and  there 

are  4c2r^a(r)  facets  of  simplices  in  E.  We  can  now  use 
induction  to  prove  that  there  is  a  constant  c  such  that 
/(n,  m)  <  +  crm\ 

f{n,m)  < 

<  4c2r^a(r)c3n^2“("Hogn  -b  /(ns,ms) 

seE 

<  4c2r^a(r)c3n^2"(")  log  n  -b  +  crm*] 

s€=. 

<4c2r2a(r)c3n22“(")logn  4-  ^  (ail^)2+e  + 

sen 

<  J^2-^e^4c2r^a(^)c32°<"Hogn  C2Q;(r)(ci  log  r)‘ j 


Proof:  Let  A  C  T  be  a  random  sample  of  size  r.  By  <  -b  crm. 
the  Slicing  Theorem  [2]  we  can  triangulate  A{R)  into 
0(r^a(r)-b  |A.(i?)i)  simplices.  By  £-net  theory  [19]  each 
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We  apply  a  similar  analysis  to  the  case  of  a  single 
cell.  A  single  cell  in  an  arrangement  of  n  triangles  can  be 
decomposed  into  0(n^  log  n)  simplices;  this  follows  from 
the  Slicing  Theorem  and  a  bound  on  the  complexity  of 
a  single  cell  [3] .  In  the  inductive  analysis  above  we  now 
have  to  recurse  only  on  the  0{r^  logr)  simplices  of  the 
single  cell  in  the  arrangement  of  a  sample  of  size  r,  where 
in  each  subproblem  we  still  deal  with  a  single  cell.  We 
get  the  following  result. 

Corollary  3.1  For  any  constant  e  >  0,  the  maximum 
combinatorial  complexity  of  the  vertical  decomposition 
of  a  single  cell  in  an  arrangement  of  n  triangles  in  3- 
space  is 

4  The  Algorithm 

Let  T  =  {ti, . . . ,  tn}  be  a  collection  of  triangles  as  de¬ 
fined  in  Section  2.  To  simplify  the  description  of  our 
algorithm,  we  will  assume  that  the  entire  collection  T  of 
triangles  is  contained  inside  a  hounding  simplex,  whose 
faces  are  special  triangles  in  T.  They  are  special  in  the 
sense  that  they  violate  the  general  position  assumption. 
Also,  unlike  all  the  other  triangles  in  T,  we  will  be  in¬ 
terested  in  only  one  side  of  each  of  these  four  triangles 
(the  side  that  faces  the  interior  of  the  simplex). 

As  mentioned  in  the  introduction,  we  will  consider  a 
decomposition  carried  out  in  three  steps:  first  we  extend 
a  vertical  wall  from  every  boundary  edge  of  any  trian¬ 
gle  in  T,  thus  we  obtain  Vi(T);  in  addition,  we  extend 
a  vertical  wall  from  every  intersection  edge  of  two  tri¬ 
angles  in  T,  and  we  get  V2(T);  and  finally  we  refine  the 
subcells  of  V2{T)  into  constant  size  subcells,  to  produce 
V3(r). 

The  data  structure  that  we  obtain  after  carrying  out 
the  entire  algorithm  provides  a  comprehensive  and  con¬ 
venient  representation  of  the  arrangement:  each  sub¬ 
cell  in  this  representation  has  “constant  description 
complexity” — it  is  bounded  by  up  to  six  planar  walls, 
which  are  quadrilaterals — it  is  homcomorphic  to  a  ball, 
and  the  structure  provides  connectivity  information  be¬ 
tween  adjacent  cells  across  vertical  walls  (we  discuss  the 
connectivity  issue  in  detail  in  the  full  version  of  the  pa¬ 
per)  .  A  significant  advantage  of  the  one-step  decomposi¬ 
tion  Vi(T)  is  that  in  return  for  a  relatively  low  overhead 
it  already  captures  the  3D  structure  of  the  arrangement 
and  it  makes  the  next  steps  of  the  algorithm  fairly  sim¬ 
ple. 

For  the  most  part,  our  algorithm  solves  two- 
dimensional  subproblems  and  it  mainly  uses  structures 
describing  two-dimensional  arrangements  of  segments. 
We  will  represent  two-dimensional  subdivisions  using 
the  quad-edge  structure  [16],  and  augment  it  with  ad¬ 
ditional  three-dimensional  data  as  necessary. 


Computing  the  decomposition  Vi(r) 

As  before,  we  denote  the  complexity  of  the  arrangement 
A(fr)  by  K.  We  start  with  computing  the  one-step  de¬ 
composition  of  the  arrangement.  To  transform  A{T) 
into  Vi{T)  wc  have  to  add  the  vertical  wall  W{e,T)  for 
each  boundary  edge  e  of  a  triangle  in  T.  In  Observa¬ 
tion  3.1  we  have  already  seen  that  the  complexity  of 
all  these  walls  is  0{n^a{n)).  Recall  that  the  proof  of 
this  observation  was  based  on  the  fact  that  each  wall 
W (e,  T)  is  bounded  from  above  (below)  by  the  lower 
(upper)  envelope  of  the  intersections  of  other  triangles 
with  the  vertical  fiat  surface  H{e)]  see  Figure  1(b).  This 
fact  also  implies  that  we  can  compute  a  single  wall  in 
O(nlogn)  time  [20],  leading  to  the  following  lemma. 

Lemma  4.1  The  set  of  walls  W{e,T)  for  all  bound¬ 
ary  edges  e  of  the  triangles  in  T  can  be  computed  in 
0(n^  log  n)  time. 

For  each  triangle  boundary  edge  e  we  split  the  por¬ 
tion  between  e  and  the  upper  boundary  of  W(e,T) 
by  exhibiting  extra  vertical  edges  emanating  from  each 
breakpoint  of  the  upper  boundary  until  they  hit  e  (some 
of  these  edges  are  degenerate,  since  the  corresponding 
breakpoint  lies  on  e).  We  repeat  the  analogous  proce¬ 
dure  for  the  lower  boundary,  and  represent  the  collec¬ 
tion  of  vertices,  edges  and  faces  on  W{e,T)  by  a  quad- 
edge  structure.  The  structure  size  is  proportional  to  the 
complexity  of  W (e,  T);  it  can  be  computed  in  time  that 
is  proportional  to  its  size. 

Next,  we  define  a  pair  of  two-dimensional  arrange¬ 
ments  of  segments  for  each  triangle  t,-.  We  consider 
each  triangle  to  be  two-sided,  and  let  t~  denote  the  side 
of  ti  facing  downward  and  ff  be  the  side  of  t,  facing 
upward.  We  will  use  the  notation  t*  to  refer  to  either 
side  of  ti.  For  the  bounding  simplex,  we  need  only  one 
side  of  each  triangle — the  side  facing  the  internal  part  of 
the  simplex.  The  arrangement  on  tf  is  defined  by  a  set 
r(tL)  consisting  of  two  types  of  segments:  intersections 
of  ti  with  other  triangles  of  T  and  boundary  pieces  of 
some  W{e,T)  that  lie  on  tf .  The  second  type  of  seg¬ 
ments  in  r{tf )  are,  in  other  words,  the  contribution  of  ti 
to  the  upper  envelopes  which  bound  some  wall  W(e,  T) 
from  below.  Similarly,  the  arrangement  on  tf  is  defined 
by  the  set  of  segments  r(t[“)  consisting  of  intersections 
of  ti  with  other  triangles  of  T,  and  boundary  pieces  of 
some  W(e,T)  that  lie  on  tf .  For  each  segment  in  r(t*) 
we  attach  the  label  denoting  its  origin:  either  the  la¬ 
bel  of  the  other  triangle  intersecting  t,-  or  the  edge  e  on 
whose  envelope  it  appears.  Let  n*  =  |r(t*)|.  We  will 
use  the  abbreviation  A*  to  denote  the  two-dimensional 
arrangement  .A(r(t*)). 

The  next  step  of  the  algorithm  is  to  compute,  for  each 
side  of  each  triangle  ti  G  T,  the  arrangement  induced 
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by  r(i*).  To  this  end  we  use  a  standard  plane  sweep  al¬ 
gorithm  that  detects  all  the  intersection  points  between 
segments^  [24] .  (At  this  point  one  may  use  the  optimal 
algorithm  by  Chazelle  and  Edelsbrunner  [7],  However, 
due  to  other  steps  of  the  algorithm,  this  will  not  make  a 
difference  in  the  overall  running  time  of  the  algorithm. 
Also,  the  sweep  paradigm  is  more  appropriate  for  the 
generalization  that  we  propose  in  the  sequel.) 

Lemma  4.2  After  having  computed  all  walls  of  trian¬ 
gle  boundary  edges,  all  the  two-dimensional  arrange¬ 
ments  A*  can  be  computed  in  time  0(|Vi(T)|  logn). 

Again,  we  represent  each  two-dimensional  arrange¬ 
ment  by  a  quad-edge  structure. 

It  is  easy  to  verify  that  after  the  first  step  of  the  de¬ 
composition,  the  entire  three-dimensional  arrangement 
is  connected,  that  is,  there  are  no  “floating”  parts.  In 
particular,  the  two-dimensional  boundary  of  each  three- 
dimensional  cell  is  connected.  The  shape  of  each  3D  cell 
may  still  be  rather  convoluted;  it  need  not  even  be  ary- 
monotone.  Therefore,  instead  of  handling  each  3D  cell 
at  a  time  we  carry  out  a  space  sweep  over  the  entire 
decomposition  Vi(T).  (A  similar  approach  has  recently 
been  used  to  obtain  a  decomposition  of  certain  arrange¬ 
ments  related  to  a  motion  planning  problem  [17];  see 
also  [14,  25]  for  dynamic  maintenance  of  a  monotone 
subdivision  in  a  space-sweep.) 

Let  Pyi  denote  the  plane  y  =  yi .  Let  Ay,  =  A{Py,  H 
Vi  (T))  be  the  two-dimensional  arrangement  of  segments 
induced  on  the  plane  Py,^  by  intersecting  it  with®  Vi(T). 
We  use  Ay  to  denote  this  arrangement  for  an  arbitrary 
y-value.  We  will  be  using  the  following  property  of  the 
one-step  decomposition  (whose  simple  proof  is  omitted 
here). 

Lemma  4.3  Every  face  in  Ay  is  x-monotone,  that  is, 
every  z-vertical  line  intersects  any  face  of  Ay  in  at  most 
one  connected  component. 

Transforming  Vi(T)  into  V2(T) 

We  now  proceed  to  describe  the  second  step  of  the  algo¬ 
rithm.  The  major  difficulty  in  transforming  Vi(r)  into 
V2(T)  is  to  efficiently  detect  the  vertical  visibilities  of 
pairs  of  intersection  edges,  one  on  the  ceiling  of  a  cell 

^From  this  point  on,  when  discussing  two-dimensional  arrange¬ 
ments,  we  will  distinguish  between  a  segment  and  an  edge.  A 
segment  is  a  maximal  portion  of  a  line  that  appears  in  the  ar¬ 
rangement,  possibly  intersected  by  other  segments.  An  edge  is 
a  maximal  portion  of  a  segment  not  intersected  by  any  other 
segment. 

®With  a  slight  abuse  of  notation,  we  use  Vi(T)  to  denote  the 
subdivision  of  3-space  after  the  one-step  decomposition,  and  also 
to  denote  the  collection  of  triangles  and  walls  that  induce  this 
subdivision. 


Figure  2:  Examples  of  changes  to  Ay  when  sweeping 
over  a  vertex  of  Vi(T). 


of  Vi(T)  and  the  other  on  the  floor  of  that  cell.  To  this 
end  we  perform  a  space  sweep  with  a  plane  Py  parallel 
to  the  x2-plane  over  Vi(T),  from  y  =  — oo  to  y  —  -boo. 
Throughout  the  sweep  we  maintain  dynamic  data  struc¬ 
tures  that  describe  Ay.  (Below  we  will  indicate  which 
data  structures  we  need.)  Roughly,  our  goal  is  to  up¬ 
date  each  face  of  any  arrangement  Af  with  all  the  edges 
of  ceiling  faces  that  are  visible  when  looking  vertically 
upward  from  that  face,  and  similarly,  to  update  each 
face  of  any  arrangement  A~  with  all  the  edges  of  floor 
faces  that  are  visible  when  looking  vertically  downward 
from  that  face. 

The  arrangement  Ay  changes  continuously  as  we 
sweep  the  plane  Py.  At  a  finite  number  of  “events”, 
however,  the  combinatorial  structure  of  Ay  changes. 
It  is  not  difficult  to  see  that  these  events  are  exactly 
the  vertices  of  Vi(T).  At  such  an  event  the  follow¬ 
ing  changes  to  Ay  can  occur:  (i)  a  vertex  of  Ay  may 
(dis)appear;  (ii)  an  edge  of  Ay  may  (dis)appear;  and 
(iii)  a  face  of  Ay  may  (dis)appear.  In  fact,  several  such 
changes  occur  simultaneously  at  each  event.  Two  ex¬ 
amples  are  given  in  Figure  2.  By  the  general  position 
assumption,  each  event  involves  only  a  constant  number 
of  changes  to  a  constant  number  of  faces  in  Ay. 

Since  our  goal  is  to  detect  vertical  visibilities  between 
two  (intersection)  edges  in  Vi(T),  we  wish  to  detect 
when  there  is  a  vertical  segment  connecting  two  inter¬ 
section  edges  that  does  not  intersect  any  triangle  in  T. 
Consider  the  sweep  plane  Py  when  it  contains  such  a 
vertical  segment  connecting  intersection  edges  ej  and 
62.  Then  the  intersection  of  ei  and  62  with  Py  must 
define  two  vertices  of  Ay  which  are  on  the  boundary  of 
the  same  face  and  lie  on  a  common  vertical  line.  Thus 
we  define  a  new  type  of  event  in  our  space  sweep,  called 
a  vertical  event,  which  occurs  when  two  vertices  of  the 
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same  face  become  aligned  along  a  vertical  line. 

We  maintain  all  the  events  in  a  priority  queue  Q, 
ordered  by  increasing  j/-coordinate.  The  operations  we 
will  perform  on  Q  are  insert  an  event,  delete  an  event, 
and  fetch  the  next  event,  that  is,  fetch  the  event  with 
minimum  y-value  (we  delete  each  event  after  it  has  been 
handled).  We  also  need  to  perform  membership  check 
of  an  event  in  the  queue,  to  avoid  inserting  the  same 
event  several  times.  Such  a  queue  can  be  implemented 
so  that  the  time  for  each  operation  is  O(logm),  where  m 
is  the  maximum  number  of  events  held  simultaneously 
in  the  queue  [24].  Observe  that  we  can  insert  all  the 
events  where  the  structure  of  Ay  changes — the  vertices 
of  Vi(T) — into  Q  before  we  start  the  sweep.  To  each 
event  that  we  insert  into  the  queue,  we  attach  some 
local  geometric  and  combinatorial  information  relevant 
to  that  event.  The  vertical  events,  however,  have  to 
be  computed  on  the  fly.  Next  we  describe  the  data 
structures  which  we  need  to  represent  Ay  in  order  to  be 
able  to  compute  the  vertical  events. 

Let  /  =  f{y)  be  a  face  of  Ay.  Recall  that  /  is  x- 
monotone.  We  split  the  vertices  on  the  boundary  of  / 
into  two  x-monotone  chains:  the  lower  chain  L{f),  and 
the  upper  chain  U{f).  The  vertices  in  each  chain  arc 
ordered  by  increasing  a;-coordinate,  and  we  implement 
each  chain  as  a  balanced  binary  tree.  Whenever  a  ver¬ 
tex  on  the  boundary  of  /  disappears  or  newly  appears, 
we  update  the  relevant  chain  by  deletion  or  insertion  re¬ 
spectively.  When  an  existing  face  disappears,  we  “free” 
its  attached  chains.  When  a  face  newly  appears,  we 
allocate  a  pair  of  new  structures  for  its  lower  and  up¬ 
per  chains.  When  a  face  /  is  split  into  two  faces  (for 
instance  when  an  endpoint  of  a  segment  penetrates  the 
face)  we  create  new  pairs  of  chains  for  these  faces.  These 
are  easily  constructed  by  splitting  the  chains  of  /  and 
adding  a  small  number  of  new  vertices.  Similarly,  when 
two  faces  merge  into  one,  we  create  the  two  chains  of 
the  new  face  by  join  operations  on  the  chains  of  the 
merged  faces  plus  possibly  a  few  deletions  of  vertices. 
All  these  operations  (insertion,  deletion,  split  and  join) 
can  be  carried  out  in  0(log  n)  time  each,  using  red-black 
trees  [15],  [27,  Chapter  4]. 

To  detect  vertical  visibilities  between  vertices  on  the 
boundary  of  /  we  proceed  as  follows.  Whenever  a  new 
vertex  v  is  created  on  the  lower  chain  of  /  (the  opera¬ 
tions  for  a  new  vertex  on  the  upper  chain  are  symmet¬ 
ric),  we  look  for  its  “neighbors”  in  the  upper  chain  of  /, 
that  is,  we  look  for  the  two  vertices  of  U{f)  whose  pro¬ 
jection  onto  the  a;-axis  lie  nearest  to  the  projection  of  v 
onto  the  a;-axis.  Let  uj  and  U2  be  the  two  neighbors  of 
V  on  the  upper  chain.  Each  vertex  w  on  the  boundary 
of  /  is  the  cross-section  of  Py  with  an  edge  e{w)  of  some 
A*.  (It  is  important  here  that  e{w)  is  an  edge  of  A*, 


not  a  segment  on  t*.  See  footnote  4  for  the  difference.) 
Since  we  have  attached  to  each  vertex  some  additional 
information,  we  can  compare  e{v)  with  each  of  e(u2) 
and  e{u2)  to  see  if  their  projections  onto  the  xy-plane 
intersect.  If  they  intersect,  we  have  detected  a  poten¬ 
tial  vertical  visibility  between  two  intersection  edges;  we 
add  this  event  to  Q,  with  the  y-coordinate  of  the  inter¬ 
section  point.  We  say  potential  because  when  we  will 
come  to  handle  this  event,  we  may  find  that  this  “verti¬ 
cal  visibility”  is  obscured  by  another  triangle  (or  trian¬ 
gles)  in  a  manner  that  we  were  unable  to  predict  when 
the  event  was  inserted  into  the  queue.  Therefore,  we 
distinguish  between  two  types  of  events:  actual  events, 
which  are  either  the  events  corresponding  to  features  of 
Vi(T),  or  additional  events  that  indeed  correspond  to  a 
vertical  visibility,  and  false  events,  which  are  potential 
vertical  visibilities  that  are  found  out  to  be  obscured 
when  handled. 

When  we  handle  an  actual  event  q  which  corresponds 
to  vertical  visibility  of  a  pair  of  vertices  u  and  v  on  the 
lower  and  upper  chains  respectively,  each  of  u  and  v  now 
have  new  neighbors  on  the  reciprocal  chain,  so  we  check 
these  new  neighbors  for  potential  additional  events  as 
above. 

The  next  lemma  proves  the  correctness  of  our  ap¬ 
proach. 

Lemma  4.4  The  set  of  events  that  is  computed  during 
the  sweep  contains  the  set  of  visibilities  between  pairs 
of  intersection  edges.  The  false  events  can  be  efficiently 
distinguished  from  the  actual  events.  Each  false  event 
can  be  charged  to  an  actual  event,  and  no  actual  event 
gets  charged  more  than  a  constant  number  of  times  this 
way. 

Proof:  First  we  claim  that  no  vertical  visibility  is 

missed  by  our  algorithm.  The  reason  is  that  before  a 
pair  of  edges  of  Vi{T)  become  vertically  visible,  their 
corresponding  cross-sections  must  become  neighbors  (in 
the  sense  defined  above)  in  L{f)  and  U{f)  for  some  face 
/  of  Ay.  This  is  similar  to  a  basic  argument  in  the 
Bentley-Ottmann  algorithm  for  detecting  intersections 
of  line  segments  [4],  [24,  Section  7.2]. 

Once  we  handle  a  vertical  visibility  event  it  is  easy 
to  determine  whether  it  is  actual  or  false  by  checking 
whether  the  lower  and  upper  chains  involved  in  this 
event  belong  to  the  same  face.  The  event  is  actual  if 
and  only  if  both  chains  belong  to  the  same  face.  This 
could  be  done,  for  example,  by  maintaining  cross  point¬ 
ers  between  the  roots  of  the  trees  describing  the  chains. 

We  charge  each  false  event  q'  to  the  actual  event  q 
that  has  “spawned”  it.  Every  vertical  visibility  event 
is  added  to  the  queue  Q  only  at  actual  events  (a  false 
event  does  not  create  new  events).  Since  no  event  cre¬ 
ates  more  than  a  constant  number  of  additional  events, 
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no  actual  event  will  be  charged  more  than  a  constant 
number  of  times  for  false  events. 

□ 

To  represent  the  decomposition  V2(T),  we  augment 
the  two-dimensional  arrangements  A*  with  information 
on  all  the  edges  that  are  vertically  visible  from  A*. 
More  precisely,  we  add  to  Af  all  the  edges  that  are 
visible  when  looking  vertically  upward  from  tf.  Simi¬ 
larly,  we  add  to  A~  all  the  edges  that  are  visible  when 
looking  vertically  downward  from  .  We  also  record 
on  these  arrangements  the  intersection  of  these  edges 
with  the  original  edges  of  the  corresponding  A*.  Note 
that  these  intersection  points  correspond  exactly  to  the 
events  computed  during  the  space  sweep. 

Lemma  4.5  The  space  sweep  algorithm  described 
above  for  transforming  Vi(T)  into  V2(T)  runs  in  time 
0{Vlogn),  where  V  =  |V2(T)|. 

Proof:  We  have  seen  that  every  event  can  be  handled 
with  a  constant  number  of  operations  on  some  chains 
U(f)  and  L(/),  and  on  Q.  Thus  every  event  takes 
O(logn)  time.  It  remains  to  observe  that  every  event  is 
either  a  feature  of  V2(T),  or  it  can  be  charged  to  a  fea¬ 
ture  of  V2(T)  such  that  no  feature  of  V2(T)  gets  charged 
more  than  a  constant  number  of  times  (Lemma  4.4).  O 


We  denote  the  arrangement  A*  after  updating  it  in 
the  above  process  by  B*.  Every  face  on  every  B*  rep¬ 
resents  a  vertical  cylindrical  cell  which  has  the  same 
xy-projection  as  that  face,  and  has  a  unique  triangle 
bounding  it  on  the  top  and  a  unique  triangle  bounding 
it  on  the  bottom. 

Transforming  V2{T)  into  V3(r) 

The  final  step  of  our  algorithm  is  to  refine  each  B*  fur¬ 
ther  to  obtain  the  full  vertical  decomposition,  in  the 
following  standard  manner.  Consider  the  projection  of 
B*  onto  the  xy-plane.  We  extend  a  y-vertical  segment 
from  each  vertex  of  the  projected  arrangement  upward 
and  downward  until  it  reaches  another  segment,  or  the 
boundary  of  the  projection  of  t,-.  By  projecting  the  seg¬ 
ments  we  have  added  back  to  t*,  we  obtain  a  refine¬ 
ment  of  B*  into  trapezoids.  Finally,  we  extend  each  of 
these  newly  added  segments  into  vertical  walls  inside 
the  respective  three-dimensional  cells  to  obtain  V^iT). 
Adding  the  j/- vertical  extensions  to  all  the  arrangements 
B*  can  be  carried  out  by  a  plane  sweep  of  each  of  the 
arrangements  B*.  This  last  step  of  the  decomposition, 
where  we  transform  V2(T')  into  V3(T),  does  not  increase 


the  asymptotic  complexity  of  the  decomposition  or  the 
algorithm. 

For  lack  of  space,  several  details  and  proofs  have  been 
omitted  throughout — they  are  discussed  in  the  full  ver¬ 
sion  of  the  paper.  The  main  algorithmic  result  of  the 
paper  is  summarized  in  the  following  theorem: 

Theorem  4.1  Given  a  collection  T  of  n  triangles  in 
general  position  in  three-dimensional  space,  the  time 
needed  to  compute  the  full  vertical  decomposition  of 
the  arrangement  A{T)  is  0{n^  log  n-\-V  log  n),  where  V 
is  the  combinatorial  complexity  of  the  vertical  decom¬ 
position. 

Remark  It  is,  in  fact,  possible  to  compute  vertical 
decompositions  with  a  somewhat  lower  overhead  term 
than  the  algorithm  above  achieves,  using  rather  involved 
multi-level  data  structures.  The  alternative  approach  is 
described  in  the  full  version  of  the  paper. 

5  Concluding  Remarks 

We  have  proved  bounds  on  the  maximum  combinatorial 
complexity  of  the  vertical  decomposition  of  sets  of  trian¬ 
gles  in  3-space  that  are  sensitive  to  the  complexity  of  the 
arrangement  of  triangles.  We  have  also  given  a  simple 
deterministic  output-sensitive  algorithm  for  computing 
the  vertical  decomposition. 

While  there  are  various  technical  details  that  still 
need  to  be  verified,  we  believe  that  our  algorithm  will 
work  with  roughly  the  same  time  bound  for  the  case  of 
low-degree  algebraic  surface  patches  in  3-space.  We  are 
currently  studying  the  full  details  of  this  extension.  The 
generalization  of  the  combinatorial  result  seems  more 
difficult  and  currently  we  do  not  see  how  to  adapt  our 
proof  to  this  case. 

Another  open  problem  is  to  tighten  the  small  gap  that 
remains  between  the  upper  and  lower  bounds  for  verti¬ 
cal  decompositions.  We  believe  that  the  n^-factor  in  the 
upper  bound  is  only  an  artifact  of  our  proof  technique 
and  that  the  lower  bound  is  closer  to  the  truth  than  the 
upper  bound. 
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Almost  Tight  Upper  Bounds  for  the  Single  Cell  and  Zone 

Problems  in  Three  Dimensions* 


Dan  Halperin^ 


Abstract 

We  consider  the  problem  of  bounding  the  combinato¬ 
rial  complexity  of  a  single  cell  in  an  arrangement  of 
n  low-degree  algebraic  surface  patches  in  3-space.  We 
show  that  this  complexity  is  for  any  e  >  0, 

where  the  constant  of  proportionality  depends  on  e  and 
on  the  maximum  degree  of  the  given  surfaces  and  of 
their  boundaries.  This  extends  several  previous  results, 
almost  settles  a  7-year-old  open  problem,  and  has  ap¬ 
plications  to  motion  planning  of  general  robot  systems 
with  three  degrees  of  freedom.  As  a  corollary  of  the 
above  result,  we  show  that  the  overall  complexity  of 
all  the  three-dimensional  cells  of  an  arrangement  of  n 
low-degree  algebraic  surface  patches,  intersected  by  an 
additional  low-degree  algebraic  surface  patch  a  (the  so- 
called  zone  of  <j  in  the  arrangement)  is  0(n^+®),  for  any 
£■  >  0,  where  the  constant  of  proportionality  depends  on 
e  and  on  the  maximum  degree  of  the  given  surfaces  and 
of  their  boundaries. 

1  Introduction 

Let  S  =  {(Ji, , . .  ,cr„}  be  a  given  collection  of  n  low- 
degree  algebraic  surface  patches  in  3-space  (see  below 
for  a  more  precise  statement  of  the  properties  that  these 
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surfaces  are  assumed  to  satisfy).  We  denote  by  yl(S)  the 
arrangement  of  S,  i.e.,  the  decomposition  of  3-space  into 
(relatively  open)  cells  of  various  dimensions,  each  being 
a  maximal  connected  set  contained  in  the  intersection 
of  a  fixed  subcollection  of  S  and  not  meeting  any  other 
surface;  we  will  denote  j-dimensional  cells  of  A(S),  for 
j  =  0,1,2,  as  vertices,  edges,  and  faces,  respectively, 
and  the  unquantified  term  cell  will  be  used  to  denote 
3-dimensional  cells  of  A(S).  The  combinatorial  com¬ 
plexity  of  a  cell  C  is  the  number  of  lower-'dimensional 
cells  appearing  on  its  boundary.  The  problem  studied 
in  this  paper  is  to  obtain  a  sharp  upper  bound  on  the 
combinatorial  complexity  of  a  single  cell  in  such  an  ar¬ 
rangement. 

One  of  the  main  motivations  for  studying  this  prob¬ 
lem  is  its  applications  to  robot  motion  planning.  Let  B 
be  an  arbitrary  robot  system  with  3  degrees  of  freedom, 
moving  in  some  environment  V  filled  with  obstacles. 
Any  placement  of  B  can  be  represented  by  a  point  in 
3-space,  whose  coordinates  are  the  3  parameters  con¬ 
trolling  the  degrees  of  freedom  of  B;  this  space  is  called 
the  configuration  space  of  B.  We  want  to  compute  the 
free  portion  of  this  space,  denoted  as  FP,  and  consist¬ 
ing  of  those  placements  of  B  at  which  it  does  not  meet 
any  obstacle.  We  note  that  the  boundary  of  FP  con¬ 
sists  of  placements  at  which  B  makes  contact  with  some 
obstacles,  but  does  not  penetrate  into  any  of  them.  Un¬ 
der  reasonable  assumptions  concerning  B  and  V,  we 
can  represent  the  subset  of  ‘contact  placements’  of  B 
(including  those  placements  at  which  B  makes  contact 
with  an  obstacle  but  may  also  penetrate  into  other  ob¬ 
stacles)  as  the  union  of  a  collection  of  a  finite  number 
of  surface  patches,  all  algebraic  of  constant  maximum 
degree  (and  whose  relative  boundaries  are  also  algebraic 
of  constant  maximum  degree). 

For  example,  if  B  is  an  arbitrary  polygonal  object 
with  k  sides,  and  V  is  an  open  planar  polygonal  re¬ 
gion  bounded  by  m  edges,  the  configuration  space  of 
B  is  a  3-dimensional  space,  each  point  of  which  repre¬ 
sents  a  possible  placement  of  B  by  the  parametrization 
(a;,j/,  tan|),  where  {x,y)  are  the  coordinates  of  some 
fixed  reference  point  on  B,  and  6  is  the  orientation  of 
B.  In  this  case,  each  ‘contact  surface’  is  either  the  lo¬ 
cus  of  all  placements  of  B  at  which  some  specific  corner 
of  B  touches  some  specific  edge  of  V,  or  the  locus  of 
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placements  at  which  some  side  of  B  touches  some  ver¬ 
tex  of  V.  Each  of  the  resulting  0{km)  contact  surfaces 
is  a  2-dimensional  algebraic  surface  patch  of  degree  at 
most  4,  and  its  relative  boundary  consists  of  a  constant 
number  of  algebraic  arcs,  of  constant  maximum  degree 
as  well. 

If  B  is  placed  at  a  free  placement  Z  and  moves  con¬ 
tinuously  from  Z,  then  it  remains  free  as  long  as  the 
corresponding  path  traced  in  configuration  space  does 
not  hit  any  contact  surface.  Moreover,  once  this  path 
crosses  a  contact  surface,  B  becomes  non-free  (assum¬ 
ing,  as  is  customary,  general  position  of  B  and  F).  It 
follows  that  the  connected  component  of  FP  that  con¬ 
tains  Z  is  the  cell  that  contains  Z  in  the  arrangement 
^(S)  of  the  contact  surfaces.  (The  entire  FP  is  the 
union  of  a  collection  of  certain  cells  in  this  arrange¬ 
ment.)  Hence,  bounding  the  combinatorial  complexity 
of  a  single  cell  in  such  an  arrangement  is  a  major  prob¬ 
lem  one  has  to  tackle,  prior  to  the  design  of  efficient 
algorithms  for  computing  such  a  cell. 

Here  is  a  brief  history  of  the  single-cell  problem.  In 
two  dimensions,  it  has  been  shown  in  [14]  that  the  com¬ 
plexity  of  a  single  face  in  an  arrangement  of  n  Jordan 
arcs,  each  pair  of  which  intersect  in  at  most  some  con¬ 
stant  number,  s,  of  points,  is  0{Xs+2(n)),  where  Xq(m) 
is  the  maximum  length  of  Davenport-Schinzel  sequences 
of  order  q  composed  of  m  symbols,  and  is  nearly  linear 
in  m  for  any  fixed  q  (see  [1,  18]  for  more  details).  Thus 
the  maximum  complexity  of  a  single  face  is  nearly  linear 
in  the  number  of  arcs  (for  any  fixed  s),  as  opposed  to  the 
complexity  of  the  entire  arrangement  of  the  arcs,  which 
can  be  quadratic  in  the  worst  case.  Efficient  algorithms 
for  computing  a  single  face  in  a  2-D  arrangement  are 
given  in  [14]  and  in  [8]. 

In  higher  dimensions,  a  prevailing  conjecture  (see, 
e.g.,  [22])  is  that  the  complexity  of  a  single  cell  in  an 
arrangement  .4(S)  as  above  is  at  most  close  to 
w'hich  is  again  roughly  one  order  of  magnitude  smaller 
than  the  maximum  complexity  of  the  entire  arrange¬ 
ment,  W'hich  can  be  0(n‘^)  (see  [23]).  A  stronger  version 
of  the  conjecture  asserts  that  the  maximum  complexity 
of  a  single  cell  in  such  an  arrangement  is  0(n^~^Xs(n)), 
w'here  s  is  some  constant  that  depends  on  the  maximum 
degree  of  the  given  surfaces  and  of  their  boundaries. 

These  conjectures  have  been  proved  only  for  a  few 
special  cases  of  arrangements.  They  are  largely  open  in 
the  general  case  stated  above.  In  fact,  no  bounds  better 
than  0(n‘‘)  are  know'n  for  the  general  case,  even  in  three 
dimensions.  The  special  cases  for  which  better  bounds 
are  known  include  the  case  of  hyperplanes,  w'here  the 
complexity  of  a  single  cell,  being  a  convex  polytope 
bounded  by  at  most  n  hyperplanes,  is  0(nl-‘*/^J)  (by 
the  Upper  Bound  Theorem  [21]),  the  case  of  spheres, 
W'here  an  bound  is  easy  to  obtain  by  lifting 

the  spheres  into  hyperplanes  in  ((i-M)-space  [11,  24],  the 


case  of  (d—  l)-simplices,  where  an  logn)  bound 

has  been  established  in  [5],  and  several  special  cases  in 
three  dimensions  that  arise  in  motion  planning  for  vari¬ 
ous  specific  robot  systems  B  with  three  degrees  of  free¬ 
dom,  including  the  case  of  a  moving  polygon  mentioned 
above,  w'here  an  0(n^+^)  bound  is  proved  in  [17],  some 
restricted  cases  of  that  problem,  where  slightly  more 
improved  bounds  are  obtained  [15],  and  a  few  other  sys¬ 
tems  (see  [15]). 

The  single  cell  problem  is  a  generalization  of  the  re¬ 
lated  problem  of  bounding  the  complexity  of  the  lower 
envelope  of  S,  i.e.,  the  portion  of  the  union  of  the  sur¬ 
faces  of  E,  consisting  of  those  points  w  for  which  no 
surface  of  E  passes  below  w.  This  problem,  also  rather 
difficult,  is  nevertheless  easier  to  analyze,  and  recent  re¬ 
sults  [16,  26]  show'  that  the  combinatorial  complexity 
of  such  an  envelope  is  0(n‘^“^‘*‘®),  for  any  e  >  0,  w'here 
the  constant  of  proportionality  depends  on  e,  d,  and  the 
maximum  algebraic  degree  of  the  given  surfaces  and  of 
their  relative  boundaries. 

In  this  paper  we  derive  an  improved  upper  bound  for 
the  complexity  of  a  single  cell  in  an  arrangement  yl(E) 
of  algebraic  surfaces  in  3-space,  as  above.  This  bound, 
in  3  dimensions,  is  the  same  as  the  bound  for  lower 
envelopes  just  mentioned;  that  is,  it  is  0(n^+®),  for  any 
e  >  0,  where  the  constant  of  proportionality  depends, 
as  above,  on  e  and  on  the  maximum  algebraic  degree 
of  the  given  surfaces  and  of  their  relative  boundaries. 
This  almost  establishes  the  conjecture  mentioned  above 
in  three  dimensions. 

Our  analysis  adapts  the  proof  technique  of  [17],  which 
in  turn  is  based  on  the  analysis  technique  of  [16,  26]  for 
the  case  of  lower  envelopes.  The  lesson  one  can  learn 
from  the  analysis  in  [17]  is  that  in  the  case  of  a  single 
cell  one  needs  the  following  two  preliminary  results  to 
‘bootstrap’  the  recurrences  appearing  in  the  analysis: 

(a)  a  sharp  bound  on  the  number  of  ‘x-extreme’  ver¬ 
tices  of  the  cell  C  (vertices  w'hose  x  coordinate  is 
smallest  or  largest  in  a  small  neighborhood  of  the 
vertex  within  the  closure  of  C),  and 

(b)  a  sharp  bound  on  the  number  of  vertices  bounding 
‘popular’  faces  of  C  (faces  that  are  adjacent  to  C 
on  both  ‘sides’;  see  [3,  5,  17]  and  below). 

Bounds  on  these  quantities  were  obtained  in  [17]  us¬ 
ing  special  properties  of  the  surfaces  that  arise  in  the 
case  studied  there.  A  main  technical  contribution  of 
the  present  paper  is  a  derivation  of  such  bounds  in  the 
general  algebraic  setting  assumed  above.  The  bound 
(a)  is  obtained  using  considerations  which  are  related 
to  Morse  theory  (see  e.g.  [20]),  but  are  simpler  to  derive 
in  3  dimensions.  The  bound  (b)  is  obtained  by  applying 
the  new'  probabilistic  technique  of  [16,  17,  26]  to  count¬ 
ing  only  the  vertices  of  popular  faces  (this  idea  is  in  the 
spirit  of  the  methodology  used  in  [3,  5]).  Once  these 
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two  bounds  are  available,  the  rest  of  the  proof  is  rather 
similar  to  those  used  in  [16,  17,  26],  although  certain 
non-trivial  adjustments  are  required. 

The  paper  is  organized  as  follows.  In  Section  2  we 
give  several  preliminary  results,  including  the  analysis 
of  the  number  of  x-extreme  vertices  of  a  single  cell.  The 
main  analysis  is  presented  in  Section  3.  The  application 
of  the  main  result  to  the  zone  problem,  as  mentioned 
in  the  abstract,  is  discussed  in  Section  4,  and  the  paper 
concludes  in  Section  5  with  further  applications  of  our 
results  and  some  open  problems. 

2  Preliminaries 

Let  E  =  {c7i, . . . ,  a„}  be  a  given  collection  of  n  surface 
patches  in  3-space  that  satisfy  the  following  conditions: 

(i)  Each  (7,-  is  monotone  in  the  xy-direction  (that  is,  ev¬ 
ery  vertical  line  intersects  Ci  in  at  most  one  point). 
Moreover,  each  cr,-  is  a  portion  of  an  algebraic  sur¬ 
face  of  constant  maximum  degree  b. 

(ii)  The  vertical  projection  of  cTj-  onto  the  a;y-plane  is 
a  planar  region  bounded  by  a  constant  number  of 
algebraic  arcs  of  constant  maximum  degree  (say,  b 
too). 

(iii)  The  relative  interiors  of  any  triple  of  the  given 
surfaces  intersect  in  at  most  s  points  (by  Bezout’s 
theorem  [19]  and  by  Property  (iv)  below,  we  always 
have  s  <  b^). 

(iv)  The  surface  patches  in  S  are  in  general  position] 
one  way  of  defining  this  is  to  require  that  the  co¬ 
efficients  of  the  polynomials  defining  the  surfaces 
and  their  boundaries  are  algebraically  independent 
over  the  rationale  (i.e.,  no  multivariate  polynomial 
with  rational  coefficients  vanishes  when  substitut¬ 
ing  into  it  some  of  the  given  coefficients),  thereby 
excluding  all  kinds  of  ‘degenerate’  configurations; 
see  [16,  17,  26]  for  more  details. 

We  remark  that  the  somewhat  restrictive  condition  (iv) 
and  the  first  part  of  condition  (i)  are  not  essential  for  the 
analysis.  If  the  first  part  of  condition  (i)  does  not  hold, 
we  can  decompose  each  surface  into  a  constant  number 
of  xy-monotone  pieces  by  cutting  it  along  the  locus  of 
points  with  2-vertical  tangency.  If  condition  (iv)  does 
not  hold,  we  can  argue,  by  applying,  as  in  [26],  a  small 
random  perturbation  of  the  polynomials,  that  the  com¬ 
plexity  of  a  single  cell  in  a  degenerate  arrangement  of 
surfaces  is  at  most  proportional  to  the  worst-case  com¬ 
plexity  of  a  single  cell  in  arrangements  of  surfaces  in 
general  position. 

We  are  also  given  a  point  Z  not  lying  on  any  surface, 
and  define  C  —  Cz(E)  to  be  the  cell  of  the  arrangement 
M(S)  that  contains  Z\  by  definition,  C  is  an  open  set  in 
B?. 


Theorem  2.1  The  number  of  x-extreme  vertices  of  C 
is  Oin?"). 

Proof.  For  each  xq  G  H,  let  denote  the  plane 
X  =  xq.  We  call  a  point  w  £  dC  critical  if  there  exists 
a  neighborhood  iV  of  w;  and  a  connected  component  K 
of  C  n  AT  so  that  K  fl  is  disconnected,  where  xq  is 
the  x-coordinate  of  w,  but  K  fl  tTx  is  connected  either 
for  all  X  <  xo  or  for  all  x  >  xq  sufficiently  close  to  xq  . 

We  first  claim  that  the  number  of  x-extreme  vertices 
of  C  is  proportional  to  2  plus  the  number  of  critical 
points  of  C.  We  prove  this  by  an  argument  borrowed 
from  [6j. 

We  sweep  C  by  moving  the  plane  in  the  direction 
of  increasing  x,  and  keep  track  of  the  number  I  of  con¬ 
nected  components  of  C  n  tt^  .  This  number  is  initially 
(at  X  =  — oo)  O(n^),  since  this  is  an  upper  bound  on  the 
overall  complexity  of  any  planar  cross  section  of  the  en¬ 
tire  arrangement  «4(S).  I  increases  by  1  when  sweeps 
through  a  local  minimum  of  C,  or  when  a  connected 
component  of  C  fl  splits  into  two  subcomponents;  I 
decreases  by  1  when  tt  sweeps  through  a  local  maximum 
of  C,  or  when  two  components  of  CHt^x  merge  into  a  sin¬ 
gle  component.  (The  general  position  assumption  im¬ 
plies  that  only  two  components  can  merge  into,  or  split 
from,  a  component  of  C  at  any  given  x.)  The  number  of 
events  at  which  components  can  split  or  merge  is  equal, 
by  definition,  to  the  number  Q  of  critical  points  of  C. 

Consider  the  following  dynamic  scheme  for  assign¬ 
ing  weights  to  each  component  of  C  D  Tr®.  Initially, 
at  X  =  — oo,  we  assign  weight  —1  to  each  component 
of  C  n  TT*.  When  tTx  sweeps  through  a  local  minimum 
point  of  C,  a  new  component  of  C  n  tt*  is  created,  and 
is  also  assigned  weight  —1.  When  two  components  of 
C  HtTx  merge,  we  assign  to  the  new  component  weight 
2  plus  the  sum  of  the  weights  of  the  merged  compo¬ 
nents.  When  a  component  shrinks  and  disappears,  its 
final  weight  is  added  to  a  global  count  M.  When  a  com¬ 
ponent  is  split  into  two  subcomponents,  each  of  them 
is  assigned  weight  1  -b  y,  where  w  is  the  weight  of  the 
split  component. 

We  claim  that,  at  any  given  time  during  the  sweep, 
the  weight  of  any  component  of  C  Pi  tTj,  is  always  at  least 
— 1,  and  the  weight  of  a  component  that  was  formed  by 
one  or  more  preceding  splitting  and  merging  operations 
is  non-negative.  Both  claims  are  easy  to  prove  by  in¬ 
duction  on  the  sweep  events.  If  C  has  no  critical  points 
then  it  has  at  most  one  local  minimum  and  at  most  one 
local  maximum,  and  the  claim  holds  trivially  in  this 
case;  so  suppose  C  does  have  critical  points.  In  this 
case  it  is  easily  verified,  using  induction  on  the  sweep 
events,  that  the  weight  of  each  component  of  CriTrj,  that 
shrinks  to  a  point  as  we  reach  a  local  maximum  of  C  is 
nonnegative,  so  the  value  of  M  is  always  nonnegative. 
Similarly,  all  components  that  survive  as  x  reaches  4-oo 
have  nonnegative  weight. 
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Suppose  that,  at  some  point  during  the  sweep,  there 
are  local  minima  of  C  to  the  left  of  that  CflTT-oo 
has  t  components,  that  the  number  of  splittings  and 
mergings  of  cross-sectional  components  to  the  left  of 
is  Qx  <  Q,  and  that  the  current  value  of  the  count 
M  is  Mx-  Then,  as  is  easily  verified  by  induction  on 
the  sweep  events,  the  total  weight  of  the  components 
oi  C  C\  Tlx  plus  Mx  is  equal  to  2Qx  —  Sx  —  t.  Hence, 
since  at  a:  =  -f  c»  the  total  weight  of  the  components  of 
C  DtTx  plus  Mx  is  nonnegative,  we  have  2Q  —  s  —  t  >  0, 
where  s  is  the  total  number  of  local  minima  of  C.  This 
implies  that  s  <  2Q.  A  symmetric  argument  applies  to 
the  number  of  local  maxima  of  C,  and  thus  the  claim  is 
established. 

It  therefore  remains  to  estimate  the  number  of  critical 
points  of  C.  For  any  fixed  surface  cr  G  E,  the  general 
position  assumption  is  easily  seen  to  imply  that  there 
are  only  0(1)  critical  points  that  lie  only  on  a  and  on  no 
other  surface,  so  the  total  number  of  such  points,  over 
all  cr  G  E,  is  only  0(n).  The  number  of  critical  points 
that  lie  on  the  boundary  of  one  surface  of  E  and  on  a 
second  surface,  summed  over  all  pairs  of  such  surfaces, 
is  easily  seen  to  be  only  O(n^). 

Consider  next  critical  points  that  lie  in  the  relative 
interior  of  an  intersection  curve  7,y  =  a,-  H  Oj ,  for  some 
pair  of  surfaces  cr,-,  aj  G  S  and  on  no  other  surface 
of  E.  If  such  a  point  w  is  a  singular  point^  on,  say 
(7,-,  then  w  is  an  intersection  point  between  the  curve 
of  singular  points  on  a;  and  cjj,  and,  by  the  general 
position  assumption,  the  number  of  such  points,  over 
all  pairs  <7i,  aj  G  S,  is  clearly  O(n^).  We  thus  may 
assume  that  tu  is  nonsingular  on  both  surfaces,  and,  by 
the  general  position  assumption,  that  cr;  and  aj  meet 
transversally  at  w.  But  then  the  criticality  of  w  is  easily 
seen  to  imply  that  the  tangent  vector  to  jij  at  tc  must 
be  orthogonal  to  the  7:-axis,  and  the  number  of  points 
on  7,j  with  this  property  is  0(1)  (under  the  general 
position  assumption).  Hence  the  total  number  of  such 
points,  over  all  pairs  cr,-,  aj  G  E,  is  also  0{n^). 

Finally,  suppose  that  w  is  a  critical  point  that  is  also 
a  point  of  intersection  of  three  surfaces  a-,-,  aj,  G  E. 
Arguing  as  above,  we  can  rule  out  the  case  where  w  is 
singular  on  either  of  these  surfaces.  Consider  now  the 
three  intersection  curves  ji  =  TTxg  fl  ct,-,  ■jj  —  fl  aj, 
Ik  =  T^xo  hi  O' A:,  where  Xq  is  the  a;-coordinate  of  iv.  These 
curves  meet  at  w,  they  are  all  smooth  at  w,  and  KniTxa 
lies  on  a  single  side  of  each  of  the  curves.  The  number 
of  critical  points  of  this  kind,  for  which  any  two  of  these 
curves  are  tangent  to  each  other  at  w,  is  easily  seen, 
using  an  argument  similar  to  the  one  given  above,  to  be 
only  0{in?).  Otherwise,  Kni^xo  must  be  fully  contained, 
locally  near  vj,  in  just  one  of  the  six  regions  into  which 


point  V  is  singular  on  an  algebraic  surface  in  IR"^,  defined 
by  P  =  0,  for  some  polynomial  P,  if  all  the  partial  derivatives 
dPIdxi,  for  i  1, ...  ,d,  vanish  at  v;  see  [7,  19]. 


these  curves  split  tTxo  locally  near  w.  However,  this  is 
easily  seen  to  contradict  the  criticality  of  w,  and  thus 
implies  that  the  total  number  of  critical  points  of  C  is 
0{n'^),  as  asserted.  □ 

3  Complexity  of  a  Single  Cell 

We  call  a  vertex  v  of  A(E)  an  inner  vertex  if  v  is  formed 
by  the  intersection  of  the  relative  interiors  of  three  dis¬ 
tinct  surfaces  of  E.  We  concentrate  on  bounding  the 
number  of  inner  vertices  of  Cz(E),  and  later  on  jus¬ 
tify  the  use  of  this  reduced  measure  of  complexity.  Our 
main  result  is: 

Theorem  3.1  The  number  of  inner  vertices  ofCz{T^) 
is  0(n^+‘),  for  any  e  >  0,  where  the  constant  of  pro¬ 
portionality  depends  on  e  and  on  the  maximum  degree 
and  shape  of  the  surfaces. 

Proof.  Let  v  be  an  inner  vertex  of  C  =  Cz(E),  which 
is  incident  to  three  surfaces  ai,  a 2  and  173.  We  may 
assume  that  v  is  not  a  singular  point  on  any  of  these 
surfaces,  and  that  these  surfaces  meet  transversally  at 
V.  If,  say,  t;  is  a  singular  point  of  ai ,  then  it  lies  on  the 
algebraic  curve  of  bounded  degree  consisting  of  all  sin¬ 
gular  points  on  (7j .  The  number  of  intersection  points 
of  this  curve  with,  say  <72,  is  constant  (under  the  as¬ 
sumption  of  general  position),  which  implies  that  the 
number  of  such  vertices  is  only  O(n^).  The  assumption 
concerning  transversality  is  also  justified  by  the  general 
position  assumption. 

For  technical  reasons,  we  distinguish  between  differ¬ 
ent  sides  of  V,  adapting  the  notation  of  [17]  (see  also 
[3,  5,  13]).  Formally,  the  three  tangent  planes  to  the 
surfaces  cr,-  at  v  partition  3-space  into  8  octants,  and  a 
side  R  of  v  is  any  one  of  these  octants.  We  call  the  pair 
(u,  R)  a  0-border.  We  say  that  (u,  R)  is  a  0-border  of 
C  if,  when  we  move  from  v  in  any  direction  that  points 
into  R  by  any  sufficiently  small  distance,  we  enter  C. 
Our  goal  is  to  count  the  number  of  inner  0-borders  of 
C,  which  means  that  we  count  each  vertex  u  of  C  with 
multiplicity,  once  for  each  side  of  v  that  lies  in  C  (in  the 
above  sense).  We  define  k(E)  to  be  the  number  of  inner 
0-borders  on  dCziT.)-  We  also  denote  by  K{n)  the  max¬ 
imum  possible  value  of  k(E),  taken  over  all  collections 
E,  as  above,  with  a  total  of  n  surfaces,  and  over  all  cells 
of,A(E). 

In  the  foregoing  analysis,  the  notion  of  a  side  needs 
also  to  be  extended  to  edges  and  faces  bounding  C.  For 
an  edge  e,  formed  by  the  intersection  of  two  surfaces  a, 
a',  we  can  assume  (by  reasons  similar  to  those  used  in 
the  preceding  arguments)  that  no  point  on  e  is  singular 
on  either  of  these  surfaces,  and  that  a  and  a'  cross  each 
other  transversally  at  each  point  of  e.  Then  at  each 
point  z  G  e,  the  plane  normal  to  e  at  2  is  split  by  the 
two  tangent  planes  to  a,  a'  at  2  into  4  quadrants.  A 
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side  of  e  can  be  thought  of  as  a  continuous  mapping 
(in  the  Hausdorff  sense)  that  maps  each  point  z  G  e 
to  one  of  the  quadrants  at  2  (or,  rather,  to  make  the 
Hausdorff  continuity  well  defined,  to  the  intersection  of 
such  a  quadrant  with  the  unit  ball  around  z).  Similarly, 
a  side  of  a  face  /  can  be  defined  as  a  continuous  mapping 
from  each  point  z  E  f  to  one  of  the  two  unit  vectors 
normal  to  /  at  2.  If  a  vertex  v  is  incident  to  an  edge 
e  and  lies  on  another  surface  a  crossing  e  transversally, 
then  a  side  i?  of  u  is  consistent  with  a  side  Q  of  e  if  the 
limit  of  Q{z),  as  2  approaches  v,  is  contained  in  (the 
closure  of)  R,  and  R  is  the  positive  cone  spanned  by 
the  limit  of  Q{z)  and  by  the  vector  normal  to  cr  at  2 
and  pointing  in  the  direction  of  e.  Consistency  between 
sides  of  a  vertex  and  an  incident  face,  or  between  sides  of 
an  edge  and  an  incident  face,  can  be  defined  in  a  similar 
manner.  If  u  is  a  non-singular  vertex  of  .4(11),  incident 
to  two  edges  ei,  62,  which  are  contained  in  the  same 
intersection  curve  of  a  pair  of  surfaces,  we  say  that  a 
side  Ri  of  ej  is  consistent  with  a  side  R2  of  62  if  the  two 
limits  of  Ri{z)  as  2  approaches  v  along  ei,  and  of  R2{z) 
as  2  approaches  v  along  62,  coincide.  If 
two  sides  of  v  consistent  with  Ri,  R2,  respectively,  we 
say  that  R2  is  the  side  of  v  opposite  to  R[  across  the 
third  surface  defining  v.  Given  an  edge  e  and  a  side  R 
of  e,  we  say  that  {e,R)  is  a  1-border  of  C  if,  when  we 
move  from  any  point  2  €  e  in  a  direction  contained  in 
R{z),  we  enter  C.  Similarly,  we  can  define  2-borders 
(/,  R)  of  C,  for  a  face  /  and  a  side  R  of  /. 

Let  (v,  R)  be  an  inner  0-border  of  C  (non-singular, 
formed  by  the  transversal  intersection  of  its  three  inci¬ 
dent  surfaces,  say  cri ,  <72 ,  0-3) .  The  corresponding  vertex 
V  is  incident  to  (at  least)  three  edges  of  C,  which  we  de¬ 
note  by  ei2,  ei3,  and  623,  where  each  eij  is  a  portion 
of  the  corresponding  intersection  curve  jij  =  Ui  fl  Cj, 
for  1  <  f  <  i  <  3;  moreover,  each  edge  Cij  has  a  side 
Rij  which  is  consistent  with  R,  so  that  {eij,Rij)  is  a 
1-border  of  C.  If  one  of  these  curves,  say  712,  contains 
two  edges,  612,  ej2,  with  respective  sides  Ru,  R'i2^  such 
that  ei2  and  ej2  have  u  as  a  common  endpoint,  R12  and 
R'32  are  consistent  with  each  other,  and  both  (ei2,i?i2) 
and  {e'i2^R'i2)  are  1-borders  of  C,  then,  as  is  easily  seen, 
there  is  a  face  /  on  <73  which  is  incident  to  v  and  which 
forms  with  both  its  sides  2-borders  of  C.  We  call  such 
a  face  a  popular  face  of  C,  borrowing  a  notation  from 
[3,  5];  see  Figure  1.  Let  us  denote  by  7r(S)  the  maxi¬ 
mum  number  of  inner  vertices  of  popular  faces  bounding 
a  single  cell  of  .4(E),  and  let  7r(n)  denote  the  maximum 
of  7r(E),  over  all  collections  E  of  n  surfaces  as  above 
(with  the  same  s  and  6).  (Strictly  speaking,  a  vertex  v 
can  be  incident  to  more  than  one  popular  face,  in  which 
case  we  count  it  in  7r(E)  with  multiplicity,  once  for  every 
incident  popular  face.) 

A  major  novel  ingredient  of  the  proof  is  the  derivation 
of  a  sharp  upper  bound  on  the  number  of  vertices  of  pop- 


Figure  1 :  The  popular  face  /  borders  the  cell  C  on  both 
its  sides 


ular  faces  of  C;  in  the  previous  paper  [17]  such  a  bound 
was  derived  using  special  properties  of  the  surfaces  that 
arose  in  the  specific  motion  planning  application  that 
was  studied  there;  here  we  apply  a  new  technique  for 
obtaining  the  desired  bound  in  general  arrangements. 
We  first  obtain  an  upper  bound  for  the  complexity  /c(n) 
in  terms  of  the  function  tt,  and  then  proceed  to  derive 
an  upper  bound  for  7r(n). 

Thus,  up  to  an  additive  term  of  7r(n),  it  suffices,  for 
the  bound  on  K(n)  that  we  seek,  to  consider  only  inner 
vertices  v  (or,  rather,  inner  0-borders  (v,  R))  which  are 
not  incident  to  any  popular  face  of  C. 

Let  (v,R)  be  such  a  0-border,  and  let  us  continue 
to  follow  the  notations  introduced  above.  For  each 
1  <  f  <  i  <  3,  the  curve  7,j  must  contain  a  maxi¬ 
mal  open  a;-monotone  connected  portion  /3ij  having  v 
as  an  endpoint,  such  that  the  1-border  (j3ij,R'^j),  where 
R'^j  is  the  side  of  /3,j  consistent  with  Rij,  is  disjoint  from 
C.  Let  Zij  denote  the  other  endpoint  of  (iij. 

We  define  the  index  of  v,  denoted  j{v),  to  be  the 
number  of  points  of  intersections  of  ci,  <72,  C73  which 
lie  to  the  right  of  v  (i.e.,  with  x  >  x{v)).  Clearly,  0  < 
j{v)  <  s  -  1. 

We  define  «:^^)(E),  for  j  =  0, . . . ,  s  —  1  to  be  the  max¬ 
imum  number  of  0-borders  {v,  R)  of  any  fixed  cell  of 
A(E),  whose  vertices  v  are  inner  vertices  of  index  at 
most  j.  We  also  define  to  be  the  maximum  pos¬ 

sible  value  for  k^-’^E),  over  all  collections  E  of  n  surface 
patches  satisfying  conditions  (i)-(iv)  (with  the  same  b 
and  s).  Similarly,  we  define  7r^-’^(E),  for  ji  =  0, . . . ,  s  —  1, 
to  be  the  maximum  number  of  vertices  with  index  at 
most  j  of  all  popular  faces  bounding  any  fixed  cell  of 
A(E),  where  each  such  vertex  is  counted  with  multiplic¬ 
ity,  once  for  every  incident  popular  face.  We  also  define 
7r(-’3(n)  to  be  the  maximum  possible  value  for  7r^'’^(E), 
over  all  collections  E  of  n  surface  patches  satisfying  con¬ 
ditions  (i)-(iv)  (with  the  same  b  and  s). 

Our  method  is  to  derive  a  recurrence  relationship  for 
K{n),  by  bounding  each  of  the  functions  in  terms 
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of  (with  a  special  handling  of  the  solution 

of  the  resulting  system  of  recurrences  will  yield  the  as¬ 
serted  bounds. 

So  let  us  fix  an  index  j,  and  assume  that  the  vertex 
V  under  consideration  has  index  at  most  j.  In  this  case, 
we  incur  an  overhead  of  only  7r^^^(n),  when  we  also  add 
to  our  counts  vertices  incident  to  popular  faces.  Several 
cases  can  still  arise: 

(a)  All  three  arcs  /3ij  emerge  from  v  in  the  direction  of 
increasing  x  (or  all  emerge  in  the  direction  of  decreasing 
x).  In  this  case  v  is  an  a;-extreme  vertex  of  C,  as  is 
easily  checked  (using  the  general  position  assumption), 
and  Theorem  2.1  implies  that  the  total  number  of  such 
vertices  (and  corresponding  0-borders)  is  O(n^).  We 
will  thus  assume  in  the  sequel  that  at  least  one  of  these 
arcs  emerges  from  v  in  the  direction  of  increasing  x,  and 
at  least  one  arc  emerges  in  the  direction  of  decreasing 

X. 

(b)  At  least  one  of  the  arcs  f3ij  ends  at  a  point  Zij 
which  is  either  an  endpoint  of  the  original  intersection 
curve  7jy ,  or  a  point  of  local  x-extremum  on  that  curve. 
We  then  charge  (v,R)  to  the  point  Zij,  and  note  that 
the  number  of  such  points  is  O(n^),  and  that  each  such 
point  is  charged  only  a  constant  number  of  times  in  this 
manner  (e.g.,  along  (Sij  it  can  be  charged  at  most  once 
for  every  side  of  /3,-j),  thus  implying  that  the  number 
of  0-borders  {v,R)  of  this  kind  is  only  0{n^).  Again, 
in  what  follows  we  assume  that  this  situation  does  not 
arise,  which  means,  in  particular,  that  each  of  the  three 
endpoints  Z{j  is  a  vertex  of  C;  more  precisely,  each  Zij 
has  a  side  R*j  which  lies  across  the  third  surface  defining 
Zij  from  a  side  consistent  with  i?h,  such  that  (zij.Rfj) 
is  a  0-border  of  C.  See  Figure  2. 

(c)  At  least  one  of  the  arcs  /3ij,  say  for  definiteness  f3i2, 
is  not  intersected  by  the  third  surface  . 

Define  the  level  of  a  point  w  in  3-space  to  be  the 
smallest  number  of  surfaces  of  S  whose  removal  makes 
w  belong  to  the  closure  of  the  cell  containing  Z  in  the 
resulting  subarrangement.  If  rc  is  a  vertex  of  ^(D)  and 
i?  is  a  side  of  w,  we  say  that  w  (resp.  {w,R))  lies  at 
restricted  level  X{w)  =  k  (resp.  \{{w,R))  =  k)  if  by 
removing  k  surfaces  from  E,  none  of  which  is  incident 
to  w,  we  make  w  a  vertex  (resp.  make  (w,  R)  a  0-border) 
of  the  cell  containing  Z  in  the  resulting  subarrangement, 
and  if  k  is  the  smallest  number  with  that  property. 

Let  t  denote  the  number  of  distinct  surfaces  of  S  that 
intersect  Pn-  We  fix  some  threshold  parameter  ^  =  ^j, 
to  be  defined  later  (we  will  use  a  different  parameter  for 
each  j),  and  consider  the  following  two  subcases: 

(c.i)  t  >  ^:  In  this  case  we  charge  {v,R)  to  a  block  of 
^  points  of  intersection  between  /3j2  and  the  surfaces  of 
E,  defined  as  follows.  For  each  surface  a  intersecting 
,;3i2,  choose  its  point  of  intersection  that  lies  nearest 
to  V  along  /3i2.  We  obtain  at  least  ^  such  designated 
points,  and  we  charge  (u,  R)  to  the  block  of  the  first  ^ 


Figure  2:  The  dotted  faces  on  the  surface  aj  appear  on 
the  boundary  of  C,  and  near  both  of  them  C  lies,  say, 
above  ai 


designated  points,  in  their  order  along  Pn  from  v.  All 
those  points  are  inner  vertices  of  >l(E),  and  it  is  clear 
that  none  of  these  vertices  can  be  charged  in  this  manner 
more  than  a  constant  number  of  times  (along  P12  each 
such  vertex  can  be  charged  at  most  twice  for  each  side  of 
Pi2)-  By  construction,  each  of  the  charged  vertices  lies 
at  restricted  level  at  most  as  is  easily  verified.  Our 
goal  is  thus  to  obtain  an  upper  bound  for  the  number 
M  of  inner  vertices  of  >1(S)  that  lie  at  restricted  level 
<  the  number  of  0-borders  in  the  present  subcase  is 

O{M/0. 

For  this  we  apply  the  probabilistic  analysis  technique 
of  [9,  25],  in  the  same  manner  as  in  [17].  That  is,  we 
choose  a  random  sample  TZoir  —  n/^  surfaces  of  E,  and 
construct  the  arrangement  A{TVj.  Let  w  be  an  inner 
vertex  of  A.(E)  at  restricted  level  A  <  f ,  and  let  Q  be 
a  specific  collection  of  A  surfaces,  none  incident  to  w, 
whose  removal  makes  w  a  vertex  of  the  cell  containing 
Z.  The  probability  that  w  shows  up  as  a  vertex  of 
Cz{R)  is  at  least  ("7-r^)/(r)-  number 

(7)  of  possible  samples  TZ,  consider  those  samples  that 
contain  the  three  surfaces  forming  w  and  do  not  contain 
any  of  the  A  surfaces  of  Q;  for  each  of  these  samples 
(and  possibly  for  other  samples  as  well),  w  is  a  vertex 
of  Cz(77).  Hence,  we  have 

E%Y^^A<F:[«(77)]<K(r), 

where  E[-]  denotes  expectation  (with  respect  to  TZ),  and 
where  Fx  is  the  number  of  vertices  w  of  ^(E)  at  re¬ 
stricted  level  A.  (Note  that  k(TZ)  counts  the  number  of 
0-borders  bounding  Cz{TZ),  which  is  clearly  an  upper 
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bound  on  the  number  of  vertices  of  that  cell.)  As  in 
[9,  25],  one  easily  verifies  that,  for  r  =  n/^,  we  have 

j2Fx  =  O{e<n/0); 

A=0 

in  other  words,  the  number  of  inner  vertices  of  A(S) 
at  restricted  level  <  f  is  0{^^K(n/()),  which  in  turn 
implies  that  the  number  of  inner  0-borders  (u,  R)  of  C 
in  this  subcase  is  0(^^K(n/f)). 

(c.ii)  t  <  ^:  In  this  case,  if  we  remove  these  t  surfaces 
from  the  arrangement,  v  becomes  a  vertex  of  a  popular 
face  of  C.  Indeed,  2^2  has  an  appropriately  consistent 
side  i?*2  so  that  (2^2,  R^)  is  a  0-border  of  C.  When  we 
remove  the  t  surfaces  crossing  (3i2,  the  cell  C  expands 
from  R*2  towards  v  and  ‘reaches’  the  other  side  of  v 
consistent  with  the  side  R'12  of  /3i2,  making  v  a  vertex 
of  a  popular  face  (on  0-3)  of  the  cell  containing  Z  in 
the  reduced  arrangement.  To  exploit  this  observation, 
•we  apply  the  following  variant  of  the  preceding  random 
sampling  argument.  Fix  a  parameter  r  =  n/^,  and  draw 
a  random  sample  R  oir  surfaces  of  S.  Let  E'[R]  be  the 
expected  number  of  vertices  in  A{R)  of  index  <  j  which 
are  incident  to  popular  faces  of  Cz{R)  (counted  with 
the  appropriate  multiplicity).  By  definition,  E'[R]  < 
(Note  that  the  index  of  a  vertex  does  not  change 
when  we  pass  to  a  subset  R,  as  long  as  the  3  surfaces 
defining  the  vertex  belong  to  R.)  Now,  using  a  similar 
argument  to  the  one  given  above,  the  probability  that 
our  vertex  v  will  show  up  as  a  vertex  of  such  a  popular 
face  of  Cz{R)  is  >  ("]r-3^)/(r)'  (”)  possible 

ways  of  choosing  R,  we  consider  those  samples  for  which 
O'! )  o'2, 0-3  &R,  and  none  of  the  other  t  surfaces  crossing 
/3i2  is  chosen  in  7^;  as  already  noted,  each  such  choice 
(and  possibly  others  as  well)  will  make  v  appear  as  a 
vertex  of  a  popular  face  of  the  cell  under  consideration. 
Hence,  we  have 

i=0 

where  Gt  is  the  number  of  0-borders  (u,  R)  in  the  full 
arrangement  that  fall  into  the  present  subcase,  with 
j  (^)  <  j  with  exactly  t  surfaces  crossing  the  corre¬ 

sponding  arc  /3i2.  Arguing  exactly  as  above,  we  obtain, 
for  r  =  n/^, 

t=o 

in  other  words,  the  number  of  0-borders  in  this  subcase 
is  0(^^7r(^)(n/f)). 

(d)  In  the  remaining  case  (which  can  occur  only  if 
j{v)  >  0),  the  third  surface  as  intersects  I3i2  in  at  least 
one  point  w;  if  there  are  several  such  points  (no  more 


than  j  by  assumption),  we  take  w  to  be  the  point  lying 
furthest  from  v  along  (3i2-  Let  t  denote  the  number  of 
distinct  surfaces  of  E,  excluding  as,  that  intersect  f3i2- 
We  consider  the  following  two  subcases: 

(d.i)  t  >  ^:  In  this  case  we  charge  {v,R)  to  a  block  of 
^  vertices  of  the  full  arrangement  A(E)  which  lie  along 
/3i2,  in  complete  analogy  to  the  construction  in  case  (c.i) 
above,  except  that  the  surface  <73  is  excluded  from  the 
construction.  Since  each  such  vertex  can  be  charged 
in  this  manner  only  a  constant  number  of  times,  and 
all  these  vertices  lie  at  restricted  level  <  ^  -b  1,  as  is 
easily  checked,  it  follows,  exactly  as  above,  that  the  total 
number  of  0-borders  v  in  this  subcase  is 
(d.ii)  t  <  ^:  In  this  case,  if  we  remove  these  t  surfaces 
(without  removing  0-3),  the  point  w,  together  with  a  side 
consistent  with  must  form  a  0-border  {w,Rw) 
of  the  cell  containing  Z  in  the  reduced  arrangement. 
Indeed,  recall  that  we  are  assuming  that  the  other  end¬ 
point  Z12  of  /3i2  forms  a  0-border  {212,  R12)  of  0, 

side  i?*2  lying  across  from  a  side  consistent  with  R^^- 
By  assumption,  the  portion  of  /3i2  between  w  and  2:12  is 
not  crossed  by  <73,  so,  when  the  other  t  surfaces  crossing 
I3i2  are  removed,  the  cell  C  expands  from  the  side  i?i2 
and  ‘reaches’  w  from  212  along  the  side  {l3i2,E'u)- 
charge  (u,  R)  to  {w,  R^)-  Clearly,  each  such  (w,  R^)  is 
charged  in  this  manner  only  a  constant  number  of  times. 

We  next  estimate  the  number  of  0-borders  {w,Rw) 
of  this  kind.  We  apply  a  random-sampling  argument 
similar  to  those  used  above.  That  is,  we  fix  a  parameter 
r  =  n/^,  and  draw  a  random  sample  R  oi  r  surfaces 
of  S.  Let  E"\R]  be  the  expected  number  of  0-borders 
{w,Ru,)  of  Cz{R),  such  that  w  has  index  <  j  -  1.  By 
definition,  E"[R]  =  £;[/^C-l)(7^)]  < 

The  probability  that  the  charged  0-border  {w,Rw) 
will  show  up  as  such  a  0-border  of  Cz{R)  is  > 
("7-3 *)/(”)■  of  (”)  possible  ways  of  choosing  7^, 
w'e  consider  those  samples  for  which  ai,a2,as  &R,  and 
none  of  the  other  t  surfaces  crossing  /3i2  is  chosen  in 
R]  each  such  choice  (and  possibly  other  choices  too) 
will  make  {w,Rw)  appear  as  a  0-border  of  Cz{R),  as 
argued  above.  Hence,  we  have 

E  =  0{E"[R])  =  , 

t=0  Vr-J 

where  Ht  is  the  number  of  0-borders  {v,  R)  in  the  full  ar¬ 
rangement  that  fall  in  the  present  subcase,  with  j (v)  <  j 
and  with  exactly  t  surfaces  crossing  the  corresponding 
arc  (}i2  (excluding  the  corresponding  surface  as).  Ar¬ 
guing  as  above,  we  obtain,  for  r  =  n/^, 

<=o 

in  other  words,  the  number  of  0-borders  {v,R)  in  this 
subcase  is  0(i^^^c^f~^l(u/^)). 
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Figure  3:  Two  adjacent  popular  faces  giving  rise 
to  a  popular  edge  62 


Hence,  summing  over  all  cases,  we  obtain  the  fol¬ 
lowing  recurrence  for  (where,  for  j  =  0,  we  put 
=  0  in  the  right-hand  side): 

K^^\n)  =  0{^^K{n/0  +  + 

+  n'^  +  Tc^^^n))  .  (1) 

In  order  to  solve  the  recurrence  (1)  we  first  have  to 
bound  the  functions  7r^^^(n),  which  is  done  in  the  next 
subsection. 

3.1  The  Number  of  Vertices  on  Popular 
Faces 

To  bound  the  number  of  such  vertices,  we  adapt  the 
analysis  given  above  to  bound  the  functions  7rl^^(n), 
rather  than  the  functions  K^^^(n). 

Let  V  be  an  inner  vertex  of  a  popular  face  /  of  the 
cell  C;  assume  that  /  lies  on  a  surface  a  €  S,  and 
that  the  two  other  surfaces  incident  to  v  are  cr^,  <72  G 
S.  As  above,  we  may  assume  that  v  is  non-singular 
on  any  of  these  surfaces,  and  that  these  surfaces  meet 
transversally  at  v.  Denote  the  two  sides  of  /  by  /?+  and 
Let  7,'  =  (j,-  n  (7,  for  i  —  1,2.  Each  7,-  contains  an 
edge  e;  having  v  as  an  endpoint  and  bounding  /.  Let 
R~  denote  the  two  sides  of  Ci  that  are  consistent 
with  i?+,  i?“,  respectively. 

If,  say  7i  has  another  edge  ej  incident  to  v  such  that 
(a)  ej  bounds  another  popular  face  /'  on  a,  and  (b)  / 
and  /'  share  the  edge  62,  then  62  is  a  popular  edge  of 
C,  meaning  that  all  its  four  sides  lie  in  C  locally  near 
62  (see  Figure  3).  We  claim  that  the  number  of  popular 
edges  of  C  is  0{n‘^).  This  follows  from  the  observation 
that  V  must  be  a  locally  x-extreme  vertex  of  one  of  these 


four  sides^  (assuming  general  position),  and,  by  Theo¬ 
rem  2.1,  the  number  of  such  vertices,  and  hence  also  the 
number  of  popular  edges,  is  O(n^).  Moreover,  if  both 
Cl  and  62  emanate  from  v  in  the  positive  x-direction, 
or  if  both  emanate  in  the  negative  2:-direction,  then,  as 
is  easily  checked,  v  must  be  a  locally  x-extreme  ver¬ 
tex  of  one  of  the  two  sides  of  /,  so  the  number  of  such 
vertices  is  also  O(n^).  Hence,  in  what  follows  we  may 
assume  that  neither  of  the  edges  Ci,  €2  is  adjacent  along 
a  to  another  popular  face  of  C,  and  that  one  of  these 
edges  emanates  from  v  in  the  positive  x-direction  and 
one  emanates  in  the  negative  x-direction. 

As  above,  our  method  is  to  derive  a  recurrence  re¬ 
lationship  for  7r(n),  by  bounding  each  of  the  functions 
in  terms  of  (with  a  special  handling  of  7r^°^); 

the  solution  of  the  resulting  system  of  recurrences  will 
yield  the  asserted  bounds. 

So  let  us  fix  an  index  j,  and  assume  that  the  vertex  v 
under  consideration  has  index  <  j.  By  our  assumptions, 
for  i  =  1, 2,  the  curve  7,-  must  contain  a  maximal  open  x- 
monotone  connected  portion  /3,-  having  v  as  an  endpoint 
and  satisfying  the  following  property:  let  i?)'*',  R'~  be 
the  two  sides  of  (5i  which  are  consistent  with  the  two 
respective  sides  Rf ,  R~  of  e,;  then  f3i  does  not  contain 
any  point  at  which  both  sides  R[^ ,  R'~  lie  locally  in  C. 
Let  Zi  denote  the  other  endpoint  of  /3{.  By  assumption, 
one  of  these  arcs,  say  I3i ,  emanates  from  v  in  the  positive 
x-direction  and  the  other  emanates  in  the  negative  x- 
direction. 

Several  cases  can  arise: 

(a)  At  least  one  of  the  arcs  f3i  ends  at  a  point  z,-  which 
is  either  an  endpoint  of  the  original  intersection  curve 
7i,  or  a  point  of  local  x-extremum  on  that  curve.  We 
then  charge  v  to  the  point  z,,  and  note  that  the  num¬ 
ber  of  such  points  is  O(n^),  and  that  each  such  point  is 
charged  only  a  constant  number  of  times  in  this  manner, 
thus  implying  that  the  number  of  vertices  v  of  this  kind 
is  only  In  what  follows  we  assume  that  this  sit¬ 

uation  does  not  arise,  which  means,  in  particular,  that 
zi  is  a  vertex  of  another  popular  face  of  C,  whose  two 
sides  are  consistent  with  (appropriate  extensions  along 
7i  of)  respectively. 

In  the  full  version  of  the  paper  we  distinguish  and 
analyze  two  additional  cases:  (b)  (3i  is  not  intersected 
by  the  third  surface  (J2,  and  (c)  the  surface  a2  intersects 
/?!  in  at  least  one  point  (which  can  occur  only  when 
j{v)  >  0).  Due  to  lack  of  space,  we  omit  further  details 
for  these  cases  here,  and  we  just  summarize  with  the 
resulting  recurrence  for  (where,  for  j  =  0,  we  put 
=  0  in  the  right-hand  side): 

7r(^)(n)  =  0(^^7r(n/f)  -f  -f-  .  (2) 

^Strictly  speaking,  v  is  locally  x-extreme  in  one  of  the  four 
portions  of  C  into  which  these  sides  ‘point’;  we  allow  ourselves 
here  and  below  this  slight  abuse  of  notation. 
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3.2  Solving  the  Recurrences 

We  next  proceed  to  solve  the  recurrences  (1)  and  (2). 
We  start  with  the  latter  equation,  fix  some  £  >  0,  and 
claim  that  its  solution  is  for  j  = 

0, . . . ,  s  -  1,  where  the  constants  Bj  depend  on  e,  j,  and 
on  the  maximum  degree  b.  By  definition,  this  implies 
that  x(n)  < 

We  prove  this  claim  by  induction  on  n.  We  first 
rewrite  (2),  using  a  different  parameter  for  each  j, 
as 

7r^°^(n)  <  cfo7r(n/fo)  +  and  (3) 

7r(^')(n)  <  c(f7r(n/^j)  +  , 

j  =  l,...,s-  1, 

for  appropriate  positive  constants  c,  ^o, . . .  with¬ 

out  loss  of  generality,  we  assume  c  >  1 .  We  take  to  be 
sufficiently  large,  and  put  for  j  —  0, . . . ,  s  -  1; 

note  that  for  j  =  1, , , , ,  s  -  1.  We  note  that, 

by  choosing  the  Bj’s  to  be  sufficiently  large,  we  can  as¬ 
sume  that  the  claimed  bounds  hold  for  all  n  <  ^o-  (E.g., 
choose  the  Bj’s  to  be  larger  than  some  appropriate  mul¬ 
tiple  of  foi  and  use  the  fact  that  all  the  quantities  we 
want  to  estimate  are  bounded  by  O(n^),  where  the  con¬ 
stant  of  proportionality  depends  only  on  the  maximum 
degree  b  of  the  surfaces  and  of  their  boundaries.) 

For  n  >  ^0)  we  apply  the  induction  hypothesis  in  the 
right-hand  side  of  (3),  and  conclude  that  the  asserted 
bounds  continue  to  hold  for  n  too,  provided  that  the 
following  inequalities  are  satisfied: 


cBg-i  ^  D 

<  -00  , 
so 

and 

^  +  $  +  7  =  1, 

Sj  so 

...,s-  1. 

To  achieve  this,  we  choose  Bq  >  , 

and  Bj  — 

,  and  require  that  the  f/s  be  sufficiently 

large  so  that  ^5"  >  2sc*  .  We  leave  it  to  the  reader  to 
verify  that  this  choice  of  coefficients  satisfies  the  above 
inequalities.  This  inductive  step  completes  the  solution 
of  the  recurrence  for  the  functions  7r^^^(n). 

In  a  similar  manner  we  solve  the  recurrences  (1).  We 
show  that  the  solution  of  these  equations  is  < 

for  j  =  0, . . . ,  s  —  1,  where  the  constants  Aj 
depend  on  e,  j,  and  the  maximum  degree  b.  By  def¬ 
inition,  this  implies  that  K(n)  <  j4s_in^+®.  For  lack 
of  space,  we  omit  the  fairly  routine  inductive  proof  of 
these  bounds. 

Using  rather  standard  arguments  (given  in  the  full 
version  of  this  paper),  one  can  show  that  the  overall 
combinatorial  complexity  of  C  is  dominated  asymptot¬ 
ically  by  the  above  bound  for  the  number  of  its  inner 
vertices.  We  thus  obtain  the  main  result  of  the  paper: 


Theorem  3.2  The  combinatorial  complexity  of  a  sin¬ 
gle  cell  in  an  arrangement  of  n  algebraic  surface 
patches  in  3-space,  satisfying  the  conditions  (i)-(iv),  is 
for  any  e  >  0,  where  the  constant  of  propor¬ 
tionality  depends  on  e,  s  and  b. 

4  Zone  Complexity 

An  interesting  application  of  Theorem  3.2  is  to  bound 
the  combinatorial  complexity  of  the  zone  of  a  surface 
in  an  arrangement  of  other  surfaces  in  3-space.  Specifi¬ 
cally,  let  S  be  a  collection  of  n  algebraic  surface  patches 
in  3-space,  and  let  a  be  another  such  surface,  so  that 
the  surfaces  in  E  U  {cr}  satisfy  conditions  (i)-(iv).  The 
zone  of  a  in  A(S)  is  the  collection  of  all  cells  of  A(E) 
that  are  crossed  by  cr.  The  complexity  of  the  zone  is  the 
sum  of  the  complexities  of  all  its  cells. 

Theorem  4.1  The  combinatorial  complexity  of  the 
zone  of  (7  in  A(E)  is  for  any  e  >  0,  where 

the  constant  of  proportionality  depends  on  e,  s  and  b. 

Proof.  We  extend  the  idea  used  in  [12]  for  the  anal¬ 
ysis  of  zones  in  2-dimensional  arrangements.  That  is, 
assume  first  that  cr  is  a  connected  surface.  We  cut  each 
(7i  €  E  into  a  constant  number  of  sub-patches  along  its 
curve  of  intersection  with  cr.  If  we  shrink  these  pieces 
by  a  small  amount,  all  the  cells  of  the  zone  become 
connected  to  each  other,  and  form  a  single  cell  in  the 
arrangement  of  the  new  patches.  Since  these  patches 
are  easily  seen  to  also  satisfy  conditions  similar  to  (i)- 
(iv),  the  asserted  bound  follows.  If  a  is  not  connected,  it 
consists  of  a  constant  number  of  connected  components, 
and  we  apply  the  above  argument  to  each  component 
separately,  n 

5  Conclusion 

In  this  paper  we  have  obtained  a  near-quadratic  bound 
for  the  combinatorial  complexity  of  a  single  cell  in  an 
arrangement  of  n  algebraic  surface  patches  of  constant 
maximum  degree,  each  bounded  by  a  constant  number 
of  algebraic  arcs  of  constant  maximum  degree  as  well. 
This  almost  settles  a  long-standing  conjecture,  and  pro¬ 
vides  a  fairly  satisfactory  extension  of  the  2-dimensional 
Davenport-Schinzel  theory  developed  in  [14].  Our  proof 
extends  recent  techniques  developed  in  [16,  17,  26],  all 
based  on  the  probabilistic  technique  of  [9,  25]  for  deriv¬ 
ing  bounds  on  the  number  of  generalized  ‘(<  k)-sets’  in 
arrangements. 

As  noted  in  the  introduction,  our  result  has  immedi¬ 
ate  applications  to  the  general  motion  planning  problem 
with  three  degrees  of  freedom.  That  is,  for  rather  gen¬ 
eral  systems  with  three  degrees  of  freedom,  the  combi¬ 
natorial  complexity  of  the  connected  component  of  the 
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free  configuration  space,  consisting  of  all  robot  place¬ 
ments  which  are  reachable  from  some  given  initial  free 
placement,  is  for  any  £  >  0,  where  n  is  the 

number  of  contact  surfaces,  as  defined  in  the  introduc¬ 
tion.  This  still  falls  short  of  efficient  construction  of  such 
a  component  (in  near-quadratic  running  time,  which  is 
one  of  the  major  open  problems  that  we  pose  in  this 
paper).  However,  at  least  we  know  that  the  complexity 
of  such  a  component  is  nearly  an  order  of  magnitude 
smaller  than  the  worst-case  complexity  of  the  entire  ar¬ 
rangement,  and,  in  most  cases,  also  of  the  entire  free 
configuration  space. 

Another  open  problem  that  the  paper  raises  is  to 
further  improve  the  bound  that  we  have  obtained,  to 
the  conjectured  bound  of  0(nAs(n)),  or  at  least  to 
0(n^polylog(n)).  Another  open  problem  is  to  extend 
our  results  to  arrangements  in  d  dimensions.  We  be¬ 
lieve  that  this  is  doable,  and  are  currently  exploring 
this  problem.  One  main  subproblem  here  is  to  extend 
Theorem  2.1  to  higher  dimensions. 

Other  open  problems  are  to  extend  our  analysis  to 
obtain  sharp  bounds  on  the  complexity  of  many  cells 
in  3-D  arrangements,  and  to  derive  bounds  on  the  sum 
of  squares  of  cell  complexities  in  an  entire  arrangement; 
see  [2,  5]  for  related  work. 
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On  Translational  Motion  Planning  in  3-Space* 


Boris  Aronov^ 


Micha  Sharir^ 


Abstract 

Let  S  be  a  convex  polyhedron  translating  in  3-space 
amidst  k  convex  polyhedral  obstacles  with 

pairwise  disjoint  interiors.  The  free,  configuration  space 
(space  of  all  collision-free  placements)  of  B  can  be  repre¬ 
sented  as  the  complement  of  the  union  of  the  Minkowski 
sums  Pi  =  Aj  0  {—B),  for  i  =  I,.  ..,k.  We  show  that 
the.  combinatorial  complexity  of  the  free  configuration 
space  of  B  is  0(nk  log^  k),  where  ri  is  the  total  complex¬ 
ity  of  the  individual  Minkowski  sums  Pi, . . .,  Pk-  The 
bound  is  almost  tight  in  the  worst  case.  We  also  derive 
an  efficient  randomized  algorithm  that  constructs  this 
configuration  space  in  expected  time  0{nk\o^  k). 

1  Introduction 

Let  Ai,...,Ak  be  k  closed  convex  polyhedra  in  three 
dimensions  with  pairwise  disjoint  interiors,  and  let  B 
be  another  closed  convex  polyhedron,  which,  with  no 
loss  of  generality,  is  assumed  to  contain  the  origin  O. 
In  the  context  of  motion  planning,  fl  is  a  ‘robot’  free 
to  only  translate  in  3-space,  and  the  A,  ’s  are  obstacles 
which  B  must  avoid.  Suppose  Ai  has  qi  faces,  B  has  p 
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faces,  and  put  q  =  9«'-  Let 

Pi  =  Ai®  {-B)  -  {a-b\a  e  Ai,b  e  B) 

be  the  Minkowski  sum  of  Ai  and  —B,  for  i  =  I,. .  .,k, 
let  V  =  be  the  resulting  collection  of  these  so- 

called  expanded  obstacles,  and  let  U  =  Ui^=i  Bi  be  their 
union.  As  is  well  known,  the  complement  C  of  U  (also 
called  the  common  exterior  of  P)  represents  the  free 
configuration  space  FP  of  B,  in  the  sense  that,  for  each 
point  z  E  C,  the  placement  of  B,  for  which  the  reference 
point  O  lies  at  z,  does  not  intersect  any  of  the  obstacles 
Ai,  and  all  such  free  placements  are  represented  in  this 
manner. 

As  is  well  known,  the  combinatorial  complexity  (i.e., 
the  number  of  vertices,  edges,  and  faces  on  the  bound¬ 
ary)  of  each  F,-  is  at  most  Q{pqi),  so  the  sum  of  the 
complexities  of  the  expanded  obstacles  is  n  =  0{pq).  In 
practice,  n  is  usually  much  smaller  than  pq.  The  bounds 
derived  in  this  paper  depend  only  on  n  and  k,  and  not 
on  p  and  q. 

Our  main  result  is  that  the  combinatorial  complexity 
of  the  union  U ,  and  thus  also  of  FP  =  C,  is  0{nk  log^  k) 
and  Fl{nka{k))  in  the  worst  case.  This  should  be  com¬ 
pared  to  the  recent  bound  obtained  by  the  authors  [4] 
on  the  combinatorial  complexity  of  the  union  of  any  k 
convex  polyhedra  in  3-space  with  a  total  of  n  faces;  it 
is  shown  there  that  the  maximum  complexity  of  such  a 
union  is  0{k^  +  7ik\og^  k)  and  Q{k^  +  nka{k)).  Thus 
the  convex  polyhedra  Pi  arising  in  the  context  of  trans¬ 
lational  motion  planning  have  special  properties  that 
yield  the  above  improved  bound. 

The  problem  of  obtaining  sharp  bounds  for  the  com¬ 
binatorial  complexity  of  the  union  of  Minkowski  sums, 
as  above,  has  been  open  for  the  past  eight  years,  and 
has  been  studied  during  this  period  in  several  papers. 
It  was  raised  by  Kedem  et  al.  [16]  (see  also  [20]),  where 
the  2-dimensional  version  of  the  problem  was  success¬ 
fully  tackled.  Related  work  on  the  3-dimensional  case 
is  described  in  [2,  3,  4,  10,  14,  19].  Except  for  the 
papers  [4,  14]  (and  parts  of  [2]),  these  papers  studied 
different  (though  related)  problems,  involving  the  com¬ 
plexity  of  the  lower  envelope  or  of  a  single  cell  in  an 
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arrangement  of  triangles  in  3-space.  A  recent  paper  of 
Halperin  and  Yap  [14]  analyzes  the  complexity  of  the 
union  of  Minkowski  expansions,  as  above,  for  the  case 
where  if  is  a  box,  and  obtains  a  bound  of  0(n^n(n)). 
Our  result  is  general,  and  thus  almost  settles  this  open 
problem,  except  for  the  polylogarithmic.  factor  in  our 
upper  bound;  we  conjecture  that  the  correct  bound  is 
0{7ika(k)). 

We  also  consider  the  algorithmic  problem  of  efficient 
construction  of  FP.  We  present  an  efficient  and  rather 
simple  randomized  algorithm  that  computes  FP  in  ex¬ 
pected  time  0{7ik\og^  k).  The  algorithm  and  its  anal¬ 
ysis  are  adapted  from  a  very  similar  algorithm  given  in 
[4]  for  constructing  the  union  of  arbitrary  convex  poly- 
hedra  in  3-space. 

The  paper  is  organized  as  follows.  In  Section  2  we 
bound  the  number  of  components  and  local  minima  of 
CF  The  analysis  of  the  topology  of  U,  and  of  several 
related  structures  constructed  from  a  family  of  convex 
polyhedra,  continues  in  Section  3.  Section  4  establishes 
the  main  result  of  the  paper,  namely  the  bounds  on  the 
combinatorial  complexity  of  U .  Section  5  describes  the 
randomized  algorithm  for  computing  the  boundary  of 
U  and  Section  6  concludes  with  some  remarks  and  open 
problems. 

2  The  Number  of  Holes  of  LI 
and  Local  Extrema  of  C 

We  first  simplify  the  analysis  by  assuming  that  the  given 
polyhedra  A,  and  B  are  in  general  ■position,  meaning 
that  the  coordinates  of  the  vertices  of  the  A,’s  and  of  B 
are  all  transcendentals  that  are  algebraically  indepen¬ 
dent  over  the  rationals.  As  argued  in  [4],  the  general 
position  assumption  involves  no  real  loss  of  generality. 
Without  loss  of  generality,  in  what  follows  we  will  also 
assume  that  U  is  bounded  and  that  C  has  thus  a  single 
unbounded  component. 

To  establish  our  upper  bound  on  the  complexity  of 
the  union  U,  it  clearly  suffices  to  bound  the  number  of 
vertices  of  U .  Moreover,  it  suffices  to  bound  the  number 
of  vertices  of  U  which  are  formed  by  the  intersection 
of  the  relative  interiors  of  three  faces  of  three  distinct 
polyhedra  Pi,  because  the  number  of  all  other  vertices, 
namely,  vertices  of  the  original  Pi’s  and  intersections  of 
edges  of  the  original  Pi’s  with  faces  of  other  expanded 
obstacles,  is  only  0{nk).  This  is  a  consequence  of  the 
following  easy  proposition  (cf.  [4]): 

Proposition  2.1  The  number  of  vertices  of  Pi  D  Pj, 
summed  over  all  Pi,Pj  G  V,  is  0{nk). 

We  first  bound  the  number  of  connected  components 
of  the  complement  C  oi  U  (the  common  exterior  of  V). 
Under  the  assumptions  made  above,  C  has  a  unique  un¬ 
bounded  component,  so  it  suffices  to  estimate  the  num¬ 
ber  of  bounded  components  of  C,  i.e.,  “holes”  in  U. 


Let  an  intersection  edge  be  an  edge  of  the  intersection 
of  the  boundaries  of  two  distinct  expanded  obstacles. 
Each  such  edge  e  meets  the  boundary  of  the  union  U 
in  at  most  k  —  1  intervals,  as  it  intersects  each  of  the 
remaining  k  —  2  polyhedra  in  at  most  one  interval,  and 
the  complement  (within  e)  of  the  union  of  these  k  —  2 
intervals  consists  of  at  most  1—1  intervals. 

We  introduce  a  real  parameter  t  G  [0,  1]  (we  refer  to 
it  as  ‘time’),  define  Pi{t)  =  Ai  ©  {—tB),  for  i  —  I, . . . ,  k, 
and  U{t)  =  (jf_i  Piit),  and  let  C{t)  denote  the  comple¬ 
ment  of  U (t).  Note  that  the  general  position  assumption 
does  not  imply  that  polyhedra  {Pi{t)}  are  in  general  po¬ 
sition,  for  all  values  of  t.  It  does  imply,  though,  that,  at 
any  given  t,  we  can  have  at  most  one  degenerate  contact 
between  the  Pi(t)’s.  Such  a  contact  can  be  expressed  as 
a  polynomial  equality  in  t  and  in  the  coordinates  of  the 
vertices  of  the  AUs  and  of  B.  Moreover,  there  is  only  a 
finite  number  of  values  of  t  at  which  such  a  degeneracy 
can  occur.  For  example,  if  fo  >  0  is  the  first  instant 
at  which  Pi[t)  and  Pj{t)  meet,  then  Pi(t)  n  Pj{t)  must 
consist  of  a  single  point,  for  otherwise  we  would  have  at 
least  two  algebraically-independent  polynomial  equali¬ 
ties  holding  simultaneously,  which  is  forbidden  by  the 
general  position  assumptions.  Moreover,  such  an  initial 
single-point  contact  of  Pi{i)  and  Pjft)  must  occur  ei¬ 
ther  between  a  vertex  of  one  of  these  polyhedra  and  a 
face  of  the  other,  or  between  two  edges,  one  from  each 
polyhedron.  Similarly,  it  is  easy  to  verify  that  the  first 
contact  between  three  growing  obstacles  must  occur  at 
a  single  point. 

Before  continuing,  we  recall  a  well-known  fact:  Let 
P  and  Q  be  two  convex  polyhedra.  Denote  by  A/p  the 
normal  diagram  (also  known  as  the  Gaussian  diagram) 
of  P-,  this  is  the  decomposition  of  the  Gaussian  sphere 
of  orientations  into  regions,  each  consisting  of  the 
outward-directed  normal  directions  of  all  planes  sup¬ 
porting  P  at  some  vertex  (each  edge  of  Mp  is  a  great 
circular  arc  consisting  of  the  outward-directed  normal 
directions  of  all  planes  supporting  P  at  some  edge,  and 
each  vertex  of  Mp  is  the  outward-directed  normal  di¬ 
rection  of  some  face  of  P).  The  normal  diagram  Mq  of 
Q  is  defined  analogously.  Then  the  overlay  of  Mp  and 
Mq  is  the  normal  diagram  of  the  Minkowski  sum  P^dQ. 
This  implies  that  the  combinatorial  structure  (i.e.,  the 
number  of,  and  incidence  relations  between,  faces  of  all 
dimensions)  of  P  ©  Q  is  completely  determined  by  the 
orientations  of  the  faces  of  P  and  of  Q  and  by  their 
incidence  structures.  Hence,  in  particular,  the  combi¬ 
natorial  structure  of  Pi{t)  =  Ai  ©  {—tB)  is  the  same 
for  all  t  >  0.  This  implies  the  following  property.  For 
t  G  [0,  1],  let  s{t)  denote  the  Minkowski  sum  either  of  a 
fixed  edge  of  Ai  and  {—t)  times  a  fixed  vertex  of  B,  or 
of  a  fixed  vertex  of  A,-  and  (—t)  times  a  fixed  edge  of  B. 
If  s(<o)  is  an  edge  of  Pi(<o),  for  some  to  G  (0,  1],  then 
s{t)  is  an  edge  of  Pi{t)  for  all  t  G  (0,  Ij. 
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We  want  to  keep  track  of  the  number  of  holes  of  U{t) 
as  t  increases  from  0  to  1.  At  <  =  0,  the  number  of  holes 
of  U{t)  is  0.  As  t  increases,  the  holes  shrink,  so  the 
number  of  holes  can  increase  only  when  a  component  of 
C’(<)  is  split  into  two  components. 

Lemma  2.2  A  component  of  C{t)  can  split  into  two 
components  only  at  times  t  when  an  edge  s{t)  of  some 
Pi{i)  meets  an  intersection  edge  e{t)  of  two  other  poly- 
hedra,  Pi{i),  Pj{t),  at  some  point  v,  so  that  the  inter¬ 
section  of  C{t)  with  a  S7nall  neighborhood  of  v  is  discon¬ 
nected. 

Proof:  Easily  follows  from  an  observation  of  de  Berg  et 
al.  [5]  that  a  split  can  occur  only  when  some  Pi{t),  Pj{t), 
P{{t)  come  to  intersect  for  the  first  time,  and  from  gen¬ 
eral  position  assumption.  We  omit  the  details.  □ 

Suppose  that  such  a  configuration  does  indeed  arise 
at  some  time  t,  and  suppose  that  e(<)  is  formed  by  the 
intersection  of  a  pair  of  faces  of  Pi{t)  and  of  Pj{t).  The 
edge  s{t)  intersects  both  Pi{t)  and  Pj{t)  at  two  respec¬ 
tive  intervals  having  a  common  endpoint.  Lemma  2.2 
then  implies  that  the  two  intervals  s(t)  fl  Pi{t)  and 
s{t)  n  Pj{t)  must  have  disjoint  relative  interiors. 

Lemma  2.3  If  at  time  t  the  intervals  s{t)  fl  Pi{t)  and 
s{t)r)Pj{t)  intersect,  then  s{t')C\Pi{t')  and  s{t')C\Pj{t') 
also  intersect,  for  all  t'  >  t. 

Proof:  Omitted  in  this  version. 

Lemmas  2.2  and  2. .3  imply  that,  for  each  t  at  which  a 
connected  component  of  C  is  split  into  two  components, 
there  exist  two  polyhedra,  Pi{t),  Pj{t),  and  an  edge  s(<) 
of  another  polyhedron  Pt{t),  such  that  the  two  intervals 
s(t)r\Pi{t)  and  s(t)r\Pj(t)  meet  at  a  common  endpoint, 
and  continue  to  overlap  for  all  t'  >  t.  Thus  the  number 
of  such  ‘hole-splitting’  events,  involving  the  same  edge 
s{t)  of  any  of  the  expanded  polyhedra,  is  at  most  ^  —  3, 
so  the  total  number  of  holes  that  the  union  can  have  is 
at  most  0(71^:).  That  is,  we  have  shown: 

Theorem  2.4  The  number  of  connected  components  of 
the  common  exterior  of  Minkowski  sums  of  k  polyhedra, 
as  above,  is  0{nk). 

Remark:  We  do  not  know  if  this  bound  is  tight,  as  the 
best  lower  bound  we  can  construct  is  f](Ar^). 

Let  S  be  any  closed  polyhedral  set  in  3-space  in  gen¬ 
eral  position.  (In  this  paper  we  only  consider  polyhedral 
sets  bounded  by  a  finite  number  of  edges,  vertices,  and 
faces.)  For  a  point  p,  let  a  neighborhood  of  p  in  S  be 
the  intersection  of  5  with  a  ball  centered  at  p,  whose 
radius  is  smaller  than  the  distance  from  p  to  any  edge, 
face,  or  vertex  of  S  not  incident  to  p.  A  point  p  of  S  is 
a  local  minimum  if  it  is  a  point  of  smallest  z-coordinate 
in  some  neighborhood  of  p  in  S.  (Under  the  general 
position  assumptions,  no  face  or  edge  can  be  parallel  to 


the  xp-plane,  so  p  will  be  the  only  point  in  its  neighbor¬ 
hood  with  this  z-coordinate.)  A  local  maximum  is  de¬ 
fined  similarly.  The  following  theorem  is  a  consequence 
of  Theorem  2.4: 

Theorem  2.5  The  number  of  local  maxima  and  min¬ 
ima  of  the  closure  C  of  C  is  0{nk). 

Proof:  We  prove  the  claim  only  for  the  number  of  lo¬ 
cal  minima  of  C,  since  the  treatment  of  local  maxima 
is  fully  symmetric.  Any  bounded  convex  component 
K  of  C  has  exactly  one  local  minimum  (which  is  also 
the  global  minimum  of  K),  because  K  does  not  have 
a  lowest  horizontal  edge,  as  implied  by  our  general  po¬ 
sition  cissumption.  Hence  the  number  of  such  minima 
is  0(nk),  by  Theorem  2.4.  This  also  applies  to  com¬ 
ponents  of  C  all  of  whose  horizontal  cross  sections  are 
connected,  because  such  a  component  has  at  most  one 
local  (and  global)  minimum  as  well.  So  we  will  proceed 
to  bound  the  number  of  local  minima  of  those  com¬ 
ponents  that  have  at  least  one  disconnected  horizontal 
cross  section.  We  will  in  fact  prove  that  the  number  of 
such  minima  is  only  0{k‘^). 

Sweep  C  with  a  plane  tt  parallel  to  the  xp-plane  and 
moving  upwards,  in  the  positive  z-direction,  and  keep 
track  of  the  number  /  of  connected  components  of  C’flTr. 
This  number  is  initially  1  (when  tt  is  below  all  vertices 
of  6’).  I  increases  by  1  when  tt  sweeps  through  a  local 
minimum  of  C,  or  when  a  connected  component  of  CCitt 
splits  into  two  subcomponents;  I  decreases  by  1  when 
TT  sweeps  through  a  local  maximum  of  C,  or  when  two 
components  of  6’  O  tt  merge  into  a  single  component. 
The  events  at  which  components  may  split  or  merge 
occur  only  when  tt  sweeps  through  the  topmost  or  bot¬ 
tommost  vertex  of  the  intersection  of  some  pair  P,,  Pj 
of  expanded  obstacles,  for  some  i  ^  j.  The  number  of 
such  events  is  clearly  bounded  by  2(2)  =  0{P). 

Consider  the  following  dynamic  scheme  for  assigning 
weights  to  each  component  of  (7  fl  tt.  When  tt  sweeps 
through  a  local  minimum  point  of  C,  a  new  component 
of  CriTT  is  created,  and  is  assigned  weight  —1.  When  two 
components  of  C  n  tt  merge,  the  weight  assigned  to  the 
new  component  is  2  plus  the  sum  of  the  weights  of  the 
merged  components.  When  a  component  shrinks  and 
disappears,  its  final  weight  is  added  to  a  global  count. 
When  a  component  is  split  into  two  subcomponents, 
each  of  them  is  assigned  weight  1  +  where  w  is  the 
weight  of  the  split  component. 

We  claim  that,  at  any  given  time  during  the  sweep, 
the  weight  of  any  component  of  C  fl  tt  is  always  at  least 
—  1,  and  the  weight  of  a  component  that  was  formed  by 
a  splitting  or  merging  operation  is  non-negative.  (We 
refer  to  components  of  C  that  have  a  cross-sectional 
component  of  the  latter  type  as  interesting  components.) 
Both  claims  are  easy  to  prove  by  induction  on  the  sweep 
events.  Now  suppose  that,  at  some  point  during  the 
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sweep,  there  are  s  local  minima  of  C'  below  tt,  so  that 
the  cross-sectional  components  that  they  have  created 
have  already  been  split  or  merged,  and  that  the  num¬ 
ber  of  such  splittings  and  mergings  below  tt  is  At  (which 
is  at  most  0{k'^)).  Then  the  total  weight  of  the  com¬ 
ponents  of  C'  n  TT  that  have  been  formed  by  splittings 
and  mergings,  plus  the  total  weight  of  ail  the  interest¬ 
ing  components  of  6*  that  have  terminated  below  tt,  is 
easily  seen,  by  induction  on  the  sweep  events,  to  be 
'2N  —  s.  As  claimed  above,  this  weight  is  non-negative, 
thus  s  <  2N  =  0{k^).  When  the  sweep  plane  passes 
the  topmost  vertex  of  C,  this  inequality  shows  that  the 
number  of  local  minima  of  C  contained  in  interesting 
components  is  0{k'^).  Adding  the  number  of  local  min¬ 
ima  in  non-interesting  components  of  C,  we  obtain  the 
bound  asserted  in  the  theorem.  □ 

3  On  the  Structure  of  Levels  in  A{V) 

Define  the  arrangement  A{V)  of  the  collection  V  of  the 
expanded  obstacles  to  be  the  decomposition  of  space 
into  vertices,  edges,  faces,  and  3-dimensiona!  cells,  in¬ 
duced  by  the  faces  of  the  polyhedra  of  P;  for  more 
details  on  arrangements,  see  [2,  9].  For  each  s  = 
0, . . . ,  —  3,  let 

{/(■’)  =  [j{Pj^  n...nPjJi<h<j2<...<js<k]. 

Furthermore,  put  6’^’^  =  IR^  \  and  = 

We  call  the  s-ih  level  of  the  ar¬ 
rangement  A{V).  We  also  denote  by  the  closure  of 
Note  that  =  U  is  just  the  union  of  the  poly¬ 
hedra  of  V ,  =  C  is  their  common  exterior,  and 

is  the  common  boundary  of  U  and  C.  These  definitions 
are  illustrated  in  the  planar  case  in  Figure  1. 

In  this  section  we  prove  some  geometric  and  topologi¬ 
cal  properties  of  and  U'A) ,  which  will  be  used  in  the 
proof  of  our  main  theorem  (Theorem  4.2  in  Section  4). 

Lemma  3.1  The  number  of  local  minima  and  the  num¬ 
ber  of  connected  components  of  for  any  collectton 
of  k  polyhedra  with  a  total  of  n  faces  in  general  position, 
are  bounded  by  the  number  of  local  maxima  o/C’h)  plus 

0(P). 

Proof:  is  the  union  of  triple  intersections  of  poly¬ 

hedra  in  V.  Thus  a  local  minimum  of  1/(3)  is  necessar¬ 
ily  a  local  minimum  of  one  of  these  triple  intersections. 
(This  immediately  bounds  the  number  of  local  minima 
by  (3).)  Since  the  number  of  all  pairwise  intersections 
of  polyhedra  in  V  is  only  (3),  it  suffices  to  consider 
only  those  minima  that  are  formed  as  intersections  of 
three  distinct  polyhedron  boundaries.  By  the  general 
position  assumption,  such  an  intersection  must  be  the 
unique  common  point  v  of  the  relative  interiors  of  three 
faces  of  three  distinct  polyhedra  in  V.  In  this  case,  it 


Figure  1:  A  family  of  convex  polygons;  and 

are  shaded  successively  darker. 

is  easy  to  see  that  v  must  be  a  local  maximum  of  6’*-'^. 
The  lemma  follows.  We  omit  the  rather  easy  details.  □ 
Applying  Theorem  2.5,  we  thus  obtain: 

Corollary  3.2  If  V  is  a  collection  of  polyhedra  with  a 
total  of  n  faces,  arising  as  Minkowski  expansions  of  a 
collection  of  k  convex  polyhedra,  with  pairwise-disjoint 
interiors,  by  another  convex  polyhedron,  then  the  num¬ 
ber  of  local  minima  and  the  number  of  connected  com¬ 
ponents  of  is  0{nk). 

We  next  study  the  topological  structure  of  We 
begin  by  deriving  a  topological  property  of  general  poly¬ 
hedral  sets.  Let  S  be  a  closed  polyhedral  set  in  general 
position;  in  particular,  no  two  vertices  of  S  have  the 
same  ^-coordinate.  Note  that  a  neighborhood  of  a  point 
p  in  S,  as  defined  in  the  previous  section,  is  star-shaped 
with  respect  to  p,  so  its  intersection  with  a  horizontal 
plane  through  p  is  also  star-shaped  and  thus  connected. 

A  merge  point  of  S  is  a  point  p  £  S  such  that,  for  any 
neighborhood  of  p,  any  horizontal  plane  lying  below  p 
and  sufficiently  close  to  it  intersects  the  neighborhood  in 
a  disconnected  set.  We  define  the  merge  number  ni[p)  of 
p  to  be  the  number  of  connected  components  in  this  in¬ 
tersection,  less  1.  By  star-shapedness  of  neighborhoods, 
m{p)  is  well  defined.  (We  can  naturally  extend  this  defi¬ 
nition  to  any  non-merge  point  p'  by  putting  m(p')  =  0.) 
We  will  apply  our  analysis  to  ,S’  =  U^'^\  so  in  this  case 
m(p)  <  2  for  any  point  in  the  set,  as  is  easily  implied 
by  the  general  position  assumption  on  V. 

The  first  Betti  number  of  S,  /?i(.S'),  is  the  rank  of  the 
first  singular  homology  group  of  ,S'  [11].  Informally,  it  is 
the  number  of  ‘linearly-independent’  homotopy  classes 
of  closed  cycles  in  S,  where  each  class  consists  of  all 
cycles  homotopic  within  S  to  some  given  cycle  and  not 
contractible  in  S. 
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Proposition  3.3  The  first  Betti  number  fii(S)  of  any 
compact  polyhedral  set  S  in  general  position  does  not 
exceed  the  sum  of  the  merge  numbers  of  its  vertices. 

Proof:  Let  z  :  .S'  — >  IR  be  the  ^-coordinate  function. 
For  a,6  G  IR,  let  ,S[a]  =  .S[a,6]  =  2~'([a,&]), 

and  ,S<[n]  =  oo,  a]). 

We  begin  the  proof  by  stating  a  polyhedral  analogue 
of  a  basic  fact  of  Morse  theory  (cf.  [18,  Theorem  3.1]). 
The  proof  is  an  easy  and  standard  exercise  in  topology, 
we  omit  it  in  this  version. 

Fact  3.4  Let  S  be  a  triangulated  polyhedral  set  in  gen¬ 
eral  position.  If  S[a,b]  contains  either  no  vertex  of  S, 
or  exactly  one  vertex  v  and  the  z-coordinate.  of  v  is  a, 
then  ,S[a]  is  a  deformation  retract^  ofS[a,b]. 

We  now  triangulate  S.  This  may  add  new  vertices 
to  .S,  but  their  presence  does  not  affect  the  statement 
of  the  proposition,  since  they  all  have  merge  number  0. 
Let  2  =  c  be  a  plane  below  all  the  vertices  of  S.  .Since 
.S  is  assumed  to  be  bounded,  we  have  .S<[c]  =  0.  It 
suffices  to  prove  that,  as  t  increases  from  c  to  -foo,  the 
Betti  number  /?i(.S'<[t])  increases  at  (the  z-coordinate 
of)  each  vertex  of  S  by  at  most  the  merge  number  of 
the  vertex,  and  never  changes  otherwise. 

Fact  3.4  implies  that  Pi{S<  [f])  changes  only  when  the 
plane  z  =  t  sweeps  through  a  vertex  of  .S';  indeed,  if 
<'  >  <  is  such  that  no  vertex  of  .S  has  z  coordinate 
in  (<,<'],  then  .S'<[<]  is  a  deformation  retract  of  .S<[<'], 
and  thus  .S<[f]  and  ,S<[<']  have  the  same  homologies 
[21,  Corollary  1.12].  Let  v  be  a  vertex  of  .S.  We  want 
to  bound  the  difference  /?i(.S<[<])  —  A  (^  [^^])>  where 
t  =  z{v)  and  t'  <t  \s  appropriately  close  to  t. 

One  can  easily  verify  that  the  transformation  from 
.S<[<']  to  .S<[f]  is  equivalent,  topologically,  to  gluing  a 
ball  D  centered  at  v  to  .S<[<'],  choosing  the  size  of  D  to 
be  such  that  it  intersects  all  triangles  of  Tf  at  least  one 
of  whose  vertices  approaches  v  as  f'  — *■  f,  and  no  other 
triangles.  An  easy  topological  calculation^  yields 

fii{S<[t])  =  fi,iS<[t']U  D)  < 

A(.S<[f'])  +  -F  /lo(.S<[f']  n  D)  -  1. 

Here  /?o(A')  is  the  number  of  connected  components  of 
X.  Now  l3i{D)  =  0  and  /?o(5<[f']  n  D)  is  exactly  equal 

'For  a  topological  space  X,  a  subspace  F  C  X  is  called  a 
deformation  retract  of  X  if  there  exists  a  continuous  map  p  : 
X  X  [0,1]  — »  X  such  that  p(a:,0)  =  x,  for  all  x  £  X,  p(X,l)  =  K, 
and  p{y,  1)  =  y  for  all  y  £  F ;  see,  for  example,  [21,  pp.  15,19]. 

^It  is  easy  to  show  that,  for  two  path-coimected  topological 
spaces  X  and  F,  /3i(X  uF)  <  /3i (X)  -h  ,01  (F)  n  F)  -  1. 

An  elementary  proof  of  tliis  fact  can  be  found  in  [15].  If  X  is  not 
path-coimected,  but  F  is,  the  statement  follows  by  repeated  ap¬ 
plications  of  the  path-coimected  version.  Note  that  in  our  case  D 
is  path-coimected  but  5<  [i'j  need  not  be.  The  statement  also  eeis- 
ily  follows  from  arguments  based  on  the  Mayer- Vietoris  sequence; 
see,  for  example,  Vick  [21,  p.  22]. 


to  the  number  of  components  of  the  intersection  of  a 
neighborhood  of  v  in  .S'  with  the  plane  z  =  t' ,  lying 
just  below  v,  namely,  it  is  equal  to  m(v)  -F  1.  Hence 
/?i(5'<[t])  —  /?i(.S<[<'])  <  m(v),  as  claimed.  This  con¬ 
cludes  the  proof  of  Proposition  3.3.  □ 

Lemma  3.5  For  a  collection  of  k  arbitrary  convex 
polyhedra  in  general  position,  with  a  total  of  n  faces, 
is  at  most  proportional  to  plus  the.  num¬ 
ber  of  local  minima  of  Hence,  for  collections  of 

Minkowski  expansions,  as  above,  this  number  is  0{nk). 

Proof:  Applying  Proposition  3.3  to  .S'  =  U^'^\  we  con¬ 
clude  that  it  suffices  to  bound  the  number  of  merge 
points  of  This  is  because  the  merge  number  of 

any  point  of  is  at  most  2,  as  follows  from  the  gen¬ 
eral  position  assumption  (and  already  noted  above).  Let 
V  6  be  a  merge  point;  then  u  must  be  a  vertex  of 

.4('P).  We  may  assume  that  d  is  a  vertex  formed  by  the 
transversal  intersection  of  some  three  faces,  F,-,  Fj,  F{, 
belonging  to  three  respective  distinct  expanded  polyhe¬ 
dra  Pi,  Pj,  P(.  (Indeed,  any  other  vertex  is  either  a 
vertex  of  some  P,  or  of  some  Pi  D  Pj ,  for  i,  j  =  I, . . . ,  k. 
No  vertex  of  the  former  type  can  be  a  merge  point  of 
any  as  is  easily  seen.  A  vertex  of  the  latter  type, 
on  the  other  hand,  can  be  a  merge  point  only  if  it  is  the 
bottommost  point  of  P,-  fi  Pj,  as  is  readily  verified  using 
the  general  position  assumption.  Thus  the  number  of 
these  merge  points  is  at  most  (2).)  An  easy  case  analy¬ 
sis  shows  that  any  vertex  v  of  this  form  can  be  a  merge 
point  of  only  if  it  is  a  local  minimum  of  We 
omit  the  details.  □ 

4  The  Complexity  of  the  Union 

Having  all  this  machinery  available,  we  now  turn  to  the 
analysis  of  the  complexity  of  U.  The  initial  portion 
of  the  analysis  is  identical  to  that  of  [4],  and  here  we 
only  go  quickly  through  its  main  steps;  we  refer  the 
reader  to  that  paper  for  full  details.  Specifically,  let 
tl>{k,  n)  denote  the  total  number  of  edges  of  U  contained 
in  Ui<»<j<ifc  dPiPdPj ,  maximized  over  all  collections  V 
of  k  Minkowski  sums,  as  above,  with  a  total  of  ii  faces, 
in  general  position.  We  apply  the  inductive  analysis 
scheme  of  [4],  which  yields  the  following  recurrence  for 
■tj){k,n): 

k  . 

{k-2)tl){k,n)  <  'Y^ip{k-l,n-7ii)-\- -ijP\k,n)  ,  (1) 

l=:l 

where  tii  is  the  number  of  faces  of  Pi,  for  i  =  1, . .  .,k, 
and  where  i/f^^(k,n)  is  the  maximum,  taken  over  all 
collections  P  as  above,  of  the  quantity  tv{f),  with 
the  sum  ranging  over  all  faces  /  of  here  w{f)  = 
max(4e/  —  6,  0)  and  e/  is  the  number  of  edges  on  df  that 
lie  on  the  boundary  of  the  only  polyhedron  containing  / 
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Figure  2:  A  special  quadrilateral  Q. 


in  its  interior.  (This  definition  of  is  not  used 

here,  and  is  given  only  for  the  sake  of  completeness.) 

We  next  estimate  n),  exactly  as  in  [4],  by  using 

the  same  inductive  scheme.  This  leads  to  the  recurrence 

k 

{k-2)^)^^\k,n)  =  ■if>^^\k-\,n-ni)+0{7ik+ij^^\k,  n)) 

i=l 

(2) 

where  is  the  number  of  “special  quadrilater¬ 

als”  of  the  following  form,  maximized  over  all  collec¬ 
tions  V  as  above.  Such  a  quadrilateral  Q  is  schemat¬ 
ically  depicted  in  Figure  2;  it  is  defined  by  a  triple 
(F(Q),  P'iQ),  P"{Q)),  where  P'  =  P'{Q),  P"  =  P"{Q) 
are  distinct  polyhedra  of  V,  and  F  —  F{Q)  is  a  face 
of  another  polyhedron,  P{Q),  of  T;  the  intersection 
P'  n  P"  n  f  is  the  quadrilateral  Q,  which  is  assumed  to 
be  disjoint  from  all  other  polyhedra,  the  four  vertices  of 
Q  are  vertices  of  the  union  (/,  two  opposite  edges  of  Q 
are  contained  in  dP'  fl  F  and  in  the  interior  of  only  P", 
and  the  two  other  opposite  edges  of  Q  are  contained  in 
dP”  n  F  and  in  the  interior  of  only  P'.  In  other  words, 
if  we  remove  P'  and  P"  from  V  and  replace  them  by 
P'  C\P" ,  then  dQ  appears  on  the  boundary  of  the  union 
of  the  modified  collection,  and  the  union  of  all  poly¬ 
hedra  in  P,  except  for  P'  and  P" ,  contains  Q  on  its 
boundary.  In  the  notation  of  Section  3,  Q  is  a  special 
type  of  face  of  .  The  structure  of  locally  near 
Q  is  schematically  depicted  in  Figure  3. 

In  the  case  of  general  convex  polyhedra,  the  authors 
give  a  bound  of  0{P  -b  nk)  for  ^l>^‘^\k,n),  and  show 
that  it  is  tight  in  the  worst  case  [4].  However,  we  show 
here  that  the  special  properties  of  .4(P)  in  the  case 
of  Minkowski  sums,  as  established  in  Section  3,  lead 
to  an  improved  bound  of  only  0{nk).  Plugging  this 
bound  into  the  recurrences  (1)  and  (2),  we  obtain,  fol¬ 
lowing  the  analysis  of  [4],  'ij>^^\k,7i)  =  0{nk\ogk)  and 
^{>{k,  n)  =  0{nk\og‘^  k),  which  thus  completes  the  proof 
of  the  upper  bound  of  our  main  result. 

We  actually  prove  the  following  stronger  result: 


Lemma  4.1  For  collections  V  of  arbitrary  convex  poly¬ 
hedra  tn  general  position,  the  number  of  special  quadri¬ 
laterals  is  at  most  proportional  to  nk  plus  the  number  of 
local  minima  ofU^^\  Hence,  for  Minkowski  expansions, 
as  above,  this  number  is  0{nk). 

I  Proof:  Let  <3  be  a  special  quadrilateral  defined  by 
{F{Q),  P'{Q),  P''{Q)),  as  above.  The  boundary  dQ 
of  Q  is  necessarily  a  bounding  cycle  of  a  connected 
component  of  d{P'  n  P")  n  Following  an  argu¬ 

ment  similar  to  that  of  [4],  we  observe  that  the  over¬ 
all  number  of  special  quadrilaterals  Q  whose  bound¬ 
aries  are  contractible  to  a  point  in  the  corresponding 
sets  d{P'{Q)  D  P"{Q))  H  is  just  0{nk),  since  in  the 
process  of  contracting  dQ  one  has  to  encounter  a  ver¬ 
tex  of  P'{Q)  n  P"{Q),  and  it  is  easily  checked  that  no 
vertex  of  this  form  is  charged  by  more  than  one  con¬ 
tractible  quadrilateral,  for  fixed  P'  and  P".  The  claim 
now  follows  from  Proposition  2.1.  We  thus  need  to 
consider  only  non-trivial  quadrilaterals,  namely  those 
whose  boundary  is  not  contractible  in  the  above  fash¬ 
ion. 

Let  Q  be  such  a  non-trivial  special  quadrilateral.  By 
definition,  Q  does  not  meet  the  interior  of  but  is 


P"  n  p 

Figure  3:  near  a  special  quadrilateral  Q. 
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fully  contained  in  Moreover,  dQ  C  (see  Fig¬ 

ure  3).  “Clut”  along  Q  (or,  rather,  shift  Q  slightly 
away  from  P{Q)  and  then  cut  along  the  shifted 

quadrilateral).  We  apply  this  cutting  procedure  to  ev¬ 
ery  non-trivial  special  quadrilateral,  in  some  arbitrary 
order.  Since,  by  definition,  the  relative  interiors  of  any 
pair  of  distinct  special  quadrilaterals  are  disjoint,  it  fol¬ 
lows  that  the  cuts  do  not  interfere  with  each  other. 

Each  cut  of  performed  along  some  non-trivial 

special  quadrilateral  Q,  has  one  of  two  possible  effects: 
Either  the  first  Betti  number  of  decreases  by  1,  or 
the  number  of  connected  components  of  increases 
by  1,  according  to,  respectively,  whether  points  of 
lying  on  the  two  sides  of  Q  sufficiently  near  Q  can  or  can¬ 
not  be  connected  by  a  path  that  avoids  Q  in  the  current 
version  of  The  number  of  cuts  is  thus  proportional 
to  the  increase  in  the  number  of  connected  components 
of  plus  the  decrease  in  the  first  Betti  number  of 

as  effected  by  the  cuts.  Since  none  of  the  cuts 
can  increase  the  first  Betti  number  of  U^'^\  it  follows, 
by  Lemma  3.5,  that  the  latter  quantity  (the  decrease  in 
is  bounded  by  0{k'^)  plus  the  number  of  local 
minima  of 

To  estimate  the  former  quantity,  consider  a  cut  per¬ 
formed  along  a  special  quadrilateral  Q,  defined  by  the 
triple  {F{Q),  P'{Q),  P''{Q)),  which  increases  the  num¬ 
ber  of  components  of  the  current  version  of  Since 
Q  is  non-trivial,  “dragging”  Q  along  d{P'(Q)  fl  P''{Q)) 
in  either  direction  encounters  a  third  polyhedron,  and 
thus  also  a  distinct  component  of  To  be  more  pre¬ 
cise,  one  of  the  new  components  of  denoted 
contains,  near  Q,  points  belonging  to  P{Q)  H  P'{Q)  H 
P”{Q),  so  contains  a  component  of  The  other 
component  of  denoted  is  bounded,  near  Q, 

by  the  connected  portion  K  of  d{P'{Q)r\  P"{Q))r\ 
incident  to  Q.  It  is  clear  that  K  must  also  be  incident 
(before  any  cut  was  made)  to  some  third  polyhedron, 
for  otherwise  K  would  have  been  homeomorphic  to  a 
disk  whose  boundary  corresponds  to  dQ,  and  Q  would 
then  have  been  a  trivial  quadrilateral,  contractible  to  a 
point  in  K  C  d{P'{Q)  PI  P"{Q))  D  There  are  now 
two  subcases  to  consider: 

(i)  The  component  also  contains  a  component  of 

(Note  that  no  component  of  is  ever  split  by 
the  cuts,  because  all  special  quadrilaterals  are  disjoint 
from  the  interior  of  and  the  cuts,  performed  along 
slightly  shifted  copies  of  the  quadrilaterals,  are  thus  dis¬ 
joint  from  The  total  increase  in  the  number  of 

components  of  formed  by  cuts  of  this  kind  cannot 
exceed  the  number  of  components  of  and  can  thus 
be  bounded  by  the  number  of  local  minima  of 
(2) 

(ii)  The  component  f/^  ^  does  not  contain  a  component 

of  Since  the  corresponding  boundary  K,  as  de¬ 

fined  above,  was  incident,  before  any  cut  was  made,  to 
another  component  of  it  follows  that  this  subcase 


occurs  because  previous  cuts,  along  other  special  quadri¬ 
laterals  incident  to  A',  have  separated  K  from  all  other 
adjacent  components  of  In  this  case,  we  charge  the 
current  cut  to  one  of  these  preceding  cuts.  It  is  easily 
checked  that  no  cut  is  charged  in  this  manner  more  than 
once.  This  implies  that  the  number  of  special  quadrilat¬ 
erals  in  this  subcase  is  at  most  equal  to  the  number  of 
non-trivial  special  quadrilaterals  of  the  preceding  types 
(those  decreasing  and  those  appearing  in  case 

(i)  above). 

This  completes  the  proof  for  collections  of  general 
convex  polyhedra.  In  the  case  of  Minkowski  expansions, 
as  above.  Corollary  3.2  implies  that  the  number  of  spe¬ 
cial  quadrilaterals  is  0{nk).  □ 

We  have  thus  shown  that,  in  the  case  of  Minkowski  ex¬ 
pansions,  =  0{7ik),  which  in  turn  completes 

the  analysis  of  ■ij){k,n)  and  yields  the  upper  bound  in 
the  main  result  of  the  paper: 

Theorem  4.2  Lei  Ai,. .  .,A};  be  k  convex  polyhedra  in 
3-space  with  pairwise  disjoint  interiors,  and  let  B  be 
another  convex  polyhedron.  The  combinatorial  complex¬ 
ity  of  the  union  of  the  Minkowski  .sums  Aj  ®  {—B),  for 
i  —  I,. . .  ,k,  is  0{nk  \og^  k),  where  n  is  the  overall  com¬ 
plexity  of  the  individual  Minkowski  sums.  In  the  worst 
case,  this  complexity  can  be  Q{nka{k)). 

Remark:  The  preceding  argument  assumes  general 
position  of  the  polyhedra  Ai  and  B.  Nevertheless,  the 
theorem  also  holds  for  collections  of  polyhedra  not  in 
general  position,  as  can  be  argued  using  a  perturbation 
scheme  [4].  We  omit  the  details  in  this  version. 

Proof  of  the  lower  bound:  We  make  use  of  a  planar 
construction,  given  in  [1],  of  k  convex  polygons  with 
a  total  of  n  edges,  so  that  their  union  has  Q(?icv(A')) 
edges  and  vertices.  Additionally,  the  polygons  can  be 
arranged  so  that  their  union  is  star-shaped,  say  with 
respect  to  the  origin,  and  at  least  some  fixed  fraction 
of  its  vertices  are  visible  from  the  point  (0,  -boo),  in  the 
strong  sense  that  there  exists  some  fixed  angle  fd  >  0 
(independent  of  k  and  n),  so  that,  for  any  such  visible 
vertex  v,  the  wedge  whose  apex  is  v,  whose  bisecting 
ray  is  parallel  to  the  positive  y-axis,  and  whose  angle  is 
/?,  does  not  meet  any  of  the  polygon  interiors. 

Without  loss  of  generality,  assume  that  k  is  even  and 
n  >  4.5k.  We  start  our  three-dimensional  construc¬ 
tion  with  a  set  of  k/2  convex  polygons  in  sy-plarie  with 
n  —  5k  edges  altogether,  so  that  their  union  (/'  has 
Q{jia{k))  vertices  visible  from  (0,-boo,0)  in  the  above 
strong  sense.  By  scaling,  we  may  assume  that  the  entire 
construction  is  contained  in  the  unit  disk  about  the  ori¬ 
gin  in  the  x^Z-plane.  Now  slightly  shift  and  expand  the 
polygons  in  the  ^-direction,  each  by  a  different  amount, 
to  produce  pairwise-disjoint  flat  and  thin  convex  prisms, 
all  contained  in  the  slab  \z\  <  0.1,  say;  see  Figure  4.  The 
second  set  of  k/2  polyhedra  consists  of  points  (0,  M,  i), 
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Figure  4:  The  lower  bound  construction,  not  to  scale;  (a)  a  view  from  above;  (b)  a  side  view. 


for  i  =  1, . .  .,k/‘2  (or,  rather,  tiny  tetrahedra  centered 
around  these  points),  where  M  ^  ^  is  an  appropri¬ 
ate  parameter.  This  gives  us  a  collection  of 

k  pairwise-disjoint  convex  polyhedra  with  a  total  of  n 
faces.  The  polyhedron  S  is  a  tetrahedron  with  vertices 
(0,0,  ±M')  and  (±M',M,  0),  where  M'  <  MjA  is  an¬ 
other  parameter,  chosen  so  that  the  dihedral  angles  of 
B  at  its  horizontal  and  vertical  edges  are  both  equal  to 
some  /?'  <  /?;  note  that,  by  the  choice  of  M',  we  have 
tan  ^  <  1/4,  an  inequality  that  will  be  needed  below. 
See  Figure  4  for  an  illustration. 

Let  t)  be  a  vertex  of  U'  visible  from  (0,-|-oo,0)  in  the 
above  sense.  By  construction,  we  can  place  B  so  that 
its  vertical  edge  touches  the  two  prisms  correspond¬ 
ing  to  the  two  polygons  whose  boundaries  intersect  at  v; 
moreover,  we  can  slide  B  vertically  upwards  and  down¬ 
wards,  by  a  total  distance  of  close  to  2M',  so  that 
maintains  these  two  contacts,  while  the  interior  of  B  re¬ 
mains  disjoint  from  any  of  the  shifted  prisms.  It  is  easily 
seen  that,  for  an  appropriate  choice  of  M ,  independently 
of  the  choice  of  v,  the  boundary  of  B  will  meet  each  of 
the  tiny  tetrahedra  Ai  around  the  points  (0,  M,  i)  during 
the  vertical  motion.  Moreover,  our  choice  of  parameters 
also  implies  that  the  intersection  of  B  with  the  vertical 
line  X  =  0,y  =  M  has  length  less  than  2tan|^  <  1/2, 
so,  when  B  touches  one  of  these  Ai’s,  its  interior  re¬ 
mains  disjoint  from  all  the  other  polyhedra  Ai  \  see  Fig¬ 
ure  4(b)  for  an  illustration.  In  other  words,  each  of  the 
appropriate  Q.(na{k))  vertices  of  U'  gives  rise  to  Q.{k) 
placements  of  B  where  it  makes  three  contacts  with  the 
Ti’s,  while  its  interior  remains  disjoint  from  all  these 
polyhedra.  Since  each  of  the  resulting  Q{nka{k))  place¬ 
ments  of  B  corresponds  to  a  vertex  of  the  union  of  the 
expanded  polyhedra  Ai  ©  (— S),  the  lower  bound 
of  the  theorem  follows.  □ 


Corollary  4.3  The  combinatorial  complexity  of  the 
free  configuration  space  of  a  convex  polyhedron  B,  trans¬ 
lating  in  3-space  amidst  a  collection  of  k  convex  obsta¬ 
cles,  Ai, . . . ,  Ak,  having  pairwise  disjoint  interiors,  is 
0{nk\o^  k)  and  can  be  lil(nk(y{k))  in  the  worst  case, 
where  n  is  the  overall  complexity  of  the  Minkowski  sums 
Ai  ©  (-B),  for  i  =  .  .,k. 

Remarks:  (1)  Going  back  to  the  parameters  p  and  q 
that  count  the  number  of  faces  of  B  and  of  all  the  j4j’s 
together,  we  can  state  the  upper  bound  of  Theorem  4.2 
and  Corollary  4. .3  as  0{pqk  log^  k),  and  the  lower  bound 
as  Q{qka{k))  for  constant  p.  A  sharper  dependence  of 
the  bound  on  p  (when  q  and  k  are  fixed)  remains  an 
open  problem. 

(2)  Our  proof  of  Theorem  4.2  makes  use  of  the  fact 
that  the  Pfs  are  Minkowski  sums  only  in  the  proof  of 
Theorem  2.4.  Hence  our  analysis  also  applies  to  any 
collection  of  k  general  convex  polyhedra  with  a  total 
of  n  faces,  with  the  property  that,  for  any  subset  of  r 
of  these  polyhedra,  the  number  of  components  of  the 
complement  of  their  union  is  0{rm),  where  m  is  the 
total  number  of  faces  of  those  r  polyhedra.  We  pose  it 
as  an  open  problem  to  find  other  natural  examples  of 
collections  of  convex  polyhedra  with  this  property. 

5  Efficient  Construction  of  the  Union 

We  next  describe  an  efficient  randomized  algorithm  for 
constructing  the  union  [/  of  a  collection  V  of  expanded 
polyhedra,  as  above.  The  input  to  the  algorithm  con¬ 
sists  of  the  original  polyhedra  Ai  and  B,  so  vie  first  com¬ 
pute  the  individual  Minkowski  sums  Pi  =  Ai  ©  {—B), 
for  i  =  1, . . . ,  This  computation  can  be  done  in  sev¬ 
eral  ways,  the  most  efficient  of  which  is  by  using  the 
technique  of  Guibas  and  Seidel  [13].  Each  P,-  can  be 
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constructed  in  time  0{p  +  qi  +ni),  where  tii  is  the  com¬ 
plexity  of  Pi.  Thus  the  cost  of  this  stage  is  0{pk  +  q+n). 

The  main  algorithm  is  essentially  identical  to  the  one 
given  in  [4]  for  the  case  of  general  polyhedra;  for  the  sake 
of  completeness,  we  briefly  review  the  algorithm  here, 
and  revise  its  analysis  to  exploit  the  improved  combina¬ 
torial  bounds  derived  above. 

The  algorithm  proceeds  as  follows.  We  first  compute 
all  the  pairwise  intersections  P,-  fl  Pj,  for  1  <  z  <  j  <  ^. 
This  can  be  done  in  time  0{nk),  as  shown  in  [4].  In 
additional  0{nk)  time,  we  can  also  extract,  for  each 
face  P  of  a  polyhedron  Pi  E  V,  the  collection  Qf  of 
the  convex  polygons  Qj  =  F  D  Pj ,  for  j  ^  i.  The  set 
Up  —  P  \  [Jjyii  Qj  i®  ^'he  portion  of  F  that  appears  on 
dU ,  so  the  algorithm  computes  the  sets  Up,  over  all 
faces  F.  Reconstructing  the  boundary  of  U  from  this 
information  is  relatively  straightforward,  by  gluing  the 
sets  Up  to  each  other  in  an  appropriate  manner.  The 
construction  of  the  sets  Up  is  done  exactly  as  in  [4]: 
For  each  face  F  of  some  Pi  £  V,  we  choose  a  random 
order  of  the  polyhedra  in  V  \  {Pi}  and  insert  the  poly¬ 
gons  Qj  £  Qp,  one  by  one,  in  the  corresponding  order, 
maintaining  the  complement  of  their  union  (within  the 
plane  containing  F)  as  we  go.  For  this  we  use  the  same 
technique  as  in  [7,  12,  17],  which  maintains  a  vertical 
decomposition  (relative  to  some  direction  within  F)  of 
the  complement  into  trapezoids;  see  [4]  for  more  details. 
At  the  end  of  the  incremental  procedure,  it  is  relatively 
straightforward  to  truncate  the  resulting  complement  to 
within  F.  The  cost  of  this  truncation  is  easily  seen  to 
be  proportional  to  the  complexity  of  F  plus  the  number 
of  final  trapezoids  forming  the  complement  of  (JQf’. 
Thus  this  step  does  not  increase  the  asymptotic  com¬ 
plexity  of  the  algorithm.  In  the  motion  planning  ap¬ 
plication,  mentioned  below,  we  also  need  support  for 
efficient  point  location  queries  in  (7^^.  No  additional  pro¬ 
cessing  is  needed,  as  the  data  structure  used  to  compute 
Up,  as  adapted  from  the  algorithms  of  [7,  17],  already 
supports  logarithmic-time  point  location  queries. 

The  cost  of  the  algorithm  is  proportional  to  the  num¬ 
ber  of  trapezoids  that  are  created  during  the  execution 
of  the  algorithm,  plus  the  sum  of  their  weights,  plus  the 
initial  cost  0{7ik)  of  computing  all  the  collections  Qp. 
Here  the  weight  of  a  trapezoid  t  C  F  is  the  number  of 
polygons  Qj  £  Qp  that  intersect  the  relative  interior 
of  r;  each  of  them  either  fully  contains  r  or  its  bound¬ 
ary  crosses  r.  Define  a  canonical  trapezoid  t  C  F  to 
be  a  trapezoid  that  occurs  in  the  trapezoidal  decompo¬ 
sition  of  the  complement  (within  the  plane  containing 
F)  of  the  union  of  some  subset  oi  Qp.  Such  a  trape¬ 
zoid  is  defined  by  between  1  and  5  polyhedra — Pi  plus 
the  at  most  4  polyhedra  that  define  its  sides  and  cor¬ 
ners  in  the  plane  containing  F.  The  following  argument 
applies  only  to  canonical  trapezoids  defined  by  exactly 
5  polyhedra.  It  has  to  be  repeated,  with  straightfor¬ 


ward  modifications,  for  trapezoids  defined  by  1,  2,  3, 
or  4  polyhedra.  Adapting  the  analysis  of  [4],  and  using 
the  Clarkson-Shor  probabilistic  technique  [8],  one  easily 
shows; 

Claim  1:  The  probability  that  a  canonical  trapezoid  r, 
lying  in  the  plane  containing  F  and  having  weight  w,  is 
created  during  the  incremental  construction  is 
Claim  2:  The  overall  number  T^,  of  canonical  trape¬ 
zoids  with  weight  less  than  w,  over  all  faces  F  of  the 
given  polyhedra,  is  0{riPnk\o^  ^). 

The  expected  running  time  of  the  algorithm,  over  all 
faces  F ,  is  thus 
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where  is  the  total  number  of  canonical  trapezoids 
whose  weight  is  exactly  w.  Since  —  Tw  (with 

To  =  0),  we  can  rewrite  the  above  sum  as 
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Hence  we  have: 


Theorem  5.1  The  union  of  a  collection  of  k  expanded 
convex  polyhedra  in  3-space,  as  above,  with  a  total  of 
n  faces,  can  be  computed  in  randomized  expected  time 
0(nfe  log®  k). 

We  omit  here  the  rather  easy  details  of  the  following 
application  to  motion  planning: 

Corollary  5.2  Given  a  convex  polyhedron  B,  free  to 
translate  among  k  convex  polyhedral  obstacles  with 
pairwise- disjoint  interiors,  the  entire  free  configuration 
space  of  B  can  be  computed  and  preprocessed  in  ran¬ 
domized  expected  time  O {nk  log^  k),  where  n  is  the  total 
number  of  vertices  of  the  Minkowski  sums  of  the  obsta¬ 
cles  and  of —B.  Then,  given  two  placements,  <^2; 
of  B,  we  can  decide,  in  0(^’log?i)  tixtie,  whether  B  can 
translate  in  a  collision-free  manner  from  to  C,2- 


6  Conclusions 

In  this  paper  we  have  shown  that  the  combinatorial 
complexity  of  the  union  of  the  Minkowski  sums  of  k 
convex  polyhedra  in  three  dimensions,  having  pairwise- 
disjoint  interiors,  with  another  convex  polyhedron,  is 
0{nk  log^  k)  (and  Q{7ika{k))  in  the  worst  case),  where 
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n  is  the  overall  complexity  of  the  individual  Minkowski 
sums.  We  have  also  presented  an  efficient  and  rather 
simple  randomized  algorithm  for  computing  the  union 
in  expected  time  0{nk  \og^  k).  Both  the  combinatorial 
bound  and  the  algorithm  have  applications  to  transla¬ 
tional  motion  planning  of  a  convex  polyhedral  object  in 
a  3-dimensional  environment  amidst  polyhedral  obsta¬ 
cles,  and  we  have  also  discussed  these  applications. 

These  results  almost  settle  a  long-standing  open  prob¬ 
lem,  but  raise  a  whole  collection  of  new  open  problems 
(some  of  these  problems  have  already  been  mentioned 
in  earlier  sections).  One  open  problem  is  to  tighten 
the  remaining  gap  between  the  lower  and  upper  bounds 
on  the  complexity  of  the  union.  We  conjecture  that 
the  correct  worst-case  bound  is  Q(7ika(k)).  There  are 
also  the  problems  of  designing  an  efficient  determinis¬ 
tic  algorithm  for  computing  the  union,  and  of  improving 
the  query  performance  of  the  motion  planning  algorithm 
mentioned  above. 

The  more  challenging  and  interesting  open  problems, 
however,  involve  generalizations  and  extensions  of  our 
results  and  techniques.  First,  what  is  the  combinatorial 
complexity  of  the  union  of  Minkowski  sums  T,  B, 
where  the  ^i’s  are  k  convex  polyhedra  with  pairwise 
disjoint  interiors,  and  5  is  a  ball?  Even  the  special 
case  where  the  Ai’s  are  lines  seems  to  be  open;  in  this 
case  we  want  to  bound  the  combinatorial  complexity 
of  the  union  of  k  congruent  infinite  cylinders,  where 
the  conjecture  is  that  this  complexity  is  near-quadratic 
in  k.  This  problem  arises  in  motion  planning,  when 
applying  a  standard  heuristic  of  enclosing  the  moving 
(rigid)  object  by  a  ball,  and  planning  the  motion  of  the 
enclosing  ball. 
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Motion  Planning  amidst  Fat  Obstacles  * 

{extended  abstract) 

A.  Frank  van  der  Stappen  ^  Mark  H.  Overmars  ^ 


Abstract 

We  present  an  efficient  and  simple  paradigm  for  motion  plan¬ 
ning  amidst  fat  obstacles.  The  paradigm  fits  in  the  cell  decom¬ 
position  approach  to  motion  planning  and  exploits  workspace 
properties  that  follow  from  the  fatness  of  the  obstacles.  These 
properties  allow  us  to  decompose  the  workspace,  subject 
to  some  constraints,  rather  than  to  decompose  the  higher¬ 
dimensional  free  space  directly.  A  sequence  of  uniform  steps 
transforms  the  workspace  decomposition  into  a  free  space  de¬ 
composition  of  asymptotically  the  same  (expectedly  small) 
size.  The  approach  applies  to  robots  with  any  fixed  number 
of  degrees  of  freedom  and  turns  out  to  be  successful  in  many 
cases:  it  leads  to  nearly  optimal  O(nlogn)  algorithms  for 
motion  planning  in  2D,  and  for  motion  planning  in  3D  amidst 
obstacles  of  comparable  size.  In  addition,  we  obtain  algo¬ 
rithms  for  planning  3D  motions  among  polyhedral  obstacles, 
running  in  0{n^  log  n)  time,  and  among  arbitrary  obstacles, 
running  in  time  O(n^). 

1  Introduction 

Fatness  turns  out  to  be  an  interesting  phenomenon  in  com¬ 
putational  geometry.  Several  papers  discuss  the  surprising 
impact  of  fatness  of  the  objects  under  consideration  on  com¬ 
binatorial  complexities  and  algorithm  efficiencies.  Fat  ob¬ 
jects  are  compact  to  some  extent,  rather  than  long  and  thin. 
Examples  of  the  influence  of  fatness  are  described  in  papers 
by  Alt  et  al.  [1]  and  Matousek  et  al.  [5]  which  show  that  the 
combinatorial  complexity  of  the  union  of  geometric  figures 
is  low  if  the  figures  are  fat.  Overmars  [6]  presents  an  efficient 
algorithm  for  point  location  in  fat  subdivisions. 
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Robot  motion  planning  considers  the  problem  of  finding  a 
collision-free  path  for  a  robot  in  a  workspace  W  with  ob¬ 
stacles  £  from  an  initial  to  a  final  placement,  i.e.,  a  path  that 
avoids  collision  of  B  with  the  obstacles.  The  motion  planning 
problem  is  often  studied  as  a  problem  in  the  configuration 
space  C,  which  is  the  space  of  parametric  representations  of 
robot  placements.  The  free  space  FP  is  the  subspace  of  C  of 
points  that  represent  placements  of  the  robot  in  which  it  does 
not  intersect  any  obstacle  in  £.  The  problem  of  finding  a 
collision-free  path  for  B  corresponds  to  finding  a  continuous 
curve  in  FP  (or  its  closure)  connecting  the  points  represent¬ 
ing  the  initial  and  final  robot  placement.  The  combinatorial 
complexity  of  FP  largely  influences  the  complexity  of  finding 
such  a  curve. 

In  [1 1],  we  examine  the  role  of  fatness  in  motion  planning. 
We  show  that  the  combinatorial  complexity  of  the  free  space 
FP  is  linear  in  the  number  of  obstacles  if  these  (stationary) 
obstacles  are  fat  and  satisfy  certain  weak  additional  assump¬ 
tions,  whereas  it  would  be  much  higher  if  the  obstacles  are  not 
fat.  The  additional  assumptions  mainly  require  the  obstacles 
and  the  robot  to  be  of  constant  complexity,  and  the  robot  to  be 
not  too  large  compared  to  the  obstacles.  The  obstacle  fatness 
and  the  additional  assumptions  provide  a  realistic  framework 
for  efficient  motion  planning. 

A  question  that  immediately  comes  to  mind  when  consider¬ 
ing  the  combinatorial  result  of  [1 1]  is  whether  this  reduced 
complexity  opens  the  way  to  efficient  motion  planning  algo¬ 
rithms  for  such  realistic  environments.  To  answer  this  impor¬ 
tant  question,  we  have  considered  various  existing  (planar) 
motion  planning  algorithms  and  examined  the  influence  of 
obstacle  fatness  on  their  efficiency.  The  boundary-vertices 
retraction  algorithm  by  Sifrony  and  Sharir  [10]  for  a  ladder 
moving  in  a  planar  workspace  with  polygonal  obstacles  runs 
in  time  0{K  logn),  where  K  is  the  number  of  pairs  of  ob¬ 
stacle  corners  that  lie  less  than  the  length  of  the  ladder  apart. 
Fatness  of  the  obstacles  causes  K  to  be  only  0{n),  whereas 
it  would  be  0{n^)  for  arbitrary  obstacles.  The  boundary  cell 
decomposition  algorithm  by  Avnaim,  Boissonnat,  and  Faver- 
jon  [2],  running  in  time  0{n^  log  n)  for  a  constant  complexity 
polygonal  robot  amidst  arbitrary  polygonal  obstacles,  can  be 
shown  to  run  in  O(nlogn)  time  in  the  realistic  setting  of 
fat  obstacles,  due  to  the  bounded  local  complexity  (see  e.g. 
[9,  10])  of  the  workspace  caused  by  the  fatness.  The  0{n^) 
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algorithm  by  Schwartz  and  Sharir  [7]  for  planning  the  motion 
of  a  ladder  or  a  polygonal  robot  amidst  polygonal  obstacles 
can  be  shown  to  run,  unmodified,  in  time  O(n^)  if  the  ob¬ 
stacles  are  fat,  whereas  a  minor  modification  enhances  the 
efficiency  to  a  running  time  ofO(ri  log?t)  [12,  13], 

Algorithms  for  efficient  motion  planning  in  3D  workspaces 
are  scarce:  approaches  in  contact  space,  like  the  algorithms 
mentioned  above  by  Sifrony  and  Sharir,  and  by  Avnaim, 
Boissonnat,  and  Faverjon,  were  never  shown  to  generalize  to 
higher  dimensions.  General  approaches  to  motion  planning 
(e.g.  by  Schwartz  and  Sharir  [8])  are  computationally  ex¬ 
pensive.  Three-dimensional  workspaces  imply  at  least  three- 
dimensional  configuration  spaces  with  arrangements  defining 
the  free  portions.  Naturally,  the  structure  of  such  higher¬ 
dimensional  arrangements  is  considerably  more  complex  to 
understand,  let  alone  to  subdivide  the  free  arrangement  cells 
into  simple  subcells  or  catch  their  structure  in  some  one¬ 
dimensional  roadmap.  At  this  point,  however,  fatness  comes 
to  our  help  to  provide  us  with  a  very  beneficial  property  of 
the  workspace,  which  in  fact  also  led  to  the  enhanced  perfor¬ 
mance  of  the  planar  motion  planning  algorithms:  the  bounded 
local  complexity  of  the  workspace  implied  by  the  fatness  of 
the  objects  makes  it  possible  to  partition  the  workspace  W 
(rather  than  the  configuration  space)  into  regions  R  such  that 
the  free  part  of  the  configuration  space  cylinder  obtained  by 
lifting  R  into  configuration  space  has  constant  complexity. 
Moreover,  the  bounded  local  complexity  also  establishes  ex¬ 
istence  of  small  partitions  into  such  regions. 

We  formalize  and  exploit  the  workspace  properties  outlined 
above  to  obtain  a  paradigm  for  planning  the  motion  of  a  con¬ 
stant  complexity  robot  moving  amidst  constant  complexity 
fat  obstacles.  The  paradigm  follows  the  cell  decomposi¬ 
tion  approach  to  motion  planning.  The  size  of  the  resulting 
decomposition  of  the  free  space  into  simple  subcells  is  deter¬ 
mined  by  the  size  of  some  constrained  workspace  partition. 
The  running  time  of  algorithms  based  on  the  paradigm  de¬ 
pends  on  the  time  to  find  such  a  partition.  In  Section  2  we 
discuss  some  geometric  preliminaries.  Section  3  explains  the 
paradigm  and  its  context  and  shows  how  its  performance  de¬ 
pends  on  the  size  of  a  constrained  partition  of  the  workspace. 
Section  4  discusses  the  construction  of  small  partitions  for  2D 
workspaces  with  arbitrary  fat  obstacles  and  3D  workspaces 
with  polyhedral  fat  obstacles,  leading  to  0{n\ogn)  and 
logrt)  motion  planning  algorithms  respectively,  inde¬ 
pendent  of  the  number  of  degrees  of  freedom  of  the  robot. 
We  end  the  section  by  mentioning  a  nice  additional  result, 
and,  finally,  draw  some  conclusions  in  Section  5. 

2  Preliminaries:  fatness  and  wrappings 

In  this  section  we  recall  the  definition  of  fatness  introduced 
in  [1 1]  and  formulate  a  lemma  stating  a  useful  object  density 


property  for  a  collection  of  fat  objects  placed  in  More¬ 
over,  we  prove  a  low  complexity  bound  for  arrangements  of 
sufficiently  tight  wrappings  of  fat  objects.  Such  arrangements 
play  an  important  role  throughout  this  paper. 

2.1  Fatness 

Contrary  to  many  other  definitions  of  fatness  in  literature,  the 
definition  given  below  applies  to  general  shapes  in  arbitrary 
dimension.  It  involves  a  parameter  k  which  supplies  a  quali¬ 
tative  measure  of  fatness:  the  smaller  the  value  of  k,  the  fatter 
the  object  must  be. 

Definition  2.1  [fc-fatness] 

Let  E  C  be  an  object  and  let  k  be  a  positive  constant. 
The  object  E  is  k-fat  if  for  all  hyperspheres  S  centered  inside 
E  and  not  fully  containing  E: 

k  ■  volume{E  H  5)  >  volume{S). 

The  definition  forbids  fat  obstacles  to  be  long  and  thin  or  to 
have  long  or  thin  parts. 

The  following  “proximity  lemma”  will  be  crucial  in  the  rest 
of  this  paper.  For  a  sketch  of  the  proof  we  refer  to  [  1 1  ] ,  where 
a  similar  lemma  is  proven. 

Lemma  2.2  Let  8  be  a  set  of  non-intersecting  k-fat  objects 
inTf^  and  let  c  >  0  be  a  constant.  Then  the  number  of  objects 
E  E  S  with  diameter  at  least  5  intersecting  any  hypercubic 
box  with  side  length  c  -  6  is  bounded  by  a  constant  depending 
on  k  and  c. 

In  other  words,  when  all  objects  have  diameter  at  least  <5, 
any  hypercube  with  side  length  c  ■  S  contains  only  a  constant 
number  of  objects.  This  workspace  property  resembles  a 
property  that  is  referred  to  as  bounded  local  complexity  [9]. 
The  motion  planning  paradigm  outlined  in  the  section  will 
actually  work  for  any  set  of  obstacles  satisfying  Lemma  2.2, 
not  only  for  fat  obstacles. 

2.2  Wrappings  of  fat  objects 

Sufficiently  tight  wrappings  of  objects  play  a  crucial  role 
throughout  this  paper.  Apart  from  providing  the  justification 
that  the  paradigm  for  motion  planning  amidst  fat  obstacles 
presented  here  indeed  works,  the  wrappings  also  help  in  find¬ 
ing  efficient  instances  of  the  paradigm. 

An  e-wrapping  of  an  object  E  is  an  enclosing  shape  of  E, 
with  the  property  that  the  distance  from  the  wrapping  to  E 
never  exceeds  e. 

Definition  2.3  [e-wrapping] 

Let  E  C  R'^  and  let  e  E  R"*".  Any  A  satisfying  E  C  A  C 
{  p  G  R^^  I  d{p,  E)  <  e  }  is  an  c -wrapping  of  E. 
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The  complexity  of  a  scene  of  n  disjoint  A;-fat  objects  is  clearly 
0{n).  If  we  expand  the  objects  in  some  way  then  they  will 
start  intersecting,  and  eventually  the  combinatorial  complex¬ 
ity  of  the  scene  will  increase.  We  may  expect,  however, 
that  the  complexity  will  remain  0{n)  if  we  expand  the  fc-fat 
objects  by  some  bounded  amount.  Theorem  2.4  states  the 
circumstances  that  lead  to  this  result.  An  obvious  way  to 
express  a  bound  on  the  expansion  of  an  object  E  is  to  state 
that  the  expanded  object  is  an  f-wrapping  of  the  object  E 
itself,  for  some  bounded  positive  e. 

Theorem  2.4  Let  £  be  a  set  ofn  non-intersecting  k-fat  ob¬ 
jects  in  R'^  of  constant  complexity  each.  Let  6  be  a  lower 
bound  on  the  diameters  of  the  objects  E  £  £  and  let  c  >  0 
be  some  constant.  Assume  that  a  constant  complexity  (c  •  J)- 
wrapping  A{E)  is  given  for  every  object  E  £  £.  Then: 

(a )  the  complexity  of  the  arrangement  A{  A)  of  all  wrapping 

boundaries  dA(E)  is  0(n), 

(b)  every  point  p  £  R"^  lies  inside  at  most  0(1)  wrappings 

ME). 

Proof:  We  assume,  to  prove  the  (a)-part,  that  the  objects 
in  £  are  ordered  by  increasing  size:  Ei,. .  .,E„.  We  intend 
to  count  for  each  object  Ei  the  subspaces  of  dimensions  0  to 
d-l  that  are  defined  by  the  intersection  of  dA{Ei)  and  wrap¬ 
ping  boundaries  dA{Ej)  with  j  >  i.  Two  (c  •  (5)-wrapping 
boundaries  dA{Ei)  and  dA(Ej)  {i  <  j)  can  only  intersect 
if  the  objects  Ei  and  Ej  are  less  than  2c  •  <5  apart.  By  the  fact 
that  5  is  a  lower  bound  on  all  object  diameters,  Ei  and  Ej 
must  also  be  less  than  2c  ■  5 e,  apart,  which  in  turn  implies 
that  Ej  must  have  non-empty  intersection  with  the  hypercube 
with  side  length  (4c  -f  1)  •  having  a  smaller  i!,' -enclosing 
hypercube  with  side  length  Se^  at  its  center  (see  Figure  1 
for  a  2D  example).  Application  of  Lemma  2.2  yields  that 
there  can  only  be  a  constant  number  of  such  Ej ’s,  so  there  is 
at  most  a  constant  number  of  wrapping  boundaries  dA{Ej) 
(j  >  i)  that  intersect  dA(Ei).  By  the  additional  assumption 
that  all  wrappings  have  constant  complexity,  there  is  only  a 
constant  number  of  subspaces  of  dimension  between  0  and 
d-l  defined  by  the  intersection  of  dA{Ei)  and  wrapping 
boundaries  dA(Ej)  (j  >  i).  Adding  the  contributions  of  all 
wrappings  amounts  to  a  total  of  0{n)  subspaces  of  dimen¬ 
sions  0  to  d  —  1  in  the  arrangement  .4(A).  The  linear  bounds 
on  the  number  of  these  subspaces  imply  the  same  bound  of 
0{n)  on  the  number  of  d-faces  in  .4(A),  making  the  total 
combinatorial  complexity  of  the  arrangement  0{n). 

The  (b)-part  follows  immediately  from  the  proof  of  the  (a)- 
part.  Let  Ei  be  the  smallest  object  for  which  the  point  p  £  R'^ 
lies  inside  the  wrapping  A{Ei).  Since  there  is  only  a  con¬ 
stant  number  of  wrappings  of  larger  objects  intersecting  Efs 
wrapping,  the  point  p  can  be  in  no  more  than  a  constant  num¬ 
ber  of  additional  wrappings.  D 


c  ■  5e,  c  ■  Se,  ^e,  c  ■  5e,  c  ■  Se, 

Figure  1:  The  object  Ei  can  be  fully  enclosed  in  a  box 
with  side  length  Ssi  ■  The  (c  •  5)-wrapping  of  Ei  (with  bold 
boundary)  lies  completely  inside  a  concentric  box  with  side 
length  (2c  -f  1)  •  ,  because  5  <  Js. .  A  wrapping  of  some 

other  obstacle  Ej  can  only  intersect  the  wrapping  of  Ei  if  it 
intersects  the  concentric  box  as  well.  By  repeating  the  same 
reasoning,  the  object  Ej  itself  must  then  intersect  the  outer 
concentric  box  with  side  length  (4c  -|-  1)  ■  Ssi- 


3  Motion  planning  amidst  fat  obstacles 

A  number  of  cell  decomposition-based  motion  planning  al¬ 
gorithms  (see  e.g.  [7,  8])  use  a  projection-like  approach  to 
partition  the  free  space,  i.e.,  they  (recursively)  decompose 
a  lower-dimensional  subspace  of  the  configuration  space  C 
and  lift  the  decomposition  regions  into  C.  The  free  part  of  the 
resulting  cylinders  is  subsequently  partitioned  into  a  number 
of  simple  subcells.  The  objective  is  to  keep  the  number  of 
subcells  as  low  as  possible. 

Our  paradigm  for  motion  planning  amidst  fat  obstacles  fits 
in  the  approach  outlined  above.  The  circumstances  that  we 
consider  allow  for  a  base  decomposition  in  the  workspace 
whose  regions  correspond  to  cylinders  in  configuration  space 
in  which  the  free  part  is  easily  decomposable  into  (a  small 
number  of)  simple  subcells.  Below  we  first  summarize  the 
realistic  assumptions  that  guarantee  the  feasibility  of  the  ap¬ 
proach. 

We  consider  a  not  too  large  robot  B  with  a  constant  number  / 
of  degrees  of  freedom  moving  in  a  d-dimensional  Euclidean 
workspace  amidst  n  fc -fat  obstacles  E  £  £.  Furthermore,  we 
assume  that  no  part  of  the  robot  B  can  collide  with  any  other 
part  of  B.  Some  point  O  £  B  is  appointed  to  be  the  refer- 
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ence  point  of  the  robot.  Our  paradigm  applies  to  situations 
where  the  robot’s  configuration  space  C  can  be  defined  as 
the  Cartesian  product  of  its  d-dimensional  workspace  W  and 
some  other  space  D  (of  dimension  at  least  /  —  d), 

C  =  W  X  D, 

such  that  the  position  of  the  robot’s  reference  point  in  the 
robot’s  workspace  is  part  of  the  specification  of  its  placement. 
A  placement  ^  of  the  robot  can  thus  be  written  as  .^  = 
{Z\Y .  Zo),  where  Zw  G  W  =  and  Zo  G  D.  In  cases  of 
a  free-flying  robot  this  assumption  will  be  satisfied. 

The  reach  ps  of  a  robot  B  is  defined  to  be  the  maximum 
distance  from  the  reference  point  O  €  B  to  any  point  in  B  in 
any  placement  Z  of  B.  More  formally; 


Definition  3.1  [reach  ps  of  a  robot  B] 

Let  Z\\j  be  some  arbitrary  position  of  the  reference  point  O 
of  the  robot  B.  Then  the  reach  pg  of  the  robot  B  is  defined  as 


Pb  =  sup  max 

Zd^D  peB[(Zw,.^D)] 


d{p,  Zyi), 


where  B[Z]  stands  for  the  set  of  points  in  W  covered  by  B 
when  placed  at  Z. 


In  other  words,  the  reach  pg  of  a  robot  B  is  the  maximum 
distance  that  any  point  in  the  robot  B  can  ever  have  to  the 
reference  point,  which  is  also  equal  to  how  far  the  robot 
can  reach,  measured  from  its  reference  point.  Naturally,  the 
reach  is  independent  of  the  actual  position  of  the  reference 
point.  Using  the  notion  of  the  reach  of  a  robot,  we  state 
the  additional  assumption  that  the  robot  is  not  too  large:  the 
reach  pg  of  the  robot  B  should  not  exceed  b  ■  Smin  for  some 
reference  point  0  £  B  and  some  constant  b,  where  6min  is  a 
lower  bound  on  the  diameter  of  the  obstacles  in  E. 

If  either  the  obstacles  are  non-fat  or  the  robot  is  arbitrarily 
large,  the  robot  B  with  its  reference  point  O  fixed  at  some 
point  p  G  W  may  be  able  to  touch  all  obstacles  E  £  E.  The 
circumstances  summarized  above,  however,  make  this  im¬ 
possible:  the  robot  is  able  to  touch  only  a  constant  number  of 
obstacles.  (This  follows  quite  easily  from  Theorem  2.4.)  As 
a  result,  the  workspace  can  be  partitioned  into  regions  where 
the  robot  can  only  touch  a  constant  number  of  obstacles.  Note 
that  B  will  definitely  be  unable  to  touch  the  obstacle  E  if  it  is 
further  away  from  E  than  its  reach  pg .  We  define  the  notion 
of  grown  obstacles  to  formalize  this  observation. 


Definition  3.2  [grown  obstacle  G{E,  p)] 

Let  E  be  an  obstacle  in  R'^  and  let  p  £  R"*".  The  p-grown 
obstacle  E  is  defined  as: 

G(U,p)  =  {pGR"|d(p,i?)<p}. 


Clearly,  G{E,  pis)  is  a  pg-wrapping  of  E.  Now  the  robot’s 
reference  point  must  lie  inside  G(U,  pg )  in  order  for  the  robot 


B  to  be  in  contact  with  E.  Because  Pb  <b  ■  Smin,  it  follows 
from  Theorem  2.4  that  the  arrangement  ^(G)  of  grown  ob¬ 
stacle  boundaries  dG(E,  pg)  (E  £  E)  has  complexity  0{n) 
and  each  of  the  arrangement’s  d-faces  is  the  intersection  of 
only  0(1)  grown  obstacles. 

If  we  take  some  arbitrary  subset  R  of  the  workspace  W, 
then  the  subset  will  intersect  a  number  of  grown  obstacle 
regions.  The  set  of  obstacles  for  which  the  grown  obstacle 
region  intersects  the  set  R  will  be  referred  to  as  i?’s  coverage 
Cov(R). 

Definition  3.3  [coverage  Cov{R)] 

Let  RCW  =  R'^. 

Cov{R)  ^  {E  £E\RnG{E,pB)  ^0}. 

A  direct  consequence  of  the  conclusion  that  each  d-face  of 
.4(G)  is  the  intersection  of  only  0(1)  grown  obstacles  is 
that  the  workspace  W  of  the  robot  can  be  partitioned  into 
regions  R  with  \Cov{R)\  =  0(1).  In  a  general  setting  such 
a  partitioning  could  be  impossible  as  their  may  be  points  that 
lie  in  a)(l)  grown  obstacles  simultaneously.  Partitionings 
of  the  robot’s  workspace  into  constant  complexity  regions 
with  constant  size  coverage  provide  a  first  step  towards  a 
decomposition  of  the  free  space  into  constant  complexity 
subcells.  We  refer  to  workspace  partitions  of  this  kind  as 
cc-partitions  (constant-size  coverage,  constant  complexity). 

Definition  3.4  [cc-partition] 

A  cc-partition  V  of  a  workspace  W  with  obstacles  E  is  a  par¬ 
tition  of  W  into  regions  R  satisfying  the  following  additional 
constraints: 

•  |Gou(/?)|  =  0(1), 

•  R  has  constant  complexity. 

The  cc-partition  Vw  of  the  workspace  W  can  be  lifted  into 
a  cylindrical  decomposition  of  the  configuration  space  G  by 
taking  the  Cartesian  product  of  the  partition  Uw  and  the 
space  D.  The  specific  properties  of  a  region  R  £  Uw,  i-c.,  its 
constant  complexity  and  constant-size  coverage,  are  shown 
below  to  imply  a  constant  descriptional  complexity  of  the  free 
part  FP  n  (i?  X  D)  of  the  configuration  space  cylinder  Rx  D. 
The  maximal  connected  components  of  FP  Cl  (R  x  D)  are 
adequate  subcells  in  a  cell  decomposition  of  the  free  space  as 
they  allow  for  easy  path  finding  within  their  interior,  due  to 
their  constant  complexity  and  connectedness.  Free  subcells 
in  a  single  configuration  space  cylinder  R  x  D  can  only  be 
adjacent  to  free  subcells  in  a  cylinder  implied  by  a  region 
R!  that  is  adjacent  to  R  in  the  cc-partition  of  the  workspace 
W  As  both  subcells  have  constant  complexity,  their  shared 
boundary  must  also  have  constant  complexity  and,  as  a  result, 
allow  for  simple  crossing  from  one  subcell  into  another  one. 
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Algorithm  FatMot  outlined  below  exploits  the  ideas  summa¬ 
rized  in  the  previous  paragraph.  The  algorithm  consists  of 
two  parts.  The  first  part  is  a  computation  in  the  workspace 
that  aims  at  finding  a,  hopefully  small,  cc-partition  Vw  of  the 
workspace  W  with  obstacles  E.  More  precisely,  it  computes 
a  graph  (Vw ,  consisting  of  a  set  Vw  of  closed  regions 
and  a  set  =  {  (i?,  B!)  G  Vw  x  Vw  |  dR  n  dR!  0  } 
of  adjacencies  of  Vw’s  regions V  Finally,  the  graph  must  be 
accompanied  by  the  computed  function  Cov  ;  Vw 
linking  the  coverage  Cov{R)  to  each  region  R  G  Vw-  The 
actual  “shape”  of  the  first  part  of  Algorithm  FatMot  depends 
on  the  specific  workspace  W  and  type  of  obstacles  in  £  un¬ 
der  consideration,  as  does  the  computation  time  T{n)  that  is 
spent  in  this  phase.  The  computation  time  T{n)  turns  out  to 
highly  affect  the  efficiency  of  the  entire  Algorithm  FatMot, 
hence,  a  thorough  investigation  of  instances  of  W  and  £  is 
justified.  This  is  the  subject  of  the  next  section. 

The  remaining  steps  of  Algorithm  FatMot  take  place  in  con¬ 
figuration  space,  where  the  cc-partition  graph  ( Vw ,  Ew)  and 
the  function  Cov  are  used  to  deduce  a  connectivity  graph 
CG  =  (VcyEc)  consisting  of  a  set  of  constant  complexity 
regions  Vc  that  partition  the  set  of  free  placements  FP,  and 
the  set  Ec  =  {  {A,  A')  e  Vc  x  Vc  \dAn  dA'  ^  <ll }.  The 
sizes  of  the  sets  Vc  and  Ec  are  of  the  same  order  of  magni¬ 
tude  as  the  sets  Vw  and  Ey/  respectively:  \Vc\  =  0(|Vw|). 
\Ec\  =  0(|i^w|)-  The  outline  of  the  second  part  is,  con¬ 
trary  to  the  first  part,  independent  of  the  workspace  W  and 
obstacles  £  under  consideration. 

Algorithm  FatMot 

Find  a  cc-partition  graph  (Vw,  ■S'w)  and  compute  Cov, 

Vc  :=  0; 

Ec  ~  0; 

for  all  R  G  Vw  do 

1.  Desc{R)  :=  0; 

2.  for  all  E  G  Cov{R)  do 

for  all  features  <j)  oiB  and  $  of  S'  do 
compute  constraint  surface 

3.  compute  the  arrangement  A  of  surfaces 

4.  use  A  to  compute  FP  fl  (i?  x  D); 

5.  for  all  maximal  connected  components  A 
ofFPn(i?x  D)  do 

5.1.  Vc  ■■=  VcU{A]-, 

5.2.  Desc{R)  :=  Desc{R)  U  {A}; 
for  all  (i?i ,  R2)  G  Ew  do 

for  all  Ai  G  Desc{Ri)  A  A2  G  Desc{R2)  do 

if  dAi  n  dA2  7^  0  then  Ec  :=  Ec  U  {(^1,^2)}. 

Figure  2  gives  a  pictorial  explanation  of  Algorithm  FatMot. 


^  We  do  not  distinguish  between  a  node  in  the  graph  and  the  region  that  is 
represented  by  that  node,  hence  if  we  speak  of  a  node  R  (or  A)  we  actually 
mean  the  node  representing  the  region  R  (or  the  free  subcell  A). 


Figure  2:  The  relation  between  the  cc-partition  graph 
( Vw ,  Evj)  in  the  workspace  W  at  the  top,  and  the  connectiv¬ 
ity  graph  {Vc,Ec)  in  the  configuration  space  at  the  bottom. 
Each  node  R  €  V\n  defines  at  most  (9(1)  nodes  A  G  Vc, 
collected  in  a  set  Desc{R).  Two  nodes  A  and  A'  in  Vc  can 
only  be  connected  if  the  corresponding  nodes  R  and  R'  in 
Vw  are  connected,  so  Ai  may  be  connected  to  all  nodes  in 
Desc{R'),  but  Ai  and  A4  cannot  be  connected. 


Crucial  to  the  validity  of  the  approach  sketched  by  Algorithm 
FatMot  is  a  rather  simple  observation  regarding  constraint 
surfaces  in  configuration  space.  Let  us  consider  the  surface 
of  placements  in  which  a  robot  feature  (p  touches  a 
feature  $  of  some  obstacle  E.  A  substantial  part  of  the 
representation  of  a  robot  placement  is  the  position,  in  the 
workspace  W,  of  the  robot’s  reference  point.  This  position 
must  be  less  than  pb  away  from  E,  otherwise  B  would  be 
unable  to  touch  E.  As  a  result,  a  point  p  =  {pw,Pc)  C 
W  X  n  lying  on  the  constraint  surface  satisfies  pw  G 

G{E,pb). 

Lemma  3.5  Let  (fi  a  robot  feature  and  let  $  be  a  feature  of  an 
obstacle  E,  such  that  the  placements  in  which  f  is  in  contact 
with  $  define  a  hypersurface  f,f,^is>  In  C,  then: 

f<P,i  C  G{E,  Pb)  X  D. 

An  implication  of  Lemma  3.5  is  that  a  configuration  space 
cylinder  R  x  D  can  only  be  intersected  by  constraint  sur¬ 
faces  for  which  $  is  a  feature  of  some  E  G  Cov(R). 
By  the  constant-size  coverage  of  a  region  R  G  Vw  and  the 
constant  complexity  of  both  B  and  each  obstacle  E  £  £,  the 
number  of  constraint  surfaces  intersecting  a  cylinder 
Rx  D  with  Re  Vw ,  is  bounded  by  a  constant.  Since  each 
constraint  surface  is  assumed  to  have  constant  complexity. 
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the  entire  arrangement  of  constraint  surfaces  inside  R  x  D 
has  constant  complexity.  Together  with  the  constant  com¬ 
plexity  of  the  cylinder  R  x  D  itself  -  following  from  R’s 
constant  complexity  -  this  implies  the  constant  complexity  of 
FP  n  (i?  X  D).  Steps  1-5  in  the  first  for-loop  compute  the 
0(1)  (constant  complexity)  maximal  connected  components 
ofFPn(i?x£)).  A  possible  way  to  compute  this  free  part  (in 
constant  time)  could  be  to  apply  the  techniques  from  [8]  to 
the  constant  number  of  constraint  hypersurfaces  intersecting 
the  cylinder  R  x  D.  Each  step  is  easily  verified  to  run  in 
0(1)  time.  The  set  Desc(R)  stores  the  nodes  in  (Vc,  Ec) 
that  correspond  to  free  subcells  \nRx  D.  Note  that  the  set 
Desc{R)  has  constant  cardinality. 

Two  free  subcells  Ai  and  A2  can  only  be  adjacent  if  their  con¬ 
taining  cylinders  Ri  x  D  and  R2  x  D  are  adjacent  in  C  and, 
hence,  Ri  and  Ro  are  adjacent  in  W.  An  adjacency  {Ri ,  Ro) 
gives  rise  to  only  a  constant  number  of  adjacencies  of  nodes 
Ai  and  A2  in  Desc{Ri)  and  Desc(R2)  respectively  due  to 
the  constant  cardinality  of  Desc{Ri)  and  Desc(R2).  Two 
free  subcells  Ai  and  A2  in  adjacent  cylinders  are  adjacent 
if  they  share  a  common  boundary.  Such  a  common  bound¬ 
ary  has  constant  complexity  since  both  involved  free  subcells 
have  constant  complexity.  The  nested  for-loop  in  the  second 
for-loop  takes  constant  time  by  the  above  considerations. 

The  output  of  Algorithm  FatMot  is  a  decomposition  of  the 
free  space  into  constant  complexity  subcells  that  are  collected 
in  a  set  Vc.  The  connectivity  of  the  subcells  is  captured  in 
a  set  Ec  of  pairs  of  adjacent  subcells.  The  running  time  of 
FatMot  is  easily  seen  to  be  0{T{n)  -h  |Vvv|  +  |£'w|)-  So, 
the  efficiency  of  Algorithm  FatMot  is  fully  determined  by 
the  size  of  the  cc-partition  graph  (Pw ,  fi'w)  and  the  time  to 
compute  the  graph  and  the  function  Cov.  Since  the  time 
T{n)  to  explicitly  compute  a  cc-partition  graph  (Fw,  £'w) 
and  the  function  Cov  dominates  the  time  0(|Fw|  +  |£'w|)  to 
simply  report  the  graph  we  may  conclude  that  the  T(n)-factor 
is  dominant. 

Theorem  3.6  Algorithm  FatMot  computes  a  cell  decompo¬ 
sition  of  the  free  space  of  size  0(|Fw|  +  l^^wi)  in  time 
0{T{n)),  where  T{n)  is  the  time  to  compute  a  cc-partition 
graph  (Fw,  i?w)  and  the  function  Cov  :  Fw  — >  Rif). 

The  size  and  computing  time  of  a  cc-partition  graph  influence 
both  the  size  of  the  cell  decomposition  that  is  obtained  and 
the  time  necessary  to  compute  it.  A  small  and  efficiently 
computable  partition  is  therefore  crucial  to  the  success  of  our 
method.  At  this  stage,  one  may  expect  the  method  to  be  rather 
efficient  since  the  paradigm  reduces  the  problem  of  finding  a 
cell  decomposition  of  the  free  part  of  the  configuration  space 
to  the  problem  of  finding  some  constrained  partition  of  the 
lower-dimensional  workspace.  Finding  small  and  efficiently 
computable  cc-partitions  is  the  subject  of  the  next  section. 

Finally,  we  mention  that  the  problem  of  solving  a  mo¬ 
tion  planning  query  “find  a  free  path  from  a  placement 


Zi  —  {Ziw,Zid)  to  another  placement  .^2  =  {Z2W ,  Z2d)” 

basically  reduces  to  a  point  location  query  with  ^iw  and 
Z2W  in  Fw  to  find  Ri  3  Ziw  and  R2  3  Z2w.  Hence, 
only  point  location  in  the  (2D  or  3D)  workspace  is  required, 
which  can  be  done  efficiently.  After  that  it  takes  0(1)  to  find 
Ai  3  Zi  using  Desc{Ri)  and  A2  3  Z2  using  Desc{R2), 
followed  by  a  search  in  the  connectivity  graph  (Vc ,  Ec)  for 
a  sequence  of  subcells  connecting  Ai  to  A2.  The  constant 
complexities  of  the  subcells  and  of  the  common  boundaries 
of  pairs  of  adjacent  subcells  facilitate  the  transformation  of 
the  subcell  sequence  into  an  actual  free  path  for  B. 

4  Small  and  efficiently  computable 
workspace  partitions 

In  this  section  we  consider  the  problem  of  finding  small  cc- 
partitions  of  workspaces  W  with  obstacles  F.  Different  types 
of  workspaces  and  obstacles  lead  to  different  choices  for  a  cc- 
partition.  Below  we  first  discuss  a  general  partition  scheme 
for  a  2D  Euclidean  workspace  with  constant  complexity  ob¬ 
stacles.  The  cc-partition  has  optimal  size  and  is  computable 
by  a  simple  plane  sweep.  Next,  we  move  on  to  3D,  where 
we  consider  the  case  of  polyhedral  obstacles.  In  addition, 
we  mention  some  results  obtained  for  general  obstacles  and 
obstacles  from  a  bounded  size  range. 

4.1  Two-dimensional  Euclidean  workspace 

The  proposed  partition  scheme  for  2D  workspaces,  a  ver¬ 
tical  decomposition  of  the  arrangement  of  grown  obstacle 
boundaries,  works  for  any  kind  of  obstacles,  as  long  as  their 
boundaries  consist  of  a  constant  number  of  constant  degree  al¬ 
gebraic  curves.  The  approach  differs  from  the  one  presented 
in  [13]  which  is  a  modified  version  of  Schwartz  and  Sharir’s 
ladder  algorithm  [7]  and  as  such  dedicated  to  a  polygonal 
robot  and  polygonal  obstacles. 

The  first  step  towards  a  cc-partition  computes  the  grown  ob¬ 
stacle  boundaries  dG{E,  ps)  for  all  obstacles  E  e  S.  Each 
boundary  is  obtained  in  0(1)  time  leading  to  0(n)  time  for 
computing  all  boundaries.  As  a  preparation  for  the  next  step, 
each  grown  obstacle  boundary  dG(E,  ps )  is  cut  up  into  0(  1) 
arcs  which  are  maximal  connected,  a:-monotonous  boundary 
parts  having  no  vertices  in  their  interiors.  Note  that  arc 
endpoints  are  generally  incident  to  two  arcs.  For  future  pur¬ 
poses  we  label  each  arc  from  dG{E,pB)  with  the  obstacle 
name  ‘A”.  By  Theorem  2.4,  the  resulting  0{n)  arcs  define 
only  0{n)  (yet  unknown)  arc  intersections  and  partition  the 
workspace  into  regions  with  constant-size  coverage. 

In  a  second  step  we  compute  the  vertical  decomposition  of  the 
arrangement  .4(G)  of  grown  obstacle  boundaries,  by  sweep- 
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ing  the  plane  [3]  with  the  arcs  with  a  vertical  line,  meanwhile 
extending  walls  in  upward  and  downward  vertical  direction 
from  every  arc  endpoint  (known  in  advance)  and  all  arc  in¬ 
tersections  (to  be  determined  during  the  sweep).  The  arcs 
and  walls  and  their  endpoints  form  a  planar  graph  consisting 
of  0{n)  edges  and  vertices,  subdividing  the  plane  into  0{n) 
constant  complexity  regions:  the  regions  of  Vw  •  Theseti/vv 
of  pairs  of  adjacent  regions  has  size  0{n)  as  well,  as  each 
adjacency  is  represented  by  one  of  the  0{n)  edges  in  the 
graph. 

Lemma4.1  \Vw\  =  0(n)  and\E-w\  =  0{n). 

The  sweep  must  not  only  compute  the  regions  of  Vw,  but 
also  the  coverages  Cov{R)  of  the  regions  R  G  Vw,  and 
the  region  adjacencies  of  Ew-  Throughout  the  sweep  we 
maintain  as  invariant  that  all  regions,  coverages,  and  adja¬ 
cencies  left  of  the  sweep-line  are  computed.  The  sweep-line 
status  and  event  point  schedule,  both  stored  in  appropriate 
data  structures,  facilitate  the  maintenance  of  the  invariant. 
The  sweep-line  status  is  a  top-to-bottom  cross-section  of  the 
vertical  decomposition  of  A(G)  and  the  vertical  sweep-line. 
This  alternating  sequence  of  arcs  and  regions  accompanied 
by  their  coverages  is  stored  in  a  binary  tree.  The  event  point 
schedule  is  the  ordered  sequence  (by  a:-coordinate)  of  all  arc 
endpoints  and  the  potential  intersections  of  consecutive  arcs 
in  the  sweep-line  status.  A  priority  queue  stores  the  schedule. 

The  processing  of  the  next  event,  i.e.,  the  extension  of  vertical 
walls  from  an  intersection  point  or  endpoint,  marks  the  end 
of  at  most  three  consecutive  regions  in  the  sweep-line  status. 
These  regions  and  their  coverages  are  reported  to  maintain 
the  invariant.  Moreover,  the  regions  are  deleted  from  the 
sweep-line  status  and  replaced  by  the  (at  most  three)  newly 
started  regions.  Appropriate  adjustments  w.r.t.  the  separat¬ 
ing  arcs  are  made  as  well.  In  addition,  we  must  report  the 
adjacencies  of  the  new  regions  and  compute  their  coverages. 
These  coverages  are  easily  computed  from  the  coverages  of 
the  ended  regions  and  the  arc  labels  (see  [14]  for  details  on 
the  update).  Potential  intersections  of  the  0(1)  newly  created 
pairs  of  consecutive  arcs  must  be  inserted  in  the  event  queue. 
In  summary,  the  processing  of  an  event  requires  a  constant 
number  of  searches,  deletions,  and  insertions  on  a  binary  tree, 
and  a  constant  number  of  insertions  in  the  event  queue.  All 
listed  operations  take  0(log  n)  time,  so  the  processing  of  all 
events,  and,  hence,  the  entire  sweep,  takes  0(n  log  n)  time. 

Lemma  4.2  T(n)  =  O(nlogn). 

Substitution  of  the  vertical  decomposition  in  the  first  step  of 
Algorithm  FatMot  leads  to  a  cell  decomposition  of  size  0{n), 
computed  in  0{n  logn)  time. 

Theorem  4.3  The  motion  planning  problem  amidst  fat  ob¬ 
stacles  in  can  be  solved  in  time  0{n  log  n). 


4.2  Three-dimensional  Euclidean  workspace 

The  problem  of  finding  cc-partitions  for  three-dimensional 
Euclidean  workspaces  is  much  harder  than  its  two- 
dimensional  equivalent,  which  is  illustrated  by  the  relatively 
small  number  of  results  on  partitioning  3D  arrangements. 
Below  we  first  consider  the  case  of  polyhedral  obstacles  in 
some  detail,  and  then  mention  some  results  for  two  other 
classes  of  obstacles. 

Like  in  the  two-dimensional  case,  we  first  partition  the 
workspace  into  regions  with  constant-size  coverage,  and  sub¬ 
sequently  subdivide  these  regions  into  constant  complexity 
regions.  Instead  of  using  the  grown  obstacle  boundaries  to 
achieve  the  first  decomposition  we  now  use  the  boundary 
of  a  polyhedral  outer  approximation  of  these  grown  obsta¬ 
cles,  which  still  achieves  a  decomposition  of  W  into  regions 
of  constant-size  coverage,  but  additionally  allows  for  subse¬ 
quent  application  of  a  vertical  trapezoidation  algorithm  (to 
the  triangulated  polyhedral  arrangement)  to  obtain  O(n^) 
constant  complexity  regions.  A  tight  outer  approximation 
of  the  grown  obstacle  G{E,  ps)  is  the  Minkowski  difference 
H{E,  pb)  of  E  and  a  cube  with  side  length  2pB’- 

H{E,pb)  = 

£'0(0  •  {(a:i,a:2,a:3)  £  -  Pb  <  a;i,a;2,a;3  <  pe}), 

where  0  represents  the  Minkowski  difference  operator  and 
0  €  50(3)  is  some  arbitrary  rotation  matrix  to  establish  that 
none  of  the  cube’s  faces  is  vertical,  thus  preventing  future 
vertical  triangles.  Computation  of  a  (constant  complexity) 
Minkowski  difference  H{E,  ps)  takes  0(1)  time. 

The  polyhedral  arrangement  A{H)  of  all  Minkowski  differ¬ 
ence  boundaries  dH{E,  ps),  £  G  5,  satisfies  properties  that 
are  similar  to  the  properties  of  the  arrangement  .4(0)  in  the 
2D  case. 

Lemma  4.4  (a)  The  complexity  of  A{H)  is  0{n), 

(b)  \Cov{A)\  =  0{1)  for  each  3-face  A  inA(H). 

Proof:  Each  Minkowski  difference  H (£,  ps)  is  a  t/3  •  pe¬ 
wrapping  of  £,  and,  because  pg  <  b-Smin^AsoahV^-Smin- 
wrapping  of  £.  Since  Smin  is  a  lower  bound  on  the  diameter 
of  all  obstacles  £  G  5,  the  (a)-part  follows  immediately 
from  Theorem  2.4.  Moreover,  the  same  theorem  implies 
that  each  point  p  G  W  lies  in  0(1)  Minkowski  differences 
H(E,  Pb)  simultaneously,  which  implies  that  a  3-face  A  lies 
in  the  intersection  of  0(1)  Minkowski  differences.  Because 
H{E,pb)  2  G{E,pb),  Cov{A)  is  a  subset  of  the  set  of 
obstacles  £  for  which  H{E,  pb)  encloses  A  and,  hence, 
\Cov{A)\  =  0(1).  a 

A  naive,  but  sufficiently  efficient,  computation  of  the  0(n) 
complexity  arrangement  .4(£)  takesO(n^)  timeandasubse- 
quent  constrained  triangulation  of  all  0(n)  planar  Minkowski 
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faces  and  the  segments  defined  by  intersections  with  other 
faces  can  be  performed  in  near-linear  time.  The  constrained 
triangulation  of  the  faces  of  A{H)  does  not  introduce  new 
vertices.  The  result  is  that  we  end  up  with  a  collection 
of  non-intersecting  triangles.  Although  the  triangles  are  non¬ 
intersecting,  they  do  touch  each  other,  i.e.,  they  share  edges 
and  vertices. 

De  Berg,  Guibas,  and  Halperin  [4]  present  an  algorithm  for 
computing  the  vertical  trapezoidation  of  an  arrangement  of 
triangles  in  3-space.  In  the  restricted  case  of  non-intersecting 
triangles,  their  algorithm  leads  to  a  decomposition  of  3-space 
into  O(n^)  constant  complexity  regions.  The  computation 
takes  0{n~  log  n)  time.  Below  we  first  briefly  discuss  the 
algorithm  and  the  structure  of  the  trapezoidation.  After  that, 
we  show  that  application  of  the  algorithm  to  the  set  of  tri¬ 
angles  leads  to  a  cc-partition  graph  {V\\r,Ew),  with 

|I/\v|  =  0(n“)  but,  unfortunately,  a  larger  set  Ew.  To  rem¬ 
edy  this  we  will  replace  each  triangle  by  a  flat  tetrahedron 
and  then  show  that  the  resulting  set  f’(T^(H))  of  tetrahe- 
dra  solves  the  problem  as  its  vertical  trapezoidation  leads  to 
iVwl  =  l^^wl  =  O(n^)  and  T(n)  =  O(n^logn). 

The  computation  of  the  vertical  trapezoidation  proceeds  in 
two  steps.  The  first  step  results  in  the  vertical  decom¬ 
position  of  the  arrangement  of  non-intersecting  triangles. 
The  vertical  decomposition  partitions  the  arrangement  into 
regions  of  points  with  equal  vertical  visibility  w.r.t.  T, 
both  in  upward  and  downward  direction,  by  extending  walls 
in  vertical  direction  from  all  triangle  edges.  A  region 
in  the  vertical  decomposition  is  a  maximal  connected  set 
{a;  G  ]R^|up(a;)  =  A  down(x)  =  G}  where  ti,t2  G  T 
and  iip{x)ldoivn{x)  denotes  the  first  triangle  in  T  that  is  hit 
by  the  vertical  ray  emanating  from  x  in  upward/downward 
direction.  The  floors  and  ceilings,  i.e.,  the  bounding  faces 
in  vertical  direction,  of  the  resulting  regions  are  parts  of  tri¬ 
angles.  More  precisely,  they  are  2-faces  in  an  arrangement 
of  ending  walls  on  a  triangle  side.  Note  that  the  floor  and 
ceiling  of  a  vertical  decomposition  region  have  equivalent 
projections  onto  the  (x,  y)-plane.  The  second  step  refines  the 
vertical  decomposition  into  a  vertical  trapezoidation,  consist¬ 
ing  of  regions  bounded  by  six  planar  faces,  by  adding  walls 
parallel  to  the  {x,  z)-plane.  The  refining  is  obtained  through 
a  planar  vertical  decomposition  of  the  floor  and  ceiling  of 
the  region,  in  which  segments  are  extended  within  the  floor 
or  ceiling  parallel  to  the  y-axis  from  every  vertex  on  the 
boundary  of  the  floor  or  ceiling.  These  segments  delimit  the 
additional  walls  parallel  to  the  (a;,  z)-plane.  The  resulting 
trapezoidal  faces  are  the  floors  and  ceilings  of  the  vertical 
trapezoidation  regions. 

The  vertical  trapezoidation  algorithm  results  in  0(rr  )  con¬ 
stant  complexity  regions  in  time  0{n~  logn).  The  output 
consists  of  the  input  triangles  accompanied  by  the  vertically 
decomposed  arrangements  of  ending  walls.  These  arrange¬ 
ments  implicitly  contain  all  relevant  information  regarding 


the  vertical  trapezoidation.  The  representation  of  a  region 
follows  immediately  from  its  floor  and  ceiling.  Adjacent  re¬ 
gions  in  the  trapezoidation  share  a  common  boundary  that 
is  either  embedded  in  a  vertical  wall,  either  extended  from 
a  triangle  edge  or  added  during  the  trapezoidation  phase,  or 
embedded  in  a  triangle.  If  the  triangles  are  in  general  position 
then  it  can  be  shown  that  regions  that  share  a  vertical  boundary 
face  must  have  either  adjacent  floors  or  adjacent  ceilings  in 
some  vertically  decomposed  triangle  arrangement.  Regions 
that  share  a  common  boundary  that  is  embedded  in  a  triangle, 
have  floor  and  ceiling  with  non-empty  intersection  on  either 
side  of  a  triangle.  Hence,  the  representation  of  the  regions  as 
well  as  the  adjacency  information  is  completely  contained  in 
the  vertically  decomposed  triangle  arrangements. 

Adjacent  regions  in  the  decomposition  share  a  face  that  is 
either  a  part  of  a  vertical  wall  or  a  part  of  some  triangle.  It 
turns  out  that  the  number  of  adjacencies  of  the  latter  type 
is  more  than  0{n^).  Walls  extended  from  other  triangles’ 
edges  end  on  both  sides  of  a  triangle  t  G  T^{h)  and  define 
arrangements  of  line  segments  on  these  sides.  The  complex¬ 
ity  of  a  single  arrangement  and  its  vertical  decomposition  can 
be  as  high  as  O(n^),  and,  hence,  the  number  of  2-faces  de¬ 
fined  by  the  vertically  decomposed  arrangement  is  bounded 
by  O(n^).  These  2-faces  are  the  floors  (or  ceilings)  of  the 
vertical  trapezoidation  regions.  Each  non-empty  intersection 
of  2-faces  on  either  side  of  a  single  triangle  corresponds  to 
an  adjacency  of  two  vertical  trapezoidation  regions.  There 
are  pairs  of  2-faces  on  opposite  sides  of  a  triangle 

and  it  seems  impossible  to  bound  the  number  of  non-empty 
intersections,  and.  hence,  the  number  of  region  adjacencies, 
by  anything  close  to  O(n^). 

An  elegant  way  to  avoid  the  problem  outlined  above  is  by 
replacing  the  set  of  triangles  Tj^{h)  by  a  set  of  tetrahedra 
F{Ta{H))  that  are  sufficiently  flat  to  prevent  them  from  in¬ 
tersecting.  The  tetrahedra  have  the  initial  triangles  of  T^(h) 
as  one  of  their  faces.  The  triangular  faces  of  the  tetrahedra 
in  F{T^(f{))  form  again  a  set  of  non-intersecting,  though 
touching,  triangles.  This  set  is  four  times  larger  than  T_a(h) 
(and  therefore  still  of  size  0{n))  but,  more  importantly,  it 
has  the  simple  though  very  beneflcial  property  that  one  of  the 
sides  of  each  triangle  faces  the  interior  of  a  tetrahedron.  As  a 
result,  the  number  of  walls  ending  on  the  interior-facing  side 
of  a  triangular  face  t  of  r  G  F{Tj,^h))  is  constant  as  such 
walls  can  only  be  extended  from  edges  of  the  tetrahedron  r. 

We  first  discuss  how  to  replace  each  triangle  t  G  Ta(h)  by 
a  tetrahedron  r  having  t  as  one  of  its  faces,  such  that  the 
resulting  tetrahedra  are  non-intersecting.  Recall  that  Ta[h) 
is  a  set  of  triangles  that  may  share  a  vertex  or  an  edge  but 
do  not  intersect  each  others’  interiors.  Let  e  be  the  minimum 
distance  between  any  pair  of  disjoint  triangles.  Furthermore, 
let  7i  be  the  minimum  over  all  dihedral  angles  between  pairs 
of  (touching)  triangles  that  share  an  edge  and  72  be  the  min¬ 
imum  over  all  angles  between  the  supporting  plane  of  a  tri- 
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angle  t  G  Ta{h)  and  an  edge  of  another  triangle  t'  incident 
to  a  vertex  of  t.  We  define  7  =  min(7i  ,72)-  We  construct  a 
set  F{Tj^^h))  by  applying  the  following  procedure  to  every 
t  e  T.  Let  vi  ,V2,V3  be  the  vertices  of  t. 

1.  The  planes  7ri,7r2,7r3  through  ui,V2,r'3  that  make  a 
positive  angle  7/2  with  the  top  side  (facing  z  =  00)  of 
t  intersect  in  a  point  v.  If  the  distance  from  u  to  f  is 
strictly  less  than  e,  then  the  tetrahedron  is  defined  by  the 
vertices  v,  vi,V2,  va- 

2.  If  the  distance  from  u  to  f  is  at  least  e,  then  we  take  the 
half-line  h  through  v  and  perpendicular  to  and  ending 
on  t.  The  tetrahedron  is  defined  by  vi,V2,  V3  and  the 
unique  point  v'  on  the  half-line  h  with  distance  e/2  to  t. 

The  following  lemma  now  holds  for  the  resulting  set 
F{Ta{H))  of  tetrahedra.  (For  a  proof,  we  refer  to  [14].) 
We  denote  the  (open)  interior  of  a  closed  set  r  by  mt(r). 

Lemma  4.5  Vr,  r'  G  F(Tj_(^h))  ■  iniij)  H  int{T')  =  0. 

Application  of  the  trapezoidation  algorithm  to  the  triangu¬ 
lar  faces  of  F{Ta{h))  yields  a  vertical  trapezoidation  of 
complexity  O(n^)  and  therefore  consisting  of  O(n^)  regions 
with  constant  complexity.  The  trapezoidation  regions  are  ap¬ 
pointed  to  be  the  regions  of  Vw .  The  coverage  of  each  region 
Re  Vw  clearly  has  constant  size  as  is  a  subset  of  a  3-face 
A  in  the  arrangement  A{H). 

Let  us  now  consider  the  number  of  adjacencies  of  regions  in 
Vw-  The  complexities  of  the  vertically  decomposed  triangle 
arrangements  is  crucial  in  determining  the  number  of  adja¬ 
cencies,  so  we  first  study  these  complexities.  The  complexity 
of  the  entire  vertical  trapezoidation  is  O(n^).  As  a  result  the 
cumulative  complexity  of  all  vertically  decomposed  triangle 
arrangements  is  O(n^)  as  well.  Each  triangle  t  G  F{T^(^h^) 
has  a  side  facing  the  interior  of  the  tetrahedron  r  it  belongs  to 
and  a  side  facing  outward.  The  complexity  rrit  of  the  verti¬ 
cally  decomposed  arrangement  on  the  inward-facing  side  of 
t  is  constant,  since  only  a  constant  number  of  walls,  extended 
from  one  of  the  edges  of  r,  ends  on  this  side:  =  0(1), 

for  all  t.  The  complexity  rit  of  the  vertically  decomposed 
arrangement  on  the  outward-facing  side  of  a  single  triangle 
t,  however,  can  be  as  high  as  0(n2).  The  number  of  ad¬ 
jacencies  of  2-faces  in  a  decomposed  triangle  arrangement 
is  of  the  same  order  of  magnitude  as  the  complexity  of  the 
decomposed  arrangement,  hence,  the  numbers  of  adjacen¬ 
cies  on  the  inward  and  outward-facing  sides  of  a  triangle  t 
are  0(to()  and  0(^0  respectively,  adding  up  to  a  total  of 
0{mt  -t-  Tii)  adjacencies  of  2-faces  on  t.  We  have  stressed 
earlier  that  each  pair  of  adjacent  trapezoidation  regions  has 
either  adjacent  floors  or  ceilings  in  some  vertically  decom¬ 
posed  triangle  arrangement,  or  intersecting  floor  and  ceiling 
on  opposite  triangle  sides.  The  number  of  adjacencies  of  the 
first  type  equals  0{mt  +  nt)  =  O(n^),  as  it  is  of  the 
same  order  of  magnitude  as  the  cumulative  complexity  of  the 


vertically  decomposed  triangle  arrangements.  The  number 
of  non-empty  intersections  on  opposite  sides  of  a  triangle  t  is 
0{mt  ■  nt)  =  0{nt)  due  to  mt  =  0(1).  Hence,  the  number 
ofadjacencies  ofthe  second  type  equals  0(nt)  =  O(n^). 
In  summary,  the  total  number  of  adjacencies  in  Ew  is  O(n^). 

Lemma  4.6  |Vw|  =  O(n^)  and  \Ew  \  =  O(n^). 

After  applying  the  0(n^  log  n)  trapezoidation  algorithm,  we 
traverse  the  resulting  O(n^)  constant  complexity  regions  us¬ 
ing  the  decomposed  triangle  arrangements  to  extract  explicit 
descriptions  of  the  regions,  regions  adjacencies,  and  the  cov¬ 
erages  of  the  regions.  The  latter  computation  requires  ad¬ 
ditional  explanation.  Instead  of  attempting  to  compute  the 
constant-size  coverages  directly,  we  first  compute  the  set 
of  intersecting  Minkowski  differences  H(E,  pe),  using  the 
fact  (throughout  the  traversal)  that  adjacent  regions  are  inter¬ 
sected  by  the  same  set  of  Minkowski  difference  unless  they 
are  separated  by  a  triangle  t  that  is  part  of  some  Minkowski 
difference  boundary  dH {E,  pb))  in  which  case  the  sets  of  in¬ 
tersecting  Minkowski  differences  differ  by  exactly  {£'}.  The 
set  of  Minkowski  differences  H{E,  ps)  intersecting  a  region 
G  Vw  is  a  superset  of  the  set  of  grown  obstacles  G{E,  pb) 
intersecting  R,  since  G{E,  ps)  C  H {E,  ps)-  The  latter  set, 
which  is  the  coverage  Cov{R)  of  R  can  be  computed  from 
the  set  of  intersecting  Minkowski  differences,  as  this  set  has 
constant  size  as  well  by  Lemma  4.4.  The  traversal  of  the 
O(n^)  regions  takes  time  proportional  to  the  number  of  re¬ 
gions,  leading  to  the  conclusion  that  the  computation  time 
T{n)  is  dominated  by  the  running  time  of  the  trapezoidation 
algorithm. 

Lemma  4.7  T(n)  =  O(n^logn). 

Substitution  of  the  described  vertical  trapezoidation  in  the 
first  step  of  Algorithm  FatMot  leads  to  a  cell  decomposition 
of  size  O(n^)  computed  in  0(n^  logn)  time. 

Theorem  4.8  The  motion  planning  algorithm  amidst  fat 
polyhedral  obstacles  in  can  be  solved  in  time  0{n“  log  n). 

If  the  assumption  of  polyhedral  obstacles  is  dropped  we  are  no 
longer  able  to  tightly  wrap  the  obstacles.  As  a  consequence, 
we  are  also  no  longer  able  to  obtain  an  arrangement  of  wrap¬ 
ping  boundaries  of  size  0{n)  that  partitions  the  workspace 
into  regions  of  constant-size  coverage.  Hence,  a  different 
strategy  for  finding  a  cc-partition  is  necessary.  It  can  be 
shown  [14]  that  a  possible  cc-partition  of  the  workspace  is 
given  by  the  supporting  planes  of  the  faces  of  the  bound¬ 
ing  boxes  of  the  grown  obstacles  G(£',ps).  This  partition 
yields  O(n^)  regions  and  adjacencies,  and  leads  to  an  O(n^) 
motion  planning  algorithm,  which  seems  definitely  open  for 
improvement. 

Another  nice  result  is  obtained  for  general  obstacles  from 
a  bounded  size  range.  If  we  assume  all  obstacle  diameters 
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to  be  in  the  range  Smin  ■  ■  ■  ^max  =  u  ■  ^min  for  some  con¬ 
stant  u  >  1,  then  a  cc-partition  of  optimal  0{n)  size  can 
be  constructed.  The  idea  is  to  put  an  imaginary  orthogo¬ 
nal  grid  ,P2&min  ,Pz5rain)\Pl,P2,P3  G  Z}  in  the 

workspace  W  and  partition  W  into  axis-parallel  boxes  w'ith 
the  grid  points  as  their  corners.  A  key  observation  is  that  the 
bounding  box  of  each  grown  obstacle  can  be  covered  with  a 
constant  number  of  grid  cubes  with  side  length  5min^  leading 
to  0{n)  such  cubes  with  constant  size  coverage  by  Lemma 
2.2.  The  complement  of  the  cubes  is  decomposable  by  0{n) 
axis-parallel  boxes  of  measure  x  ^min  x  h  ■  Smin  >  with 
h  E  N,  and  0(n)  axis-parallel  boxes  that  are  unbounded  in 
the  z-direction.  The  resulting  regions  provide  a  cc-partition 
graph  ( V\v ,  f?w)  of  size  0{n),  that  can  be  computed  together 
with  the  region  coverage  Cov  in  time  0{n  logn)  (see  again 
[14]  for  the  details). 

Theorem  4.9  The  motion  planning  problem  amidst  fat  ob¬ 
stacles  in  with  diameters  in  the  range  Smin  ■  ■  -  u  ■  Smin, 
for  some  constant  u  >  1,  can  be  solved  in  time  0(n  log  n). 


5  Conclusion 

In  this  paper,  we  have  presented  a  paradigm  for  planning  the 
motion  of  a  constant  complexity  robot  amidst  fat  constant 
complexity  obstacles.  The  efficiency  of  the  method  relies 
on  the  ability  to  find  small  and  efficiently  computable  so- 
called  cc-partitions  of  the  workspace.  We  have  furthermore 
shown  that  small  cc-partitions  of  the  workspace  do  indeed  ex¬ 
ist  for  various  instances  of  workspaces  with  obstacles.  A  cc- 
partition  of  optimal  size,  i.e.,  0{n),  has  been  found  for  planar 
Euclidean  workspaces  with  arbitrarily  shaped  obstacles  and 
for  3D  Euclidean  workspaces  with  arbitrary  obstacles  of  com¬ 
parable  sizes.  The  partitions,  both  computable  in  0{n  log  n), 
lead  to  nearly-optimal  algorithms  for  motion  planning  in  the 
corresponding  settings.  Additional  cc-partitions  have  been 
found  for  Euclidean  workspaces  with  polyhedral  and  arbi¬ 
trary  obstacles.  They  lead  to  motion  planning  algorithms  in 
the  respective  settings  with  running  times  0{n~  \ogn)  and 
O(n^),  based  on  quadratic  and  cubic  size  partitions.  An 
obvious  open  question  is  whether  it  is  possible  to  enhance 
the  latter  results  by  finding  smaller,  efficiently  computable, 
cc-partitions  for  these  environment  instances. 

The  presented  paradigm  has  a  large  applicability.  The  listed 
bounds  and  algorithms  are  independent  of  the  actual  number 
of  degrees  of  freedom  of  the  robot,  as  long  as  this  number 
is  constant.  The  paradigm  therefore  not  only  caters  for  rigid 
robots  but  is  instead  also  capable  of  handling  robots  that 
both  move  and  have  articulated  arms  and  legs.  Moreover, 
preliminary  results  on  using  (a  slightly  generalized  version 
of)  the  paradigm  for  planning  coordinated  motions  of  two  or 
more  robots  amidst  fat  obstacles  are  promising. 


References 

[1]  H.  Alt,  R.  Fleischer,  M.  Kaufmann,  K.  Mehlhorn,  S. 
Naher,  S.  Schirra,  and  C.  Uhrig,  Approximate  motion 
planning  and  the  complexity  of  the  boundary  of  the  union  of 
simple  geometric  figurts,  Algorithmica  8  (1992),  pp.  391-406. 

[2]  F.  Avnaim,  J.-D.  Boissonnat,  and  B.  Faverjon,  A  practical 
exact  motion  planning  algorithm  for  polygonal  objects  amidst 
polygonal  obstacles,  Proc.  Geometry  and  Robotics  Workshop 
(J.-D.  Boissonnat  and  J.-P.  Laumond  Eds.),  Lecture  Notes  in 
Computer  Science  391  (1988),  pp.  67-86. 

[3]  J.L.  Bentley  and  T.A.  Ottman,  Algorithms  for  reporting 
and  counting  geometric  intersections,  IEEE  Transactions  on 
Computers  28  (1979),  pp.  643-647. 

[4]  M.  DE  Berg,  L. J.  Guibas,  and  D.  Halperin,  Vertical  decom¬ 
positions  for  triangles  in  3-space,  Proc.  10th  ACM  Symp.  on 
Computational  Geometry  (1994). 

[5]  J.  Matousek,  N.  Miller,  J.  Pach,  M.  Sharir,  S.  Sifrony, 
AND  E.  Welzl,  Fat  triangles  determine  linearly  many  holes, 
Proc.  32nd  IEEE  Symp.  on  Foundations  of  Computer  Science 
(1991),  pp.  49-58. 

[6]  M.H.  Overmars,  Point  location  in  fat  subdivisions,  Inform. 
Proc.  Lett.  44  (1992),  pp.  261-265. 

[7]  J.T.  Schwartz  and  M.  Sharjr,  On  the  piano  movers’  prob¬ 
lem  I.  The  case  of  a  two-dimensional  rigid  polygonal  body 
moving  amidst  polygonal  boundaries.  Comm.  Pure  Appl. 
Math.  36  (1983),  pp.  345-398. 

[8]  J.T.  Schwartz  and  M.  Sharir,  On  the  piano  movers’  prob¬ 
lem  11.  General  techniques  for  computing  topological  proper¬ 
ties  of  real  algebraic  manifolds,  Adv.  in  Applied  Mathematics 
4(1983),  pp.  298-351. 

[9]  J.T.Schwartz  and  M.  Sharir,  Efficient  motion  planning  al¬ 
gorithms  in  environments  of  bounded  local  complexity,  Report 
164,  Department  of  Computer  Science,  Courant  Inst.  Math. 
Sci.,  New  York  NY  (1985). 

[10]  S.  Sifrony  and  M.  Sharir,  A  new  efficient  motion  plan¬ 
ning  algorithm  for  a  rod  in  two-dimensional  polygonal  space, 
Algorithmica  2  (1987),  pp.  367-402. 

[1 1]  A.F.  VAN  der  Stappen,  D.  Halperin,  and  M.H.  Overmars, 
The  complexity  of  the  free  space  for  a  robot  moving  amidst 
fat  obstacles,  Computational  Geometry:  Theory  and  Applica¬ 
tions,?,  {1992),  pp.  353-373. 

[12]  A.F.  van  der  Stappen,  The  complexity  of  the  free  space  for 
motion  planning  amidst  fat  obstacles.  Journal  of  Intelligent  & 
Robotic  Systems,  in  press. 

[13]  A.F.  VAN  der  Stappen,  D.  Halperin,  and  M.H.  Overmars, 
Efficient  algorithms  for  exact  motion  planning  amidst  fat  ob¬ 
stacles,  Proc.  of  the  IEEE  Int.  Conf  on  Robotics  and  Automa¬ 
tion,  Atlanta  GA  (1993),  Vol.  1,  pp.  297-304. 

[14]  A.F.  VAN  der  Stappen  and  M.H.  Overmars,  Motion  plan¬ 
ning  amidst  fat  obstacles,  in  preparation. 


40 


Approximate  Euclidean  Shortest  Path  in  3-Space 

(Extended  Abstract) 


Joonsoo  Choi  Jurgen  Sellen*  Chee-Keng  Yap^ 
Courant  Institute  of  Mathematical  Sciences 
New  York  University 


Abstract 

Papadimitriou’s  approximation  approach  to  the  Eu¬ 
clidean  shortest  path  (ESP)  problem  in  3-space  is  re¬ 
visited.  As  this  problem  is  NP-hard,  his  approach 
represents  an  important  step  towards  practical  algo¬ 
rithms.  Unfortunately,  there  are  non-trivial  gaps  in 
the  original  description.  Besides  giving  a  complete 
treatment,  we  also  give  an  alternative  to  his  subdivi¬ 
sion  method  which  has  some  nice  properties.  Among 
the  tools  needed  are  root-separation  bounds  and  non¬ 
trivial  applications  of  Brent’s  complexity  bounds  on 
evaluation  of  elementary  functions  using  floating  point 
numbers. 


1  Introduction 

The  Euclidean  shortest  path  (ESP)  problem  can  be 
formulated  as  follows;  given  a  collection  of  polyhe¬ 
dral  obstacles  in  physical  space  S,  and  source  and  tar¬ 
get  points  soi^o  e  S,  find  a  shortest  obstacle-avoiding 
path  between  sq  and  to-  Here  S  is  typically  or  E^ . 
It  is  evident  that  this  is  a  basic  problem  in  applica¬ 
tions  such  as  robotics.  The  case  5  =  E^  has  recently 
seen  a  breakthrough,  with  the  O(nlogn)  algorithm  of 
Suri  and  Hershberger  [7].  However,  our  interest  is  the 
case  S  -  If?.  Here,  Canny  and  Reif  [2]  proved  that  the 
problem  is  NP-hard.  In  contrast,  Papadimitriou  [6] 
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described  a  polynomial-time  approximate  algorithm. 
The  approximate  Euclidean  shortest  path  problem  has, 
in  addition  to  the  above  inputs,  an  input  parameter 
e  >  0  and  we  are  required  to  construct  an  obstacle¬ 
avoiding  path  whose  length  is  at  most  (l-fe)  times  the 
length  of  the  shortest  path.  As  Papadimitriou  sug¬ 
gested,  his  algorithm  offers  hope  for  practical  imple¬ 
mentation.  The  approach  is  basically  simple  and  may 
be  very  similar  to  ones  that  practitioners  intuitively 
use,  although  without  a  corresponding  analysis.  Un¬ 
fortunately,  there  are  several  non-trivial  gaps  in  this 
important  paper.  There  is  an  alternative  approach 
to  the  approximate  ESP  problem  due  to  Clarkson  [3]; 
but  it  is  not  directly  comparable  to  Papadimitriou’s 
result  because  the  complexity  of  Clarkson’s  algorithm 
depends  on  an  additional  parameter  p. 

Most  geometric  algorithms  are  developed  within 
the  context  of  one  of  two  distinct  computational 
frameworks,  the  algebraic  framework  ot  the  bit  frame¬ 
work.  In  the  algebraic  framework,  we  count  the  num¬ 
ber  of  algebraic  operations,  assuming  that  each  al¬ 
gebraic  operation  returns  an  exact  value.  Computa¬ 
tional  models  appropriate  for  the  algebraic  framework 
include  arithmetic  circuits  and  real  RAMS.  In  the  bit 
framework,  we  are  interested  in  the  representation  of 
algebraic  values  as  binary  strings.  Algebraic  opera¬ 
tions  must  be  reduced  to  Boolean  operations.  Com¬ 
putational  models  that  cater  to  the  bit  framework  in¬ 
clude  Boolean  circuits  and  Turing  machines. 

Most  basic  problems  in  the  algebraic  framework 
have  only  one  input  size  parameter  n,  which  is  usu¬ 
ally  proportional  to  the  number  of  algebraic  values  in 
the  input.  For  instance,  the  problem  of  multiplying 
two  polynomials  of  degree  n  is  said  to  have  input  size 
n,  and  can  be  solved  in  0(n  log  n)  algebraic  opera¬ 
tions.  The  result  of  Suri  and  Hershberger  mentioned 
above  also  falls  under  the  algebraic  framework.  Ba¬ 
sic  problems  in  the  bit  framework  usually  have  two 
parameters,  n  and  L,  where  n  is  as  before  and  L  is 
the  maximum  bit-size  of  any  algebraic  value.  For  in¬ 
stance,  isolating  the  roots  of  a  polynomial  of  degree  n 
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where  each  coefficient  has  at  most  L  bits.  Sometimes 
n  is  bounded  and  may  be  omitted;  an  example  is  the 
GCD  of  two  integers.  Most  approximation  problems 
(such  as  the  approximate  ESP  above)  naturally  be¬ 
long  to  the  bit  framework.  In  this  case,  we  have  an 
additional  input  parameter,  e.  Usually,  e  is  specified 
by  an  integer  s  where  e  =  2“®. 

The  main  source  of  trouble  in  Papadimitriou’s  pa¬ 
per  (and,  for  that  matter,  Clarkson’s  paper)  lies  in 
the  fact  that  it  is  a  result  in  the  bit  model  and  yet  it 
lapses  into  the  algebraic  framework  at  several  critical 
moments.  This  is  inadmissible  because  we  know  of  no 
sensible  combination  of  the  bit  framework  with  the  al¬ 
gebraic  framework.  After  all,  the  AP-hardness  of  ESP 
result  falls  under  the  bit  framework,  and  the  approxi¬ 
mation  result  of  Papadimitriou  is  generally  viewed  as 
an  approach  to  circumvent  this  hardness  barrier.  As 
an  example  of  the  lapses,  note  that  lengths  of  polygo¬ 
nal  paths  are  sums  of  square-roots  of  rational  numbers. 
These  values  are  no  problem  to  compute  in  the  alge¬ 
braic  framework,  but  in  the  bit  framework,  one  must 
work  out  the  precision  to  which  one  need  to  compute 
these  values.  We  will  see  that  the  precision  needed 
depends  on  how  the  numbers  are  to  be  used.  One 
cannot  take  this  precision  issue  lightly  as  it  is  pre¬ 
cisely  (no  pun  intended)  the  large  precision  needed  for 
the  sums  of  square-roots  computation  that  prevents  us 
from  placing  the  ESP  problem  (even  the  planar  case) 
into  the  class  NP. 

Once  one  realizes  these  lapses,  it  is  still  not  a  rou¬ 
tine  matter  to  decide  the  precision  to  which  we  must 
compute  these  algebraic  numbers.  We  will  employ  ba¬ 
sic  techniques  of  root  bounds  and  algebraic  computa¬ 
tion  (see  [9]) .  One  complication  is  that  we  will  be  using 
approximations  of  approximations  (and  so  on).  The 
second  difficulty  is  that  we  need  to  invoke  Brent’s  re¬ 
sults  [1]  concerning  the  evaluation  of  elementary  func¬ 
tions  to  carry  out  the  various  approximations.  But  ap¬ 
plication  of  Brent’s  results  requires  some  arguments, 
as  we  will  see. 

The  heart  of  Papadimitriou’s  algorithm  is  a  subdi¬ 
vision  scheme  to  break  up  edges  into  smaller  segments. 
In  our  reworking  of  Papadimitriou’s  approach,  we  in¬ 
troduce  a  new  subdivision  scheme  that  has  two  nice 
features.  If  e  is  halved,  we  can  reuse  all  the  points  in 
the  previous  subdivision.  This  can  save  recomputa¬ 
tion  in  practice  (although  we  do  not  know  if  this  is  an 
asymptotic  improvement).  Further,  it  turns  out  that 
our  subdivision  uses  fewer  points  (this  is  an  asymp¬ 
totic  improvement  when  e  is  small  enough). 

The  rest  of  this  abstract  is  organized  as  follows: 
In  section  2,  we  sketch  Papadimitriou’s  algorithm  in 


three  parts  and  indicate  where  the  difficulties  come 
into  each  part.  In  section  3,  the  new  subdivision 
method  for  edges  is  described.  In  section  4,  we  give 
some  general  lemmas  on  application  of  Brent’s  com¬ 
plexity  bounds  for  floating-point  number  computa¬ 
tions.  In  sections  5  to  8,  we  sketch  the  fixes  to  the 
original  algorithm.  We  conclude  in  section  9. 

2  Outline  of  an  Approximate  Algo¬ 
rithm 

Assume  we  are  given  a  collection  of  polyhedral  ob¬ 
stacles,  source  and  target  points  SqToi  and  an  0  < 
£  <  1/2.  We  assume  that  the  obstacles  are  bounded 
by  triangular  facets.  There  are  a  total  of  n  edges  in 
the  collection  of  obstacles.  Each  vertex  of  obstacles 
and  So,  to  are  specified  by  L-bit  integers.  Throughout 
this  paper,  we  use  a  parameter 

e 

Con’ 

where  Co  is  a  suitably  large  constant.  For  the  present 
discussion,  Co  >  8  suffices.  The  algorithm  of  Pa¬ 
padimitriou  can  be  divided  into  three  parts: 

(I)  We  divide  each  edge  e  into  a  set  of  segments  by 

introducing  points  po,  P±i,  P±2,  •  •  ••  We  call  the 
Pi’s  the  break  points  and  po  the  origin  of  e.  Each 
segment  is  of  the  form  ai  =  [pi,pi+i]  for  i  G  Z. 

(II)  The  visibility  graph  G  =  (A,  A)  is  constructed. 
The  nodes  A  of  the  graph  comprise  the  segments 
from  (I)  and  also  So,to.  We  can  view  sq  and  to 
as  degenerate  segments.  The  visibility  edges  A 
comprise  pairs  (tr,  a')  of  nodes  that  can  see  each 
other,  meaning  that  there  exists  x  e  o’,  x'  E  cr' 
such  that  the  line  segment  [x,  x']  avoids  the  (in¬ 
terior  of)  obstacles.  We  call  such  a  line  segment 
[a;,  x']  a  visibility  segment. 

(III)  We  compute  for  each  visibility  edge  (cr,  cr')  a 
“nominal  cost”  C{a,cr'),  defined  to  be  the  Eu¬ 
clidean  distance  between  the  midpoints  of  a  and 
a' .  We  now  apply  Dijkstra’s  shortest  path  algo¬ 
rithm  to  the  weighted  graph  G'  =  (A,  A;  so,C). 
The  shortest  path  tt  in  G'  from  sq  to  to  is  output. 

The  correctness  of  the  algorithm  amounts  to  the 
claim  that  the  cost  C(7r)  is  at  most  (l-|-e)C'(7r*)  where 
C(ir*)  denotes  the  Euclidean  length  of  an  actual  short¬ 
est  path  TT*.  According  to  the  original  paper,  every¬ 
thing  hinges  on  a  clever  subdivision  scheme  for  the 
points  p±i  (i  =  0,1,.. .)  in  part  (I). 
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We  now  give  some  counter  indications: 

(I)  ’  The  origin  po  of  an  edge  e  is  a  specially  chosen 

rational  point.  For  |i|  >  0,  ||po  “  Pi||2  is  the  rational 
number  ei||so  -poll2(l  +  But  this  means  that 

the  coordinates  of  pi  would  in  general  be  irrational. 
The  original  paper  assumes  that  pi  can  be  exactly  de¬ 
termined.  But  in  fact,  we  must  determine  the  preci¬ 
sion  to  which  to  compute  the  points.  This  precision 
depends  on  the  subsequent  use  of  these  points  -  see 
(III)’  below. 

(II) ’  For  any  edge  e,  let  span(e)  denote  the  line 
through  e.  In  general,  for  any  set  X  of  points, 
span(X)  is  the  affine  span  of  X.  Following  Papadim- 
itriou,  let  us  fix  two  edges  e,  /  and  let  x  and  y  be  real 
parameters  for  points  in  span{e)  and  span{f),  respec¬ 
tively.  Let  e(x)  and  f{y)  denote  the  corresponding 
points.  Now  consider  a  triangular  facet  T  bounding 
an  obstacle.  We  view  T  as  the  intersection  of  three 
half-planes  contained  in  span(T).  Let  H  be  one  such 
half-plane.  So  its  boundary  dH  contains  an  edge  of 
T.  The  set  of  potential  visibility  segments  [e(x),f{y)] 
such  that  span{e{x),  f{y))  intersects  dH  is  a  hyper¬ 
bola 

(x  -  a){y  -  6)  =  c 

for  some  a,  6,  c,  as  noted  by  Papadimitriou  and  Guibas 
[6].  However,  the  set  H*  of  points  (x,y)  correspond¬ 
ing  to  potential  visibility  segments  that  intersect  H  is 
never  of  the  form  {x  —  a)(y—b)  <  c  or  {x  —  a)(y—b)  >  c, 
contrary  to  [6] .  This  is  intuitively  obvious  since  the  hy¬ 
perbola  is  solely  determined  by  dH,  and  does  not  take 
into  account  the  actual  plane  span{H).  Assuming  that 
we  correctly  figure  out  the  configuration  obstacle  H* 
corresponding  to  any  half-plane  H ,  the  corresponding 
configuration  obstacle  T*  of  T  is  now  given  by 

T*  =  HI  n  h;  n  h; 

where  ifi  n  Lf2  H  iLs  =  T.  Finally,  the  free-space  FP 
in  the  {x,y)  parameter  space  is  the  complement  of 
the  union  of  all  the  T*’s.  The  boundary  of  FP  will 
consist  of  axes-parallel  line  segments  and  hyperbolic 
segments.  In  the  plane  sweep  algorithm  proposed  to 
determine  FP,  the  priority  of  events  are  now  the  x- 
coordinates  of  intersections  of  these  straight  lines  and 
hyperbolas.  The  original  paper  improperly  assumes 
that  we  can  compute  these  priorities  exactly. 

(III) ’  The  original  paper  assumes  that  we  can  com¬ 
pute  the  nominal  cost  function  C{cr,  a')  exactly.  There 
are  two  sources  of  inaccuracies:  we  can  only  compute 
the  midpoints  of  tr  approximately,  and  then  we  only 
compute  an  approximation  to  the  distance  between 
two  approximate  midpoints.  A  more  serious  matter 


arises:  suppose  the  algorithm  outputs  a  sequence  of 
segments 

(Ti,  <T2,  .  .  .  ,  (Ti, 

that  purports  to  be  an  approximation  to  the  shortest 
path.  This  means  that  there  exists  a  zig-zag  path 

TT  =  (s.^iTi,  •  ■■,qk,rk,t) 

(where  qi,  r*  G  (Xi)  that  avoids  all  obstacles.  (Note  that 
qi,ri  can  be  quite  far  apart  in  absolute  terms.)  We 
must  compute  an  approximation  to  this  zig-zag  path, 
since  obtaining  this  explicit  path  is  the  raison  d’etre  of 
most  applications.  The  approximation  must  be  guar¬ 
anteed  to  be  obstacle-avoiding  (and  still  achieves  an 
e-approximation) . 


3  A  New  Subdivision  Scheme 


In  this  section,  we  present  a  new  subdivision  scheme 
for  edges.  The  original  scheme  would  work  as  well,  but 
our  new  scheme  is  motivated  by  the  desire  that  the 
subdivision  for  e/2  is  a  refinement  of  the  subdivision 
for  e.  As  in  the  original,  the  origin  po  of  an  edge  e  is 
chosen  to  be  the  closest  point  on  e  to  the  source  sq. 

We  now  describe  the  subdivision  of  the  half-edge  F 
to  one  side  of  the  origin  (the  other  half-edge  is  sim¬ 
ilarly  treated).  Let  d  =  ||so  -  Polb  be  this  closest 
distance.  Choose  pi  on  half-edge  e'  at  distance  d  from 
Po.  The  segment  [po,Pi]  is  uniformly  subdivided  into 
segments  of  length  dei/2  by  introducing  the  points 
qi,q2,--:  The  rest  of  the  half-edge  e'  is  subdivided 
using  the  points  P2,P3,  •  •  •  defined  as  follows.  Let  m 
be  the  smallest  integer  satisfying 

.ym  >  log(^(e)/d) 

-  log(l  +  ei)’ 


where  ^(e)  denotes  the  length  of  an  edge  e.  Then  we 
define  p,  such  that 


IIpo  -Pilb  =  d 


Let  D{a,  a')  denote  the  shortest  distance  between  the 
two  segments  cr  and  cr' . 


Lemma  1 

(i)  The  subdivision  satisfies  the  basic  inequality 

i{cr)  <  eiD{so,<7) 

(ii)  There  are  at  most  N  =  0{n{L/€i))  segments  in 
all  the  edges. 

(Hi)  The  subdivision  for  e/2  is  a  refinement  of  the  sub¬ 
division  for  e. 
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Property  (i)  is  similar  to  the  original  scheme.  Prop¬ 
erty  (ii)  improves  on  the  original  by  an  additive  factor 
of  0{n\og(l/ei)/ei).  This  improvement  can  be  sig¬ 
nificant  because  e  might  be  exponentially  small  in  L, 
especially  if  we  want  to  guarantee  that  we  approach 
an  actual  shortest  path.  Property  (iii)  is  advantageous 
to  have  when  we  want  to  use  successive  refinements  of 
e.  Note  that  Papadimitriou’s  scheme  does  not  enjoy 
such  a  feature.  We  note  two  possible  disadvantages 
for  our  scheme:  We  use  two  distinct  methods  to  sub¬ 
divide  e  -  for  the  region  near  the  origin,  the  scheme  is 
uniform  and  the  rest  uses  an  exponential  scheme.  It 
seems  that  both  parts  are  essential  to  get  our  proper¬ 
ties  (ii)  and  (iii).  Also,  our  segment  endpoints  (unlike 
Papadimitriou’s)  are  at  irrational  distances  from  the 
origin. 

4  Floating  Point  Number  Computa¬ 
tions 

Approximate  computations  introduce  a  host  of  de¬ 
tailed  calculations  which  we  highlight  in  this  section. 
Our  subdivision  scheme  requires  us  to  compute  values 
of  the  form  where  E,  F  are  numerical  expressions. 
The  following  captures  what  is  needed; 

(PI)  Let  s,Li,L2  be  given  natural  num¬ 
bers  and  E,  F  be  numerical  expressions 
satisfying  <  F  <  1  and  -L2  < 

ln£^  <  The  problem  is  to  com¬ 

pute  E^  to  precision  s,  in  time  polynomial 
in  s,  Li,  L2- 

In  application,  we  have  S  +  L1+L2  =  0(log(l/ei)-p 
L).  Note  that  x  approximates  x  to  precision  (resp,, 
relative  precision)  s  means  |a:  —  a;|  <  2”’  (resp., 

I  (a;  —  x)/x\  <  2“®).  We  resort  to  Brent’s  results  [I] 
on  computation  of  elementary  functions,  and  compute 
E^  using  the  following  obvious  straightline  program; 
(i)  Compute  Vi ,  V2  which  approximate  E,  F  to  preci¬ 
sion  Si  and  S2,  respectively;  (ii)  Compute  V3  which 
approximates  ln(T/i)  to  precision  S3;  (iii)  Compute  V4 
which  approximates  V2  ■  V3  to  precision  S4;  finally, 
(iv)  Compute  V5  which  approximates  exp(V4)  to  pre¬ 
cision  S5.  We  conclude  that  (with  suitable  choices  of 
Si’s)  V5  approximates  E^  to  precision  s.  We  would 
hope  that  the  cost  of  these  steps  is  polynomial  in 
s,  Li,  L2-  The  sequence  of  approximations  is  compli¬ 
cated  by  the  fact  that  application  of  Brent’s  result  is 
not  straightforward;  his  bounds  for  elementary  func¬ 
tions  assume  that  each  function  is  defined  over  an  arbi¬ 
trary  but  fixed  domain  [a,  6].  Under  this  assumption, 


one  can  compute  exp  (a;)  and  ln(2;)  to  precision  s  in 
time  0(p(s)  log  s)  where 

fi{s) 

denotes  the  bit-complexity  of  multiplying  s-bit  inte¬ 
gers.  Towards  a  satisfactory  solution  of  (PI),  consider 
a  subproblem; 

(P2)  Let  s,  L  be  given  natural  numbers  and 
X  a  number  satisfying  2^^  <  |a:|  <  2^. 
Compute  exp(a;)  to  precision  s  in  time 
polynomial  in  s,  |2;|. 

Floating  point  numbers.  Let  us  briefly  describe 
the  floating  point  number  system  we  will  be  assum¬ 
ing.  A  floating  point  (f.p.)  number  a  is  a  pair  (e,  /)  of 
binary  integers  where  e  is  the  exponent  and  /  the  frac¬ 
tion.  Let  (/)  denote  the  number  obtained  by  placing 
a  binary  decimal  point  just  in  front  of  the  most  signifi¬ 
cant  one  (T’)  in  /  (if  /  =  0  then  define  (/>  =  0).  Thus 
1/2  <(/)<!  for  /  ^  0.  The  number  represented  by 
a  is  2®  •  (/).  We  also  write  a  ~  2®(/),  to  emphasize 
that  a  is  the  representation  itself,  not  just  the  value  it 
represents.  The  precision  of  a  is^  equal  to  1  -t-  [ig/J 

Lemma  2  Assuming  a  floating-point  representation 
of  numbers,  we  can  solve  (PS)  in  time  0{fi{s  -t- 
|a;|)log(s-t-|2;|)  i/1  <  |a:|  <  2^,  and  0{fj.{s +  \og  L){L  + 
log(s-t-IogL)))  j/2-''^  <  III  <  1. 

Let  us  sketch  a  proof  of  lemma  2.  Let  s'  =  s  -|-  L 
and  (e,  /)  ~  £.  Then 

(1,/)  ~2/:=a;-2-L‘s^J 

which  lies  in  the  range  [1,  2).  Applying  Brent’s  result, 
we  can  compute  a  value  z  which  approximates  exp(2/) 
to  precision  s'  in  time  0(ju(s')  Ig  s').  Now  we  apply 
repeated  squaring  to  z  for  a  total  of  [Ig  x]  times,  each 
time  taking  care  to  truncate  the  result  to  precision  s'. 
For  1*1  <  1,  we  do  repeated  square  roots  instead  of 
squarings.  We  then  argue  that  the  final  result  approx¬ 
imates  exp(x)  to  precision  s. 

The  upshot  of  a  series  of  arguments  similar  to  the 
above  eventually  leads  us  to: 

Lemma  3  Assume  that  we  can  compute  E,  F  to  any 
precision  r  in  time  T£;(r),  7>(r),  respectively.  Then 
lue  can  solve  problem  (PI)  in  time 

0{Tf{s  -(-  Lq)  Te[s  4- 1^2)+ 
max{p(s  -h  L2)  log(s  -|-  L2), 
p(s  -|-  lg(£.o))(Z,o  +  log(s  -t-  Ig(To)))}), 

^We  write  ‘log’  when  the  actual  base  is  irrelevant.  But  Ig  = 
log2  and  In  =  log^ . 
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where  we  write  Lq  for  Lx  +  1/2-  If  Lq  =  0{s),  this 
simplifies  to  0{Te{s)  +  Tf(s)  +  /i(s)s). 


5  Configuration  Obstacles 

Fix  any  two  edges  ei,e2  of  obstacles.  Write  e;  = 
[pi,qi]  and  let  us  parameterize  the  points  on  span(ej) 
such  that  xpi  +  {l  —  x)qi  is  denoted  by  ei{x),  for  x  G  M. 
The  set  of  pairs  (x,  y)  corresponding  to  a  potential  vis¬ 
ibility  segment  [ei(x),  62(2/)]  is  thus  parameterized  by 
the  unit  square  Q  =  [0, 1]  x  [0, 1].  Let  Q  be  embed¬ 
ded  in  Q'  =  which  denotes  the  parameter  space 
for  [span(ei)  x  span(e2)].  For  any  obstacle  X  C  M®, 
let  X*  denote  the  set  of  points  (x,  y)  G  Q'  such  that 
[ei(x),  62(2/)]  intersects  the  relative  interior  of  X.  We 
call  X*  the  configuration  obstacle  corresponding  to  X. 
As  in  section  2,  it  is  enough  to  understand  the  con¬ 
figuration  obstacle  H*  where  Ff  is  a  half-plane.  As¬ 
sume  that  dH  contains  the  edge  [u,  u]  of  a  triangular 
face.  Let  £(x,  y)  be  the  line  passing  through  the  points 
ei(x), 62(2/)'  The  requirement  that  £{x,y)  intersects 
dH  reduces  to  the  equation  Axy  +  By  +  Cx  +  D  =  0 
where  A  =  det{v-u  |  px  —  qi  \  P2-92),  B  =  det(t;-u  | 
qi-u  \  p2  -  32),  C  =  det(v  -  u  |  pi  -  31  |  32  -  «) 
and  D  =  det(ii  —  u  |  31  —  u  |  32  —  u).  If  A  ^  0, 
we  may  rewrite  the  equation  in  a  standard  form  for 
hyperbolas:  (Ax  +  B)(Ay  -|-  C)  =  —AD  +  BC. 


Degenerate  position.  We  say  the  two  edges  ei,e2 
are  in  degenerate  position  with  respect  to  FT  if  A  =  0 
or  (A  7^  0)  A(AF?  — BC  =  0).  Let  us  analyze  two  edges 
in  degenerate  position.  If  A  =  0  then  it  can  be  shown 
that  either  (i)  the  points  u,  v,  u+pi  —  qy ,  u+P2  —  32  are 
on  the  same  plane,  or  (ii)  the  two  edges  61,62  are  in 
two  parallel  planes  that  do  not  intersect  with  the  line 
dH.  Now  consider  the  case  A  7^  0.  Using  the  relation 
(see  [5]) 

det(a  I  6  I  d)  det(a  |  c  |  e)  = 
det(a  I  6  I  c)  det(a  |  d  |  e) 

—  det(a  I  6  I  e)  det(a  |  d  |  c). 


we  may  deduce 

Lemma  4 

(i)AD-BC  = 


det 


u  V  u+pi-qi  qx  \ 

11  1  I  J 


■  det 


U  V  U  -t-  P2  -  32  32  ^ 

11  1  I  )' 


(ii)  If  AD  —  BC  =  0  then  one  of  the  lines 

span(ex),  span(e2)  either  is  parallel  with  dH,  or  in¬ 
tersects  dH. 


Suppose  61,62  is  not  in  degenerate  position  with 
respect  to  H.  It  can  be  shown  that  the  sign  of  AD  — 
BC  depends  on  the  direction  of  vector  p,  —  3;  with 
respect  to  H  and  on  whether  span(ei)  intersects  H, 
for  i  =  1, 2. 


Configuration  Obstacles.  We  are  ready  to  de¬ 
scribe  H*  C  Q'.  Let  us  define  two  points 
(xo,  po),  (2?!,  2/1)  G  Q'  as  follows:  span(H)  intersects 
span(ei)  at  ei(xo)  and  intersects  span{e2)  at  62(^0)- 
Let  Pi  be  the  plane  containing  dH  and  parallel  to  Cj. 
Then  Px  intersects  span{e2)  at  62(^1)  and  P2  intersects 
span(ei)  at  ei(xi).  We  may  assume  that  these  inter¬ 
section  points  are  well-defined.  Furthermore,  imag¬ 
ine  span(H)  to  be  the  horizontal  plane  {Z  —  0}  in 
physical  (A,  Y,  Z)-space,  and  that  pi  —  qx  have  a  pos¬ 
itive  A-component.  The  vertical  and  horizontal  lines 
through  (xo,po)  divide  Q'  into  four  quadrants.  By 
symmetry,  we  may  assume  that  61,62  are  parameter¬ 
ized  to  increase  in  the  positive  Z  direction.  Then  the 
first  and  third  quadrants  are  necessarily  disjoint  from 
H* .  Let  us  consider  one  possibility  (the  other  is  simi¬ 
larly  treated),  where  the  line  dH  separates  ei(xo)  and 
€2(^0)  on  the  plane  span(H).  We  may  assume  (xi,  pi) 
lies  in  the  first  quadrant.  See  figure  1.  The  asymptotes 
of  the  hyperbola  in  this  case  are  the  vertical  and  hor¬ 
izontal  lines  through  (xi,pi),  and  (xo,po)  lies  on  the 
hyperbola.  Suppose  the  hyperbola  lies  to  the  north¬ 
west  and  southeast  of  (xi,pi).  Then  H*  is  shown  as 
shaded  in  figure  1. 
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6  Construction  of  the  visibility  graph 

In  this  section,  we  shall  (like  Papadimitrou)  count 
the  number  of  algebraic  operations,  to  give  some  idea 
of  the  global  complexity.  To  avoid  confusion  with  the 
bit-complexity  accounting  in  the  next  section,  we  call 
each  algebraic  operation  a  “step”. 

Again  focus  on  a  pair  of  fixed  edges  61,62.  The 
break  points  (these  are  the  pi’s  and  g.’s  in  section  3) 
on  61  and  62  define  a  grid  on  Q.  For  segments  (t*-  C  Ci 
{i  =  1,2),  there  is  a  corresponding  grid  rectangle  R  = 
Rii^i,  0-2)  C  Q.  We  say  that  R  is  covered  iff 

/i((Ti,a2)\A'*  =0.  (1) 

Clearly  (cti,  <72)  is  an  edge  in  the  visibility  graph  G  iff 
A(n’i,(T2)  is  not  covered. 

Our  goal  is  to  decide  for  every  grid  rectangle  R 
whether  or  not  it  is  covered;  if  it  is  uncovered,  we  will 
further  determine  a  rational  sample  point  p  such  that 
a  ball  of  some  small  radius  6  centered  at  p  is  contained 
in  R  \  K* .  Note  that  this  determination  will  allow  us 
to  compute  a  guaranteed  obstacle-avoiding  path. 

Let 

M  =  OiL/ei) 

be  a  bound  on  the  maximum  number  of  break  points 
on  any  edge.  Then  there  are  at  most  grid  rect¬ 
angles  in  Q.  We  proceed  slightly  differently  than  Pa- 
padimitriou. 

First  we  compute  the  arrangement  of  the  collection 
of  boundary  arcs  corresponding  to  all  the  configura¬ 
tion  obstacles  T* .  This  can  be  computed  in  0{nX4{n)) 
where  A4(n)  =  0(n  ■  and  a(n)  is  the  inverse  of 

an  Ackermann-like  function.  This  follows  from  the 
fact  that  any  two  boundary  arcs  can  intersect  at  most 
twice,  and  from  the  incremental  algorithm  in  [4].  We 
next  convert  this  arrangement  into  an  explicit  repre¬ 
sentation  of  the  boundary  of  the  configuration  obstacle 
A'*,  say,  in  O(n^logn)  steps.  Finally  we  perform  an 
interesting  kind  of  line  sweep  over  this  representation 
of  A  *  in  (5.  Our  goal  is  to  decide  for  each  grid  rectan¬ 
gle  whether  it  is  covered  by  K* .  There  are  three  kinds 
of  events  for  the  sweepline: 

(i)  Vertices  on  the  boundary  of  K* . 

(ii)  The  break  points  po,p±i,p±2, ...  on  64. 

(iii)  Intersections  between  a  horizontal  grid  line  and  a 
boundary  arc  of  A*.  Unlike  the  first  two  types,  this 
is  dynamically  inserted  into  the  priority  queue. 

The  state  of  the  sweepline  is  represented  by  a  bal¬ 
anced  binary  tree  T  that  contains  a  leaf  for  each  of 
the  horizontal  grid  lines  and  for  each  of  the  boundary 
arcs  that  intersect  the  sweepline.  Whenever  a  grid  line 
and  a  boundary  arc  become  adjacent  in  T,  we  check  if 


they  will  generate  an  event  of  type  (iii).  Whenever  we 
come  to  an  event  of  type  (ii),  we  have  just  completely 
swept  a  column  of  at  most  M  grid  rectangles  and  will 
begin  to  sweep  a  new  column  of  at  most  M  grid  rect¬ 
angles.  We  use  0[M)  steps  to  do  the  processing  of 
these  grid  rectangles.  There  are  O(n^)  events  of  type 

(i)  and  0{nM)  events  of  type  (iii).  They  can  each  be 
processed  in  0(Iog(n  -f  M))  steps.  The  events  of  type 

(ii)  take  0{M)  steps  each,  and  0{M^)  altogether. 

7  Bit  Complexity 

We  must  now  determine  the  bit  complexity  of  the 
algorithm  in  the  preceding  section.  In  particular,  this 
calls  for  determining  the  precision  to  carry  out  the 
various  tasks.  We  need  this,  for  example,  to  make  sure 
that  we  can  do  line  sweep  correctly  and  to  be  able  to 
determine  sample  points  in  each  grid  rectangle  R  that 
is  not  covered.  Towards  this  end,  we  shall  employ  a 
variety  of  classical  techniques  in  algebraic  computing 
[9].  The  upshot  of  the  calculations  (sketched  below) 
is  that 

IT  =  0(log(l/ei)  +  L) 

bits  of  precision  suffices  in  the  computation  of  points. 
So  comparisons  between  numbers  in  the  algorithm  (as 
in  priority  queue  operations)  take  0{W)  time. 

For  a  given  pair  of  edges,  the  following  bit  complex¬ 
ity  is  readily  determined:  The  break  points  p±i,q±j 
will  be  approximated  to  0{W)  bits,  and  by  lemma  3, 
these  can  be  computed  altogether  in  0{M p{W)W) 
time.  We  can  compute  the  arrangement  in  time 
0{n\^[n)p{W)).  To  compute  the  boundary  of  K* 
takes  0(n^logn  ■  W).  The  sweepline  operations  take 
0{W)  per  queue  operation  and  0{fi{W))  to  compute 
events  of  type  (iii).  Thus  all  the  operations  of  type 
(i)  cost  0(n^  log()i  -f  M)W);  type  (ii)  cost  O(M^W) 
and  type  (iii)  cost  0(nMlog(n  -k  M)p{W)).  Sum¬ 
marizing,  the  bit  complexity  of  the  algorithm  in  the 
previous  section,  applied  to  all  pairs  of  edges,  is 

0{n^X4{n)p{W)  +  n^M\ogMp{W) 
+{nMyW  -f-  nMp{W)W). 

We  now  go  into  some  details.  For  simplicity,  we 
will  ignore  additive  constants  in  the  size  of  integer 
numbers. 

There  are  3  types  of  event  points  in  the  line-sweep 
algorithm:  the  ^-coordinates  of  intersection  points  be¬ 
tween  boundary  curves,  of  asymptotes  of  hyperbolic 
curves,  and  of  parameters  for  break  points.  In  the  fol¬ 
lowing,  we  talk  about  “separating  a  set  of  points”  - 
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this  means  we  compute  the  x-  and  y-coordinates  of 
these  points  to  sufficient  precision  so  that  the  relative 
a;-order  and  y-order  of  any  two  points  in  the  set  are 
correctly  determined.  This  is  needed  for  the  line-sweep 
algorithm  to  work  correctly. 

The  precision  to  separate  two  break  points  depends 
on  the  minimum  distance  between  two  consecutive 
break  points,  i.e.,  the  shortest  length  of  line  segments. 
We  have: 

Lemma  5  A  precision  o/0(log(l/ei) -f- L)  suffices  to 
separate  two  break  points. 

Now  consider  the  intersection  points  of  two  hy¬ 
perbolic  curves.  From  the  preceding  development,  if 
Axy  +  By  +  Cx  +  D  =  0  is  the  equation  of  a  hyper¬ 
bola  that  partially  bounds  a  half-space  configuration 
obstacle  H* ,  then  A,  B,  C,  D  are  integers  of  size  3L. 

Let  a  be  the  ^-coordinate  of  an  intersection  point. 
The  maximum  bit  size  for  a  is  the  sum  of  (i)  bit 
size  of  the  integral  part  of  the  maximum  a  and  (ii) 
the  precision  needed  to  separate  any  two  intersection 
points  or  precision  needed  to  separate  any  intersec¬ 
tion  point  from  0.  Let  Pi{x,y)  =  A^xy  +  Biy  + 
Cix  -h  Di,P2{x,  y)  =  A2xy  -f  Say  +  C2X  +  D2  be  hy¬ 
perbolic  functions  with  integer  coefficients.  Then  the 
^-coordinates  of  intersection  points  between  Pi{x,y) 
and  P2{x,y)  are  roots  of  the  resultant  reSy(Pi,P2)) 
where 


reSy(Pi(a;,y),P2(a;,y))  = 


det 


Aix  +  Pi 

A2X  +  P2 


Cix  +  Di 
C2X  -k  D2 


(2) 


The  resultant  is  an  integer  polynomial  with  coeffi¬ 
cients  of  length  QL.  By  Cauchy’s  bound  on  roots  of 
polynomials  [9],  6L  bits  suffice  to  represent  the  inte¬ 
gral  part  of  any  root,  and  6L  bits  suffice  to  separate 
the  roots  from  0. 

Next  let  us  calculate  a  lower  bound  on  the  gap 
between  two  intersection  points,  assuming  they  are 
different.  Let  a  be  the  root  of  the  resultant 
reS;^(Pi(2;,  y),  P2(a;,  y))  and  (3  be  the  root  of  the  re¬ 
sultant  xeSy{P3{x,  y),  P4(x,  y))  where  Pi,  P2,  P3,  P4  are 
hyperbolic  functions  that  are  integer  polynomials.  Ap¬ 
plying  the  generalized  Hadamard’s  bound  [9]  to  the 
two  resultants,  we  show  that  24L  bits  are  needed  to 
separate  a  and  /?.  Thus  we  need  6L,  24T  bits  for 
the  integral,  fractional  part  of  the  intersection  points 
between  two  hyperbolic  functions,  respectively.  This 
yields: 


Lemma  6  A  precision  of  SOL  bits  suffices  to  separate 
any  two  intersection  points  of  hyperbolic  curves. 


Next  we  determine  the  precision  to  separate  the  in¬ 
tersection  points  and  the  approximate  break  points  in 
the  parameter  space  Q' .  Let  P{x)  =  Ax"^  -b  Bx  -|-  C 
be  the  polynomial  in  (2).  The  roots  of  P{x)  contain 
the  a;-coordinates  of  intersection  points  between  two 
hyperbolic  curves.  The  coefficients  of  P  are  integers  of 
length  6L.  The  approximate  position  of  break  points  is 
represented  by  a  rational  number  Df  E  where  E  =  2'^ 
and  D  is  an  integer  such  that  |£)|  <  E.  Therefore 
it  is  the  root  of  (5(2;)  =  Ex  -  D.  The  generalized 
Hadamard’s  bound  again  shows  that  2W  -b  12L  bits 
suffice  to  separate  the  roots  of  P{x)  and  Q{x').  Sum¬ 
ming  up  the  precisions  that  suffice  for  integral  and 
fractional  parts  of  points,  we  conclude: 

Lemma  7  A  precision  of2W  +  ISL  =  0(log(l/ei)  -b 
L)  suffices  to  separate  intersection  points  of  hyperbolic 
curves  from  break  points. 

In  the  same  way,  we  can  show  that  other  types  of 
pairs  of  events  are  separated  with  this  precision. 

Theorem  8  It  suffices  to  compute  the  event  points  to 
precision  0(log(l/ei)  -b  L). 


8  Coda 

Now  that  we  have  a  visibility  graph  G  =  {N,A), 
we  wish  to  compute  the  nominal  cost  of  each  visibility 
edge  in  A.  Unlike  the  original  algorithm,  we  do  not 
pick  the  distance  between  midpoints  of  segments  for 
computing  nominal  costs.  Instead,  we  pick  any  ratio¬ 
nal  point  in  each  segment  instead  of  the  midpoint.  If 
we  run  Djikstra’s  algorithm  on  G,  assuming  that  we 
know  the  exact  nominal  cost  of  each  visibility  edge, 
then  the  original  calculations  shows  that  the  distance 
of  the  “approximate  shortest  path”  tt  is  at  most  1  -b  e 
times  the  length  of  the  actual  shortest  path  tt*  . 

But,  of  course,  our  calculation  only  uses  approx¬ 
imations  to  the  nominal  cost.  The  approximations 
arise  because  we  must  take  square  roots.  Because  we 
choose  rational  points,  it  turns  out  that  we  only  need 
to  compute  square-roots  of  rational  numbers  x  where 
2-4L-6  <  X  <  to  some  precision  s.  It  turns  out 

that 

s  =  log(4n/e) 

suffices,  and  it  takes  0{pL{W))  time  to  compute  the 
square- root  to  this  precision. 

After  this,  we  may  invoke  Dijkstra’s  algorithm  and 
compute  the  approximate  shortest  path  tt.  Note  that 
all  intermediate  numbers  in  Dijkstra’s  are  sums  of  at 
most  n  approximate  distances,  and  hence  they  have 
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sizes  at  most  0(ln(n/e)  +  L).  Each  operation  in  Di- 
jkstra’s  algorithm  involving  these  numbers  takes  time 
0(ln(n/e)  +  L)  since  they  are  either  addition  of  two 
numbers  or  comparisons.  Thus  the  bit  complexity  of 
Dijkstra’s  algorithm  is 

0{N‘^H{W)) 

where  N  :=  nM  is  an  upper  bound  on  the  number  of 
segments  in  all  the  edges. 

One  final  point:  suppose  Dijkstra’s  algorithm  yields 
a  sequence  of  segments  (sq,  0-2, . . . ,  For 

each  consecutive  pair  (crj-,  Ci+i)  of  segments,  we  can 
compute  a  sample  point  (xi,yi)  G  cr,-  x  <7^+1  such  that 
the  segment  [xi ,  ?/,■]  is  a  visibility  segment  (section 
2(III)).  In  fact,  in  the  parameter  space,  we  can  en¬ 
sure  that  there  is  a  ball  Bi  of  positive  radius  centered 
around  (x,,  t/;)  such  that  every  {x',y')  G  Bi  represents 
a  visibility  segment.  Using  the  sample  points,  we  fi¬ 
nally  obtain  an  explicit  zig-zag  path  (section  2(III)’) 
that  is  obstacle  avoiding.  In  conclusion: 

Theorem  9  There  is  an  approximate  algorithm  for 
ESP  in  3-space  that  computes  an  explicit  obstacle¬ 
avoiding  path  with  relative  error  e  >  0  with  bit- 
complexity 

Oi(n^X4in)  +  n^M  logM  +  {nMf)  ■  pl{W)) 
where  W  =  0(ln(n/e)  -|-  L)  and  M  =  0{nL/€). 

9  Conclusion 

1.  Papadimitriou’s  paper  on  approximate  Eu¬ 
clidean  shortest  paths  has  become  recognized  as  a  key 
result  in  this  area.  In  this  paper,  we  rectified  several 
gaps  in  his  paper,  and  presented  for  the  first  time, 
a  true  bit-complexity  bound  for  an  e-approximation 
algorithm  for  ESP. 

2.  It  should  be  clear  from  our  development  that 
there  are  many  tedious  and  non-obvious  details  when 
we  do  such  approximations.  The  result  is  an  inter¬ 
esting  mix  of  combinatorial,  numerical  and  algebraic 
complexity.  Many  of  these  tools  are,  in  fact,  those  that 
are  necessary  in  order  to  make  “exact  computation”  a 
practical  reality  [8]. 

3.  In  our  solution,  we  tried  to  extract  some  gen¬ 
eral  lemmas  about  approximate  computing  with  float¬ 
ing  point  numbers,  to  facilitate  future  applications  of 
such  an  approach.  Note  that  one  could  avoid  Brent’s 
result  by  approximating  Taylor’s  expansions  directly, 
but  this  is  expected  to  yield  an  inferior  bit  complexity. 


4.  We  may  generally  call  this  the  method  of  gen¬ 
eralized  grids.  Of  course,  grids  are  a  familiar  practi¬ 
cal  technique  in  all  of  computational  sciences.  From 
a  complexity  theoretic  viewpoint,  such  methods  have 
been  shunned  in  the  past  as  trivial  or  uninteresting. 
This  need  not  be  so,  as  Papadimitriou’s  work  has 
demonstrated.  In  fact,  grid  methods  may  be  the  most 
practical  recourse  for  solving  some  intractable  prob¬ 
lems.  It  would  be  interesting  to  derive  some  general 
theorems  about  these  approaches. 
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The  Realization  Problem  for  Euclidean  Minimum  Spanning  Trees  is 

NP-hard 
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Abstract 

We  show  that  deciding  whether  a  tree  can  be  drawn  in 
the  plane  so  that  it  is  the  Euclidean  minimum  spanning 
tree  of  the  locations  of  its  vertices  is  NP-hard. 

Keywords:  tree,  algorithm,  graph  drawing,  graph  layout, 
NP-hard,  spanning  tree. 


1  Introduction 

This  paper  investigates  a  fundamental  geometric  real¬ 
ization  problem,  the  realization  problem  for  Euclidean 
minimum  spanning  trees:  Given  a  tree  T  =  (P,  E),  find 
a  location  D{u)  for  each  vertex  u  €  U  so  that  T  is  a 
minimum  spanning  tree  of  {D{u)  :  u  €  P},  or  deter¬ 
mine  that  no  such  locations  exist. 

This  problem  belongs  to  the  growing  body  of  knowledge 
and  research  on  the  geometric  realizability  of  combina¬ 
torial  objects  and  the  related  problem  of  finding  com¬ 
binatorial  characterizations  of  geometric  objects.  The 
Delaunay  triangulation  realizability  problem  [4],  [3]  pro¬ 
vides  another  such  example.  It  asks  whether,  for  a  given 
triangulation  specified  combinatorially  in  terms  of  ver¬ 
tices,  edges,  faces  and  incidence  relations,  there  is  a  way 
to  place  the  vertices  in  the  plane  so  that  their  Delaunay 
triangulation  is  combinatorially  equivalent  to  the  given 
triangulation. 
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The  study  of  graph  drawing  and  layout  also  prompts  in¬ 
terest  in  Euclidean  minimum  spanning  tree  realizability. 
Several  graph  drawing  algorithms  try  to  make  a  layout 
which  is  as  “small”  as  possible  in  some  sense.  Some  aim 
for  minimum  area,  some  for  minimum  width,  some  for 
minimum  total  edge  length  [12,  10].  We  have  observed 
that  in  a  number  of  experimental  results  (see  [5]),  algo¬ 
rithms  for  drawing  trees  often  (but  not  always)  produce 
tree  layouts  that  are  minimum  spanning  trees,  or  nearly 
minimum  spanning  trees,  of  the  vertex  locations. 

In  a  paper  that  studies  graph  theoretic  changes  in  min¬ 
imum  spanning  trees  of  moving  point  sets,  Monma  and 
Suri  [9]  note  that  every  tree  with  maximum  degree  at 
most  five  can  be  drawn  as  a  minimum  spanning  tree, 
and  that  no  tree  with  a  vertex  of  degree  larger  than  6 
can  be  drawn  as  a  minimum  spanning  tree. 

Bose,  Lenhart  and  Liotta  [1]  show  that  for  several 
classes  of  proximity  graphs  (for  example,  relative  neigh¬ 
borhood  graphs  and  Gabriel  graphs),  the  problem  of 
determining  whether  a  tree  can  be  realized  as  a  proxim¬ 
ity  graph  can  be  solved  in  polynomial  time. 

This  paper  presents  the  first  negative  result  in  this  area: 
the  realization  problem  for  Euclidean  minimum  span¬ 
ning  trees  is  NP-hard. 


2  Preliminaries 

Given  a  set  S  of  points  in  the  plane,  a  spanning  tree  of 
5  is  a  tree  whose  vertex  set  is  S.  Edges  of  the  tree  can 
be  regarded  either  as  pairs  of  points  in  5  or  as  open 
line  segments  between  points  in  S.  A  spanning  tree 
T  =  {S,  E)  of  5  is  a  Euclidean  minimum  spanning  tree 
of  5  if 

dist{u,v) 

uv^E 

is  minimized  over  all  spanning  trees  of  S. 
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A  drawing  D  of  a  tree  T  =  (  V,  E)  is  a  pair  of  1-1  func¬ 
tions 

Dv  '■  V  — >•  B? ,  De  ■  E  — <•  L, 

where  L  is  the  set  of  open  line  segments  in  R~,  such 
that  the  endpoints  of  De{u,  v)  are  D[u)  and  D{v).  The 
images  of  the  vertices  and  edges  of  T  under  D  form  a 
tree  which  is  isomorphic  to  T. 

When  discussing  a  tree  T  —  {V,E)  and  some  drawing  D 
of  T,  we  often  use  the  same  notation  for  a  vertex  and  its 
position  as  given  by  Dy ,  and  the  same  notation  for  an 
edge  and  the  corresponding  line  segment.  For  instance, 
if  u,v,a,b  €  V  and  uv,ab  6  E  then  expressions  such 
as  “the  length  of  the  edge  uv"  and  “edge  ab  intersects 
edge  uv”  refer  to  the  drawing  D  of  T.  Also,  a  “vertex” 
or  “edge”  of  D  refers  to  the  image  under  Dy  or  De  of 
a  vertex  or  an  edge  of  T. 

Definition  1  Suppose  thai  D  is  a  drawing  of  T  — 
(V,E),  and  let  S  —  {Dy[v)  :  v  G  V"}.  If  T  is  iso¬ 
morphic  to  a  minimum  spanning  tree  of  S,  then  we  say 
that  D  realizes  T. 

Thus  “T  is  realizable”  means  that  there  is  a  1-1  function 
Dy  that  maps  each  tree  vertex  v  to  a  point  Dy(v)  in 
the  plane,  such  that  T  is  isomorphic  to  at  least  one 
of  the  Euclidean  minimum  spanning  trees  of  the  point 
set  Dy(V).  Hence  our  main  result  can  be  described  as 
follows. 

Minimum  Spanning  Tree  Realizability  (MSTR) 
Instance:  A  tree  T. 

Question:  Is  T  realizable? 

Theorem  1  MSTR  is  NP-hard. 

Specifically,  we  show  that  the  problem  of  determining 
whether  trees  with  maximum  degree  6  are  realizable  is 
NP-hard. 

We  say  a  tree  T  has  a  unique  realization  D,  or  that 
D  uniquely  realizes  T,  if  D  realizes  T  and  for  every 
other  realization  D'  of  T,  there  is  a  congruence  mapping 
Dy{V)  to  Dyiy)  and  De{E)  to  D'^{E)  after  a  suitable 
change  of  scale. 

We  now  give  a  brief  overview  of  the  proof  of  the  main 
result.  We  prove  Theorem  1  by  a  reduction  from  Not- 
All-Equal-3SAT  (see  Section  5).  This  is  done  by  de¬ 
signing  a  family  of  trees  we  call  {m,  n)-logipedes.  These 
trees  are  defined  purely  combinatorially  in  terms  of  in¬ 
cidences  of  vertices  and  edges.  We  show  that  logipedes 
are  uniquely  realizable.  Figure  1  shows  the  unique  real¬ 
ization  of  a  (3, 4)-logipede. 


Figure  1:  A  (3,4)-logipede 


The  realization  shown  in  the  Figure  places  the  tree  on  a 
triangular  grid.  We  prove  that  this  phenomenon  holds 
in  general.  Any  realization  of  a  logipede  as  a  Euclidean 
minimum  spanning  tree  draws  the  logipede  on  a  trian¬ 
gular  grid.  Here  we  emphasize  strongly  that  the  combi¬ 
natorial  structure  of  the  tree  forces  the  grid  to  appear 
implicitly  in  the  realization.  No  initial  requirement  is 
made  that  the  tree  must  be  embedded  on  a  grid,  or 
that  its  edge  lengths  must  be  uniform. 

To  transform  an  instance  of  Not-All-Equal-3SAT  with 
m  clauses  and  n  variables  to  an  instance  of  MSTR, 
we  construct  an  {m,  n)-logipede  and  use  it  to  provide 
a  framework  for  a  pair  of  (clause,  boolean  variable)  and 
(clause,  complement  variable)  incidence  matrices.  We 
regard  the  realization  of  an  (m,  n)-logipede  as  consist¬ 
ing  of  a  “spine”  whose  length  is  proportional  to  n,  to 
which  pairs  of  “legs”  are  attached  whose  lengths  depend 
on  m  and  the  place  of  attachment  along  the  spine.  Each 
pair  of  legs  represents  a  boolean  variable  and  its  com¬ 
plement,  and  the  legs  in  the  pair  act  like  columns  in  the 
two  incidence  matrices.  For  technical  reasons,  the  legs 
grow  in  length,  and  so  the  rows  of  the  incidence  ma¬ 
trices  cannot  be  visualized  as  parallel  to  the  spine,  but 
rather  as  forming  an  angle  with  it.  Details  are  given  in 
Section  5. 

The  occurence  or  not  of  a  literal  in  a  clause  is  indicated 
by  modifying  the  leg  corresponding  to  the  literal  at  the 
position  in  the  leg  that  corresponds  to  the  clause. 

To  define  logipedes  and  to  prove  that  they  are  uniquely 
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realizable,  we  construct  a  sequence  of  uniquely  realiz¬ 
able  “gadgets” .  The  next  section  describes  the  gadgets. 
Section  4  indicates  how  to  prove  they  are  uniquely  real¬ 
izable.  Section  5  proves  Theorem  1. 


3  The  Gadgets 

Since  our  proof  that  a  particular  gadget  is  uniquely  re¬ 
alizable  generally  depends  on  knowing  that  its  prede¬ 
cessors  are  uniquely  realizable,  the  sequence  of  gadgets 
given  here  can  be  regarded  as  a  route  through  the  proof 
of  logipede  unique  realizability. 

It  cannot  be  over-emphasized  that  the  gadgets  are 
purely  combinatorial  objects,  sometimes  called  free  or 
abstract  trees.  To  save  space,  however,  we  convey  the 
definition  of  most  gadgets  by  giving  a  realization  rather 
than  by  writing  out  a  combinatorial  definition. 

Definition  2  A  tree  consisting  of  a  degree  6  vertex  to¬ 
gether  with  its  six  neighbors  is  a  star.  The  degree  6  ver¬ 
tex  is  called  the  center  of  the  star,  and  the  other  vertices 
are  called  the  points  of  the  star. 

A  star  is  illustrated  in  two  ways  in  Figure  2;  combina- 
torially,  and  realized  as  a  minimum  spanning  tree. 


Figure  2:  A  star  and  its  realization 

Chains  of  stars  have  limited  but  multiple  possibilities 
for  realizations  (Figure  3).  Chains  of  stars  are  subtrees 
of  many  of  our  gadgets,  but  they  must  be  modified  to 
achieve  unique  realizability. 

Definition  3  The  tree  in  Figure  4  is  a  base.  The  star 
with  center  s  in  the  Figure  is  called  the  center  star  of 
the  base,  and  vertex  s  is  called  the  center  of  the  base. 
The  two  remaining  stars  are  calledleg  stars  of  the  base. 


Figure  3:  Two  realizations  of  a  star  chain 


Figure  4:  A  base  and  its  realization 

Definition  4  The  tree  in  Figure  5  is  a  spine  base.  The 
new  star  is  called  the  1st  or  leading  spine  star,  denoted 
Si. 


Definition  5  The  tree  in  Figure  6  is  a  short  spine  in 
realized  form.  The  two  new  stars  adjoined  to  the  1st 
leg  stars  are  called  2nd  leg  stars.  The  third  new  star  is 
called  the  cap  star. 


Definition  6  Figure  7  illustrates  a  spine  of  length  k 
for  k  —  7.  (A  spine  of  length  1  is  just  a  short  spine.) 

Definition  7  Figure  8  gives  a  realization  of  an  {m,  k)- 
legged  spine  for  m  =  6  and  k  =  7.  IVe  say  the  spine  has 


Figure  5:  A  spine  base  and  its  realization 


length  k  and  the  legs  have  length  m  because  the  number 
of  spine  stars  is  k,  and  the  number  of  leg  stars  in  each 
leg  is  m.  (A  {2,k)-legged  spine  is  just  a  spine  of  length 
k.) 


Definition  8  Figure  9  gives  a  realization  of  an 
(^77ii,  m2,  ij-skeleton  for  mj  —  6,  m2  =  11  and  k  = 
8.  It  is  created  by  joining  an  (m2,l)-legged  spine  to 
an  (mi,k  —  \)-legged  spine  as  shown.  We  say  it  has 
k  spine  stars,  or  length  k  for  short,  and  front  legs  of 
length  mj  and  rear  legs  of  length  m2. 


Figure  6:  A  realization  of  a  short  spine 


Figure  8:  A  (6,7)-legged  spine 

NOTATION.  Note  that,  given  an  (mi, m2, i)-skeleton, 
the  (mi,  I:  —  l)-legged  spine  and  the  (m2,  l)-legged  spine 
used  to  create  it  are  uniquely  identifiable.  The  spine 
stars  in  the  {mi,k—  l)-legged  spine  retain  their  names 
in  the  skeleton,  and  are  called  the  1st,  2nd,  ...,  k  —  1st 
spine  stars  (denoted  Si,  S2,  ■■■,  Sk-i)  of  the  skeleton. 
The  cap  star  of  the  (m2,  l)-legged  spine  becomes  the 
kth  spine  star  Sk  of  the  skeleton. 

Definition  9  Figure  1  of  the  previous  section  illus¬ 
trates  an  (m,n)-logipede  for  m  —  3  and  n  —  f.  An 
{m,n)-logipede  is  created  from  a  (2m,{2m-\-n-\-l),2n)- 
skeleton  as  shown  in  the  figure.  Here  the  m  and  n  re¬ 
fer  to  the  number  m  of  clauses  and  n  of  variables  in 
a  Not- all- equal- 3 SAT  instance  that  the  logipede  will  en¬ 
code,  and  the  numbers  of  stars  in  the  legs  and  the  spine 
are  functions  of  these  parameters. 

Definition  10  A  pre-(m,n)-logipede  is  the  tree  ob¬ 
tained  by  following  the  construction  for  an  (m,n)- 
logipede,  but  adjoining  star  chains  of  length  1  instead 
of  star  chains  whose  lengths  depend  on  m  and  the  index 
of  the  spine  star. 
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3.  In  D,  both  the  path  between  b  and  c  and  the  path 
between  a  and  b  contain  an  edge  of  length  1. 

Then  the  edge  sb  has  length  I  and  forms  an  angle  of  120° 
with  each  of  the  segments  ba  and  be. 


Figure  9:  A  (6,ll,8)-skeleton 

4  Uniqueness  Results 

^  ^  1  •  1  Figure  10:  Diagram  for  120-degree  Lemma 

This  section  outlines  the  proof  that  (m,  n)-logipedes  are 

uniquely  realizable.  Most  proofs  of  lemmas  are  straight¬ 
forward  and  have  been  omitted.  Some  are  left  for  flavor. 

Lemma  1  Suppose  that  S  is  a  finite  set  of  points  in  the 
plane,  T  is  a  minimum  spanning  tree  of  S,  and  S'  C  S. 

If  the  subgraph  T'  ofT  induced  by  S'  is  connected,  then 
T'  is  a  minimum  spanning  tree  of  S' . 

The  above  Lemma  is  used  as  follows.  Suppose  that  O' 
is  a  unique  realization  of  a  tree  T' ,  and  that  a  tree  T 
contains  T'.  Then  in  every  realization  D  of  T,  T'  must 
be  drawn  congruent  (after  scaling)  to  D'. 

Figure  11:  Realization  of  a  double  star 

Lemma  2  If  segments  ab  and  be  are  edges  in  a  Eu¬ 
clidean  minimum  spanning  tree,  then  labc  ts  the  largest  Corollary  1  Suppose  that  T  is  a  star  chain  of  length 

angle  in  Aabc.  2.  Then  T  is  uniquely  realizable  as  in  Figure  11. 

Lemma  3  In  every  realization  of  a  star,  all  edges  have 
the  same  length,  and  consecutive  edges  in  the  cyclic  or¬ 
dering  of  edges  around  the  center  form  60°  angles. 

Lemma  4  Let  D  be  a  realization  of  tree  T.  Suppose 
that 

1.  D  contains  three  vertices  a,  b,  c  such  that  segments 

ab  and  be  are  of  length  I  and  form  an  angle  of  120°.  Figure  12:  Diagram  for  5-hex  Lemma 

(These  segments  are  not  necessarily  edges  of  D). 

2.  Vertex  b  is  a  point  of  a  star  with  center  s,  where  Lemma  5  Suppose  that  in  some  realization  D  of  tree 

the  star  is  disjoint  from  a  and  c.  T,  a  vertex  c  of  T  is  located  at  the  center  of  a  regular 
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hexagon  of  side  length  I,  at  least  five  of  whose  vertices 
are  occupied  by  vertices  ofT,  which  need  not  be  adjacent 
to  c  or  to  each  other  in  T.  Suppose  further  that  for  each 
vertex  V  ofT  located  at  a  vertex  of  the  hexagon,  there  is 
a  path  in  D  connecting  v  to  c  that  contains  at  least  one 
edge  of  length  I  (Figure  12).  Then  each  neighbor  w  of  c 
must  be  located  at  one  of  the  6  vertices  of  the  hexagon. 


Figure  13:  Diagram  for  3-claw  Lemma 

Lemma  6  Let  T  be  a  tree  formed  be  adjoining  three 
new  leaves  to  one  of  the  points  b  of  a  star.  Then  in  any 
realization  D  ofT,  b  is  located  at  the  center  of  a  regular 
hexagon  whose  vertices  are  occupied  by  the  center  of  the 
star,  two  points  of  the  star,  and  the  three  new  leaves 
adjoined  to  point  b.  (See  Figure  13.) 

A  vertex  of  a  realization  D  of  a  tree  T  is  called  a  hexagon 
centerii  it  is  the  center  of  a  regular  hexagon  of  side  equal 
to  the  minimum  edge  length  in  D,  such  that  at  least  5 
of  the  vertices  of  the  hexagon  are  occupied  by  vertices 
of  D. 

Lemma  7  Suppose  that 

1.  D  is  a  realization  of  a  tree  T  in  which  a  vertex  c 
IS  located  at  the  center  of  a  regular  hexagon  with  side 
length  1;  denote  the  vertices  of  the  hexagon  by  hi,...hQ 
in  cyclic  order. 

2.  Vertices  hi,  /12  and  hs  of  the  hexagon  are  occupied  by 
vertices  Vi,  V2  and  V3  of  D,  where  for  each  Uj,  1  <  i  <  3, 
there  is  a  path  in  D  connecting  u,-  to  c  and  containing 
an  edge  of  length  1. 

3.  Vertex  c  is  adjacent  in  T  to  a  vertex  s  of  degree  6, 
s  ^Vi,V2  or  V3. 

Then: 

-  Vertex  s  is  located  at  vertex  /15  of  the  hexagon,  and  the 
edges  between  s  and  its  neighbors  have  length  I,  and 

-  The  two  neighbors  of  s  that  precede  and  follow  c  in  the 
cyclic  ordering  of  the  neighbors  of  s  occupy  vertices 
and  hs  of  the  hexagon. 


Figure  14:  Diagram  for  3-hex  Lemma. 

Proof:  In  D,  let  e  and  /  denote  the  neighbors  of  s  that 
precede  and  follow  c  in  the  cyclic  ordering  of  vertices 
adjacent  to  s,  as  in  Figure  14.  The  fact  that  D  is  a 
minimum  spanning  tree  places  constraints  on  the  loca¬ 
tions  of  e  and  /.  By  Lemma  3,  the  segments  ce  and  cf 
must  form  an  angle  of  120°. 

The  only  solution  to  these  constraints  is  to  place  e  and 
/  at  vertices  /14  and  hg  of  the  hexagon,  which  by  Lemma 
3  determines  the  placement  of  s  and  the  set  of  locations 
occupied  by  its  neighbors.  □ 

With  the  machinery  of  the  preceding  technical  lemmas, 
it  is  possible  to  prove  in  order  the  unique  realizability  of 
the  following  sequence  of  gadgets  (we  omit  the  proofs; 
see  the  figures  of  the  previous  sections  for  the  realiza¬ 
tions):  spine  bases,  short  spines,  (m,  i)-legged  spines, 
(mi,m2,  F)-skeletons,  pre-(m,  n)-logipedes,  and  (m,  n)- 
logipedes. 


5  The  Transformation 

This  section  uses  the  uniqueness  of  the  logipede  real¬ 
ization  to  transform  the  NP-complete  problem  Not-All- 
Equal-3SAT  [6]  to  MSTR. 

Not-All-Equal-3-Sat  (NAESS) 

Instance:  A  set  C  of  m  clauses  Ci,  C2, ...,  Cm,  each  con¬ 
taining  3  literals  from  a  set  A  of  n  boolean  variables 
Xi,X2,...,Xn  and  their  complements. 

Question:  Can  consistent  truth  values  be  assigned  to 
the  literals  so  that  each  clause  contains  at  least  one  true 
literal  and  at  least  one  false  literal? 

Without  loss  of  generality,  we  assume  that  no  clause  in 
C  contains  both  a  variable  Xj  and  its  complement  Aj . 

Our  transformation  follows  the  paradigm  used  by  Bhatt 
and  Cosmodakis  [11],  Idicula  [7],  Gregori  [8]  and  Bran- 
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denburg  [2].  We  modify  an  (m,  n)-logipede  so  that  the 
modified  logipede  can  be  realized  (although  not  neces¬ 
sarily  uniquely)  if  and  only  if  the  NAE3S  instance  being 
transformed  is  a  yes  instance. 

Consider  two  mxn  incidence  matrices  defined  as  follows. 
Row  i  of  each  matrix  represents  clause  c,-;  in  one  ma¬ 
trix,  column  j  represents  variable  Xj,  and  in  the  other 
matrix,  column  j  represents  the  complement  Xj  of  vari¬ 
able  Xj.  In  the  (clause,  variable)  matrix,  entry  (i,j)  is 
1  if  clause  i  contains  variable  Xj ;  otherwise,  the  entry  is 
0.  The  entries  of  the  (clause,  complementary-variable) 
matrix  are  defined  similarly. 

An  (m,  n)-logipede  represents  these  two  matrices  as  fol¬ 
lows.  Let  Ij  and  Ij  denote  the  leg  chains  associated 
with  the  jth  odd-indexed  spine  star  5'2j-i,  1  <  J  <  «■ 
These  legs  represent  variable  Xj  and  its  complement 
Xj,  respectively.  For  geometric  reasons,  it  is  conve¬ 
nient  to  make  the  first  j  —  I  stars  closest  to  the  spine 
in  legs  Ij  and  1)  serve  as  padding  that  does  not  repre¬ 
sent  any  positions  of  the  incidence  matrices.  Beyond  the 
padding,  every  other  star  in  a  leg  chain  represents  an 
incidence  matrix  position.  In  particular,  position  {i,j) 
in  the  (clause,  variable)  matrix  is  represented  by  a  star 
in  leg  Ij ,  namely  by  star  (j  —  1)  -f  (2i  -  1),  where  the  leg 
star  closest  to  the  spine  is  regarded  as  the  first  star  in 
the  leg. 

Lemma  8  There  is  a  polynomial  time  transformation 
from  NAESS  to  MSTR. 

Proof:  Given  an  NAE3S  instance  with  m  clauses  and  n 
variables,  construct  an  (m,  n)-logipede  T,  and  then  add 
to  it  as  follows. 

1.  If  variable  Xj  appears  in  clause  Ci  (so  by  assump¬ 
tion  X'j  does  not  appear  in  c,),  select  two  degree  1 
pointsofstar  (j  —  l)-f  (2z— 1)  in  leg  Ij  for  modifica¬ 
tion.  Adjoin  two  new  leaves  to  one  of  the  selected 
points,  and  one  new  leaf  to  the  other  selected  point. 
Do  not  modify  leg  /j. 

2.  Similarly,  if  Xj,  but  not  Xj,  appears  in  c,-,  then 
adjoin  new  leaves  as  above  to  two  points  of  star 
(j  —  1)  -b  (2i  -  1)  in  leg  /j  but  do  nothing  to  leg  Ij. 

3.  If  neither  Xj  nor  Xj  appears  in  c,-,  then  adjoin  new 
leaves  as  above  both  in  Ij  and  in  /j . 

Clearly  this  transformation  can  be  done  in  polynomial 
time  in  the  length  of  the  NAESS  instance.  The  set  of 
three  new  leaves  adjoined  to  a  star  that  is  modified  is 
called  a  striker.  Figure  15  gives  one  possible  realization 


of  the  logipede  with  strikers  corresponding  to  the  yes 
instance  Ci  =  X[  OR  X'^  OR  Xg,  C2  =  X{  OR  Xj  OR 
X4,  and  C3  =  X^  OR  X^  OR  X4. 

Suppose  that  the  NAESS  instance  is  a  yes  instance.  The 
underlying  unlabelled  {in,  n)-logipede  has  a  unique  re¬ 
alization.  However,  the  labelled  leg  Ij  could  be  drawn 
on  either  side  of  the  spine,  with  leg  /j  drawn  on  the 
opposite  side.  Consider  a  consistent  truth  assignment 
to  variables  such  that  each  clause  contains  at  least  one 
false  literal  and  at  least  one  true  literal.  A  realization  of 
the  modified  logipede,  with  strikers  added,  can  be  ob¬ 
tained  from  such  a  truth  assignment  by  placing  all  legs 
whose  corresponding  literals  are  true  on  the  same  side  of 
the  spine,  and  placing  all  remaining  legs  on  the  opposite 
side  of  the  spine.  The  details  are  straightforward. 

To  show  that  only  yes  instances  of  NAE3S  are  trans¬ 
formed  to  yes  instances  of  MSTR,  arbitrarily  choose  a 
true  side  and  a  false  side  of  the  spine  in  the  minimum 
spanning  tree  realization.  Set  the  literals  correspond¬ 
ing  to  legs  appearing  on  the  true  side  to  true  and  the 
remaining  literals  to  false.  In  the  example  shown  in 
Figure  15,  the  assignment  of  truth  values  read  off  from 
the  realization  would  be  Xi  —  X3  =  X4  =  true  and 
X2  =  false,  or  the  complements  of  these  values.  It  is 
not  hard  to  show  that  this  strategy  works  in  general. 

□ 


true  side 


ii  H  13  I4 


Figure  15:  One  realization  of  a  transformed  yes  instance 
Now  Theorem  1  follows  immediately. 
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In  conclusion,  we  note  that  the  proof  of  Theorem  1  con¬ 
structs  a  tree  whose  only  realization  is  on  a  triangular 
grid.  Furthermore,  any  tree  drawn  on  a  triangular  grid 
such  that  tree  edges  all  have  unit  length  is  a  minimum 
spanning  tree.  Thus  our  transformation  can  be  used  to 
show  that  the  problem,  considered  in  [7],  of  drawing  a 
tree  on  a  triangular  grid  so  that  every  edge  has  unit 
length  is  NP-complete. 
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Abstract 

In  this  paper,  we  present  an  optimal  parallel  random¬ 
ized  algorithm  for  the  Voronoi  diagram  of  a  set  of  n 
non-intersecting  (except  possibly  at  endpoints)  line  seg¬ 
ments  in  the  plane.  Our  algorithm  runs  in  O(logn)  time 
with  very  high  probability  and  uses  0{n)  processors  on  a 
CROW  PRAM.  This  algorithm  is  optimal  in  terms  of  P.T 
bounds  since  the  sequential  time  bound  for  this  problem 
is  Q(n  log  n).  Our  algorithm  improves  by  an  (9(log  n)  fac¬ 
tor  the  previously  best  known  deterministic  parallel  algo¬ 
rithm  which  runs  in  O(log^n)  time  using  0(n)  processors 
[13].  We  obtain  this  result  by  using  random  sampling 
at  “two  stages”  of  our  algorithm  and  using  efficient  ran¬ 
domized  search  techniques.  This  technique  gives  a  direct 
optimal  algorithm  for  the  Voronoi  diagram  of  points  as 
well  (all  other  optimal  parallel  algorithms  for  this  prob¬ 
lem  use  reduction  from  the  3-d  convex  hull  construction). 

1  Introduction 

Voronoi  diagrams  are  elegant  and  versatile  geometric 
structures  which  have  applications  for  a  wide  range  of 
problems  in  computational  geometry  and  other  areas.  For 
example,  computing  the  minimum  weight  spanning  tree, 
or  the  all-nearest  neighbor  problem  for  a  set  of  line  seg¬ 
ments  can  be  solved  immediately  and  efficiently  from  the 
Voronoi  diagram  of  line  segments.  As  we  learnt  from  [17], 
these  diagrams  can  also  be  used  to  compute  a  country’s 
territorial  waters!  Certain  two-dimensional  motion  plan¬ 
ning  problems  can  be  solved  quickly  when  the  Voronoi 
diagram  is  available  [19].  Thus,  exploiting  parallelism  to 
obtain  faster  solutions  is  a  desirable  goal.  In  this  paper, 
we  are  interested  in  developing  an  optimal  parallel  ran¬ 
domized  algorithm  on  a  PRAM  (Parallel  Random  Access 
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Machine)  for  the  construction  of  the  Voronoi  diagram  of  a 
set  of  non-intersecting  (except  possibly  at  endpoints)  line 
segments  in  the  plane.  The  first  sequential  algorithm  for 
this  problem  was  given  by  Lee  and  Drysdale  [17],  which 
runs  in  O(nlog^n)  time.  This  run-time  was  later  improved 
to  O(nlogn)  in  numerous  papers  (Kirkpatrick  [15],  Yap 
[23]  and  Fortune  [10]),  which  is  optimal  since  sorting  can 
be  reduced  to  this  problem.  The  best  known  parallel  de¬ 
terministic  algorithm  was  given  by  Goodrich,  O’Diinlaing 
and  Yap  [13]  and  runs  in  O(log^n)  time  using  0(n)  pro¬ 
cessors.  It  seems  unlikely  that  existing  deterministic  tech¬ 
niques  can  be  used  to  improve  this  run-time. 

Recent  years  have  seen  significant  advances  in  paral¬ 
lel  algorithm  design  for  computational  geometry  prob¬ 
lems.  Some  of  the  earliest  work  in  this  area  was  done 
by  Chow  [5]  and  Aggarwal  ei.  al.  [1],  who  gave  paral¬ 
lel  algorithms  for  various  fundamental  problems  such  as 
two-dimensional  convex  hulls,  trapezoidal  decomposition 
etc.  A  number  of  these  algorithms  have  since  been  im¬ 
proved.  Atallah,  Cole  and  Goodrich  [3]  demonstrated  op¬ 
timal  deterministic  algorithms  for  many  of  these  problems 
by  applying  the  versatile  technique  of  cascading  divide- 
and-conquer  and  building  on  data  structures  developed  in 

[1] .  Reif  and  Sen  [22]  also  obtained  optimal  randomized 
parallel  algorithms  for  a  number  of  these  problems;  these 
algorithms  use  n  processors  and  run  in  (9(log  n)  time  with 
very  high  probability. 

The  important  problems  of  constructing  Voronoi  dia¬ 
grams  of  points  in  two  dimensions  and  convex  hulls  of 
points  in  three  dimensions,  however,  eluded  optimal  par¬ 
allel  solutions  for  a  long  time.  Both  these  problems  have 
sequential  run-times  of  O(nlogn).  Aggarwal  et.  al.  [1] 
gave  O(log^n)  and  O(log^n)  time  algorithms  (using  n  pro¬ 
cessors)  for  the  Voronoi  diagram  and  convex  hull  prob¬ 
lems,  respectively,  and  the  technique  of  cascaded-merging 
could  not  be  extended  to  these  problems  to  improve  their 
run-times  [8].  Very  recently,  Goodrich  [12]  has  given  an 
algorithm  for  3-d  convex  hulls  that  does  optimal  work, 
but  has  O(log^n)  run-time,  and  Amato  and  Preparata 

[2]  have  described  an  algorithm  that  runs  in  O(logn) 
time  but  uses  processors.  Randomization,  however, 
proves  to  be  very  useful  in  obtaining  optimal  run-time  as 


57 


well  as  optimal  P.T  bounds^.  In  [21],  Reif  and  Sen  gave 
an  optimal  randomized  algorithm  for  the  construction  of 
the  convex  hull  of  points  in  three  dimensions.  Since  the 
problem  of  finding  the  Voronoi  diagram  of  points  in  two 
dimensions  can  be  reduced  to  the  three-dimensional  con¬ 
vex  hull  problem,  they  also  obtained  an  optimal  parallel 
method  for  the  former.  Their  algorithm  runs  in  O(logTi) 
time,  using  n  processors,  with  very  high  probability,  and 
there  are  no  known  deterministic  algorithms  that  match 
these  bounds. 

2  The  Use  of  Randomization 

Recent  work  by  Clarkson  and  Shor  [6],  Mulmuley  [18], 
and  Haussler  and  Welzl  [14]  has  shown  that  random¬ 
ization  can  be  used  to  obtain  better  upper  bounds  for 
various  geometric  problems  such  as  higher-dimensional 
convex  hulls,  halfspace  range  reporting,  segment  inter¬ 
sections,  linear  programming  etc.  Clarkson  and  Shor  [6] 
used  random  sampling  techniques  to  obtain  tight  bounds 
on  the  expected  use  of  resources  by  algorithms  for  various 
geometric  problems.  The  main  idea  behind  their  general 
technique  is  to  use  random  sampling  to  divide  the  prob¬ 
lem  into  smaller  ones.  The  manner  in  which  the  random 
sample  is  used  to  divide  the  original  input  into  subprob¬ 
lems  depends  on  the  particular  geometric  problem  under 
consideration.  They  showed  that  for  a  variety  of  such 
problems,  given  a  randomly  chosen  subset  R  of  the  input 
set  S,  the  expected  size  of  each  subproblem  is  0(|5|/|i?|) 
and  the  expected  total  size  is  0(|5|).  A  sample  that  sat¬ 
isfies  these  conditions  is  said  to  be  good,  and  bad  other¬ 
wise.  They  showed  that  any  randomly  chosen  sample  is 
good  with  constant  probability,  and  hence  bad  with  con¬ 
stant  probability  as  well.  This  allows  us  to  obtain  bounds 
on  the  expected  use  of  resources,  but  does  not  give  high 
probability  results  (i.e.  bounds  that  hold  with  probability 
>  (1  —  l/n“),  where  n  is  the  input  size,  and  a  >  0).  As 
pointed  out  by  Reif  and  Sen  [21],  this  fact  proves  to  be 
an  impediment  in  the  parallel  environment  due  to  the  fol¬ 
lowing  reason:  Parallel  algorithms  for  such  problems  are, 
typically,  recursive.  For  sequential  algorithms,  since  the 
expectation  of  the  sum  is  the  sum  of  expectations,  it  is 
enough  to  bound  the  expected  run-time  of  each  step.  For 
recursive  parallel  algorithms,  the  run-time  at  any  stage  of 
the  recursion  will  be  the  maximum  of  the  run-times  of  the 
subproblems  spawned  at  that  stage.  There  is  no  way  of 
determining  the  maximum  of  expected  run-times  without 
using  higher  moments.  Moreover,  even  if  we  can  bound 
the  expected  run-time  at  the  lowest  level  of  the  recursion, 
this  bound  turns  out  to  be  too  weak  to  bound  the  total 
run-time  of  the  algorithm. 

In  [21],  Reif  and  Sen  give  a  novel  technique  to  over¬ 
come  this  problem.  A  parallel  recursive  algorithm  can  be 

^Sorting  can  be  reduced  to  these  problems,  and  hence  the 
best  possible  run-time  will  be  0(log  n)  on  EREW  and  CREW 
PRAMs. 


thought  of  as  a  process  tree,  where  a  node  corresponds  to 
a  procedure  at  a  particular  stage  of  the  recursion,  and  the 
children  of  that  node  correspond  to  the  subproblems  cre¬ 
ated  at  that  stage.  The  basic  idea  of  the  technique  given 
in  [21]  is  to  find,  at  every  level  of  the  process  tree,  a  good 
sample  with  high  probability.  By  doing  this,  they  can 
show  that  the  run-time  of  the  processes  at  level  i  of  the 
tree  is  0(log  n/2')  with  very  high  probability,  and  hence 
the  run-time  of  the  entire  algorithm  is  0(log  n)  with  very 
high  probability.  By  choosing  a  number  of  random  sam¬ 
ples  (say  fl'(n)  of  them;  typically  (/(n)  =  O(logn)),  we  are 
guaranteed  that  one  of  them  will  be  good  with  high  likeli¬ 
hood.  The  procedure  to  determine  if  a  sample  is  good  or 
not  will  have  to  be  repeated  for  each  of  the  giji)  samples. 
However,  we  would  have  to  ensure  that  this  would  not 
cause  the  processor  bound  of  0(n)  to  be  exceeded  and 
this  is  done  by  polling  i.e.  using  only  a  fraction  of  the 
input  {l/g{n),  typically)  to  determine  the  “goodness”  of 
a  sample.  The  idea  is  that  the  assessment  of  a  sample 
(good  or  bad)  made  from  this  smaller  set  is  a  very  good 
reflection  of  the  assessment  that  would  be  made  from  the 
entire  set.  This  method  finds  a  good  sample  efficiently  at 
every  level  of  the  process  tree,  and  is  useful  for  converting 
expected  value  results  into  high  probability  results. 

Note  that  the  total  size  of  the  subproblems  can  be 
bound  to  only  within  a  constant  multiple  of  the  origi¬ 
nal  problem  size.  Thus  in  a  process  tree  of  (9(loglogn) 
levels,  this  could  lead  to  a  polylogarithmic  factor  increase 
in  processor  bound.  In  [6],  Clarkson  and  Shor  get  around 
this  problem  by  using  only  a  constant  number  of  levels 
of  recursion  in  their  algorithm  and  combining  this  with 
incremental  techniques  (which  are  inherently  sequential; 
see  [6]  for  further  details).  As  mentioned  earlier,  Reif  and 
Sen’s  [21]  strategy  to  handle  this  problem  is  to  eliminate 
redundancy  at  every  level  of  the  process  tree.  This  step 
is  non-trivial  and  quite  problem-specific.  Our  approach 
in  this  paper  gets  rid  of  this  need  altogether.  In  other 
words,  we  do  not  need  to  devise  a  method  to  control  to¬ 
tal  problem  size  at  every  level  of  the  process  tree.  The 
basic  idea  is  to  use  random  sampling  at  two  stages  of 
the  algorithm.  We  show  that  the  polylog  factor  increase 
in  processor  bound  actually  gives  us  a  method  to  choose 
much  larger  samples,  and  this  allows  us  to  ignore  the 
problem  of  increase  in  processor  bound.  By  developing 
efficient  search  strategies  to  determine  subproblems,  we 
are  able  to  obtain  an  optimal  algorithm  for  the  Voronoi 
diagram  of  line  segments  in  the  plane.  We  think  that  our 
approach  is  general  enough  to  apply  to  other  problems  as 
well. 

We  would  like  to  point  out  that  our  strategy  applies 
to  the  Voronoi  diagram  of  points  in  the  plane  as  well, 
thus  giving  a  direct  algorithm  for  this  problem  (instead 
of  using  the  reduction  to  3-d  convex  hulls).  During  the 
course  of  the  paper,  we  will  point  out  the  analogous  steps 
for  the  case  of  points  in  the  plane. 
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Figure  1:  The  bisector  of  line  segments  Sx  and  S2- 

3  Preliminaries  and  Definitions 

The  parallel  model  of  computation  that  will  be  used 
in  this  paper  is  the  Concurrent  iiead  Concurrent  ITrite 
(CRCW)  PRAM.  This  is  the  sychronous  shared  memory 
model  of  parallel  computation  in  which  processors  may 
read  from  or  write  into  a  memory  cell  simultaneously. 
Write  conflicts  are  resolved  arbitrarily;  in  other  words, 
the  protocol  used  for  resolving  such  conflicts  does  not 
affect  our  algorithm.  Each  processor  performs  standard 
real-arithmetic  operations  in  constant  time.  In  addition, 
each  processor  has  access  to  a  random  number  generator 
that  returns  a  random  number  of  C(logn)  bits  in  con¬ 
stant  time.  A  randomized  parallel  algorithm  A  is  said  to 
require  resource  bound  f(n)  with  very  high  probability  if, 
for  any  input  of  size  n,  the  amount  of  resource  used  by 
A  is  at  most  c.a.f{n)  with  probability  >  1  —  l/n“  for 
positive  constants  c,  a  (a  >  1).  O  is  used  to  represent 
the  complexity  bounds  of  randomized  algorithms  i.e.  A 
is  said  to  have  resource  bound  0(f{n)).  The  following  is 
an  important  theorem. 

Theorem  3.1  (Reif  and  Sen,  [22])  Given  a  process  tree 
that  has  the  property  that  a  procedure  at  depth  i  from  the  root 
takes  timeT,  such  that  Pr[Ti  >  fc(e')'alogn]  <  1  aiogn^ 

then  all  the  leaf-level  procedures  are  completed  in  0(log  n) 
time,  where  k  and  a  are  constants  greater  than  zero,  and 
0  <  e'  <  1. 

Note  that  the  number  of  levels  in  the  process  tree  will 
be  O(loglogn).  Intuitively,  the  above  theorem  says  that 
if  the  time  taken  at  a  node  which  is  at  a  distance  i  from 
the  root  is  0((logn)/2*)  with  high  probability,  then  the 
run-time  of  the  entire  algorithm  is  O(logn). 

3.1  Voronoi  Diagrams 

We  now  give  definitions  and  establish  some  notation  for 
the  geometric  objects  of  interest  in  this  paper  i.e.  Voronoi 
diagrams  (these  definitions  are  standard;  see  e.g.  [13, 
17]).  Let  S  be  a  set  of  nonintersecting  closed  line  segments 
in  the  plane.  For  simplicity  of  description,  we  will  assume 
that  the  elements  of  S  are  in  general  position  i.e.  no 
three  of  them  are  cocircular.  Following  the  convention 
in  [17,  23],  we  will  consider  each  segment  s  G  R  to  be 


composed  of  three  distinct  objects:  the  two  endpoints  of 
s  and  the  open  line  segment  bounded  by  those  endpoints. 
The  Euclidean  distance  between  two  points  p  and  q  is 
denoted  by  d(p,  q).  The  projection  of  a  point  q  on  to 
a  closed  line  segment  s  with  endpoints  a  and  b,  denoted 
proj(q,  s),  is  defined  as  follows:  Let  p  be  the  intersection 
point  of  the  straight  line  containing  s  (call  this  line  s), 
and  the  line  going  through  q  that  is  perpendicular  to  s . 
If  p  belongs  to  s,  then  proj{q,  s)  =  p.  If  not,  then 
proj{q,  s)  =  aifd{q,  a)  <  d{q,  b)  and  proj{q,  s)  =  b, 
otherwise.  The  distance  of  a  point  q  from  a  closed  line 
segment  s  is  nothing  but  d{q,  proj(q,  s)).  By  an  abuse 
of  notation,  we  denote  this  distance  as  d(g,  s).  Let  Si 
and  S2  be  two  objects  in  S.  The  bisector  of  Si  and  S2, 
B(si,  S2),  is  the  locus  of  all  points  q  that  are  equidistant 
from  Si  and  S2  i-s.  d{q,  si)  =  d{q,  S2).  Since  the  objects 
in  S  are  either  points  or  open  line  segments,  the  bisectors 
will  either  be  parts  of  lines  or  parabolas.  The  bisector  of 
two  line  segments  is  shown  in  Figure  1.  Note  that  if  S 
is  a  set  of  points,  all  the  bisectors  are  parts  of  straight 
lines.  The  formal  definition  is  stated  below,  followed  by 
an  important  theorem. 

Definition  3.2  The  Voronoi  region,  V{s),  associated  with  an 
object  s  in  S  is  the  locus  of  all  points  that  are  closer  to  s  than 
to  any  other  object  in  S  i.e.  V(s)  =  {p  \  d{p,  s)  <  d{p,  s') 
for  all  s'  €  5}.  The  Voronoi  diagram  of  S,  Vor(5),  is  the 
union  of  the  Voronoi  regions  V{s),  s  £  S.  The  boundary 
edges  of  the  Voronoi  regioons  are  called  Voronoi  edges,  and 
the  vertices  of  the  diagram,  Voronoi  vertices. 

Theorem  3.3  (Lee  et  al.  [17])  Given  a  set  S  of  n  objects 
in  the  plane  (in  this  paper,  these  objects  will  either  be  noninter¬ 
secting  closed  line  segments  or  points),  the  number  of  Voronoi 
regions,  Voronoi  edges,  and  Voronoi  vertices  o/Vor(S)  are  all 
0(n).  To  be  precise,  for  n  >  3,  Vor(S)  has  at  most  n  vertices 
and  at  most  3n  —  5  edges. 

It  is  convenient  to  divide  Voronoi  regions  into  smaller 
regions  by  augmenting  the  diagram  in  the  following  way 
(such  an  augmentation  was  originally  proposed  by  Kirk¬ 
patrick  [15]):  If  t)  is  a  Voronoi  vertex  of  V(s),  and  if 
v'  =  proj{v,  s),  then  the  line  segment  obtained  by  join¬ 
ing  V  and  v'  is  a  spoke  of  V (s).  The  spokes  define  new  sub- 
regions  within  V(s).  These  sub-regions  bounded  by  two 
spokes,  part  of  s  and  a  Voronoi  edge  are  called  primitive 
regions  (prims)  [13].  Some  spokes  and  prims  are  shown  in 
Figure  1.  Hereafter,  all  references  to  the  Voronoi  diagram 
will  assume  that  it  is  augmented  as  above. 

In  the  case  of  the  Voronoi  diagram  of  points,  all  bisec¬ 
tors  are  straight  lines  and  hence  two  bisectors  can  inter¬ 
sect  at  most  once.  However  for  line  segments,  the  bisec¬ 
tors  are  composed  of  straight  line  segments  and  parabolic 
arcs.  Consequently,  we  give  the  following  lemma,  which 
will  be  of  use  to  us  later  on  in  the  paper. 

Lemma  3.4  Given  line  segments  s,  Si  and  S2  in  the 
plane,  the  two  bisectors  B(s,  si)  and  B(s,  S2)  can  inter¬ 
sect  at  most  twice. 
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We  give  below  some  results  that  will  be  useful  for  the 
brute  force  construction  of  the  Voronoi  diagram  of  a  sam¬ 
ple  of  the  appropriate  size,  and  will  be  used  in  the  course 
of  our  algorithm. 

Lemma  3.5  The  Voronoi  diagram  of  a  set  of  n  line  seg¬ 
ments  in  the  plane  can  he  constructed  on  a  CREW  PRAM 
in  O(logn)  time  using  rV'  processors.  (Proof  Omited) 

Lemma  3.6  The  Voronoi  diagram  of  a  set  of  n  points 
in  the  plane  can  be  constructed  on  a  CREW  PRAM  in 
O(logn)  using  ri^  processors. 

The  above  lemma  follows  easily  from  the  fact  that  in¬ 
tersections  of  n  half-planes  can  be  found  in  O(logn)  time 
using  n  processors  [1,  4].  (Amato  and  Preparata’s  [2] 
method,  though  a  more  complicated  approach,  would  give 
us  the  same  result.)  The  following  are  well-known  non- 
optimal  parallel  algorithms  that  will  be  used  when  the 
input  to  a  subproblem  is  of  an  appropriately  small  size. 
Lemma  3.7  (Goodrich  et.  al.,  [13])  The  Voronoi  di¬ 
agram  of  a  set  of  n  line  segments  in  the  plane  can  be 
constructed  on  a  CREW  PRAM  in  O(log^n)  time  using 
n  processors. 

Lemma  3.8  (Aggarwal  et.  al.,  [1])  The  Voronoi  dia¬ 
gram  of  a  set  of  n  points  in  the  plane  can  be  constructed 
on  a  CREW  PRAM  in  O(log^n)  time  using  n  processors. 

4  Randomized  Algorithms  for 
Voronoi  Diagrams 

In  this  section,  we  develop  an  optimal  parallel  randomized 
algorithm  for  the  construction  of  the  Voronoi  diagram  of 
a  set  of  line  segments  in  the  plane.  As  we  mentioned  in 
Section  2,  Reif  and  Sen  [21]  use  the  novel  technique  of 
polling  to  give  an  optimal  randomized  algorithm  for  the 
three-dimensional  convex  hull  problem.  This  immediately 
gives  an  optimal  randomized  method  for  the  Voronoi  dia¬ 
gram  construction  of  a  set  of  points  in  the  plane  because 
this  problem  is  O(logn)  (parallel)  time  reducible  to  the  3- 
d  convex  hull  problem.  However,  no  analogous  reduction 
is  at  hand  for  the  case  of  line  segments.  Our  optimal  par¬ 
allel  randomized  algorithm  tackles  the  Voronoi  diagram 
problem  directly  which,  to  our  knowledge,  has  not  been 
done  before.  The  technique  that  we  present  in  the  forth¬ 
coming  sections  is  general  enough  that  it  can  be  applied 
to  the  case  of  line  segments  as  well  as  points  in  the  plane. 
As  a  result,  we  also  obtain  a  new  and  simpler  randomized 
parallel  method  for  the  Voronoi  diagram  of  points.  Note: 
Even  though  we  restrict  our  attention  to  line  segments  in 
the  plane,  this  technique  would  work  even  when  the  input 
contains  circular  arcs  (see  [23]  for  relevant  details). 

4.1  Outline  of  the  Method 

Let  S  =  {si,  S2,  . . . ,  Sn}  be  the  input  set  of  line  seg¬ 
ments  in  the  plane  and  let  i?  be  a  random  sample  from 
S.  Let  |i?|  =  for  some  0  <  e  <  1.  The  sample 
R  will  be  used  to  divide  the  original  input  S  into  smaller 


subproblems  so  that  each  of  these  can  be  solved  in  par¬ 
allel.  The  technique  of  using  a  random  sample  to  divide 
the  original  problem  into  subproblems  will  be  useful  only 
if  we  can  show  that  the  subproblems  are  of  roughly  equal 
size  (0(ri.'^“'^))  and  that  the  total  size  of  all  the  subprob¬ 
lems  is  almost  equal  to  the  size  of  the  original  problem 
(0(n)).  The  expected  value  resource  bounds  for  com¬ 
putational  geometry  problems  obtained  by  Clarkson  and 
Shor  [6]  will  be  applied  here.  As  mentioned  in  Section  2, 
due  to  the  nature  of  randomized  parallel  algorithms  it  is 
necessary  to  efficiently  find,  at  each  level  of  the  process 
tree,  such  a  sample  with  high  probability.  The  technique 
of  polling  developed  by  Reif  and  Sen  [21]  will  be  utilised 
towards  this  end.  So  a  crude  outline  of  the  algorithm 
could  be  as  follows. 

(1) .  Construct  the  Voronoi  diagram  of  R  using  the  brute 
force  technique  (Lemma  3.5/Lemma  3.6).  Call  this  dia¬ 
gram  VojfR).  We  use  Vor{R)  to  divide  the  original  prob¬ 
lem  into  smaller  problems  which  will  be  solved  in  parallel. 

(2) .  Process  Vor{R)  appropriately  in  order  to  efficiently 
find  the  input  set  of  line  segments  to  each  of  these  sub¬ 
problems. 

(3) .  Recursively  compute  (in  parallel)  the  Voronoi  dia¬ 
gram  of  each  subproblem. 

(4) .  Obtain  the  final  Voronoi  diagram  from  the  recur¬ 
sively  computed  Voronoi  diagrams. 

By  choosing  an  appropriate  e,  we  can  ensure  that  the 
first  step  can  be  done  in  0(log77)  time  using  n  processors. 
In  Section  4.4,  we  describe  the  use  of  a  randomized  search 
technique  in  order  to  efficiently  find  the  subproblems  de¬ 
fined  by  a  chosen  sample.  We  show  that  step  2  can  be 
carried  out  in  O(logn)  time  with  high  probability  using 
n  processors.  In  Section  4.5,  we  describe  the  merge  tech¬ 
nique  to  compute  the  Voronoi  diagram  from  recursively 
computed  Voronoi  diagrams  (this  step  is  non-trivial)  and 
show  that  the  final  merge  step  can  be  done  in  O(logn) 
time  using  n  processors.  Thus,  the  recurrence  relation 
for  the  run-time  is  T(n)  =  T{rP~^)  -|-  0(log n),  which 
solves  to  O(logn)  (this  follows  from  Theorem  3.1).  How¬ 
ever,  the  description  of  the  algorithm  that  we  have  given 
here  is  incomplete. 

As  mentioned  earlier,  there  is  a  side-effect  in  such  re¬ 
cursive  algorithms  that  is  important  to  consider.  When 
we  use  a  random  sample  to  divide  the  original  problem 
into  smaller  ones,  we  can  succeed  in  bounding  the  total 
size  of  the  subproblems  to  only  within  a  constant  multiple 
of  n.  In  a  recursive  algorithm,  this  results  in  an  increase 
in  the  total  problem  size  as  the  number  of  recursive  levels 
increases.  For  a  sample  size  of  O(n^),  the  depth  of  the 
process  tree  for  a  parallel  randomized  algorithm  would 
be  O(loglogn),  and  even  this  could  result  in  a  polyloga- 
rithmic  factor  increase  in  the  total  problem  size.  In  [21], 
Reif  and  Sen  get  around  this  problem  by  eliminating  re¬ 
dundancy  in  the  input  to  the  subproblems  at  every  level 
of  the  recursion.  In  other  words,  since  it  is  known  that 
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the  final  output  size  is  0{n),  it  is  possible  to  eliminate 
those  input  elements  from  a  subproblem  whicch  do  not 
contribute  to  the  final  output.  By  doing  this,  they  bound 
the  total  problem  size  at  every  level  of  the  process  tree 
to  be  within  c.n  for  some  constant  c.  This  step  is  non¬ 
trivial  and,  in  general,  avoiding  a  growth  in  problem  size 
in  this  manner  can  be  quite  complicated.  Moreover,  the 
strategy  that  can  be  used  to  eliminate  redundancy  seems 
to  be  very  problem-specific.  We  describe  a  method  to  use 
sampling  at  two  stages  of  the  algorithm,  which  will  help 
us  to  overcome  the  problem  of  the  increase  in  total  in¬ 
put  size  as  our  algorithm  proceeds  down  the  process  tree. 
Moreover,  it  appears  that  our  strategy  is  general  enough 
to  be  applicable  to  other  kinds  of  problems  as  well.  This 
technique  is  described  in  further  detail  in  Section  4.3.  It  is 
also  crucial  to  device  efficient  search  strategies  in  order  to 
determine  all  the  subproblems  that  an  element  lies  in.  In 
our  algorithm,  this  need  is  particularly  important  because 
of  the  fact  that  one  of  the  stages  of  sampling  entails  larger 
sample  sizes  (this  will  become  clearer  in  Section  4.3).  We 
give  such  an  efficient  search  strategy  in  Section  4.4. 

4.2  Defining  the  Subproblems 

We  now  give  a  precise  description  of  how  the  Voronoi  dia¬ 
gram  of  a  random  sample  R  is  used  to  divide  the  original 
input  S  into  smaller  problems.  Let  |/?|  =  r.  Consider  an 
edge  e  of  Vor{R).  e  has  two  primitive  regions  (prims  were 
defined  in  Section  3)  on  either  side  of  it;  call  these  'P^(e) 
and  'Pfl(e).  Every  vertex  v  of  Vor{R)  is  equidistant  from 
exactly  three  elements  of  R,  and  v  is  closer  to  these  three 
elements  than  to  any  other  element  of  R.  Thus  every  ver¬ 
tex  defines  a  circle  such  that  exactly  three  elements  of  R 
are  incident  on  it  and  the  interior  of  the  circle  is  empty. 
Let  T>r{v)  denote  the  circle  (and  its  interior)  defined  by 
Voronoi  vertex  v  of  Vor{R).  Let  vi  and  V2  be  the  two 
vertices  of  e. 

Consider  now  the  region  obtained  by  the  union  of 
'Pfl(e),  and  the  two  circles  Vr{vi)  and  Vr^v^) 
(see  Figure  2);  in  the  case  of  unbounded  edges,  we  have 
two  prims  and  one  circle.  Call  this  region  Cfl(e).  Ob¬ 
serve  that  for  the  Voronoi  diagram  of  points,  it  is  enough 
to  consider  Cij(e)  to  be  the  union  of  just  the  two  circles 
defined  by  the  two  vertices  of  e  because  the  prims  'P^(e) 
and  'P^(e)  always  lie  entirely  within  this  union.  This  is 
not  always  the  case  for  line  segments,  as  can  be  seen  in 
Figure  2. 

Remark:  We  would  like  the  subproblem  associated  with 
every  edge  of  Vor{R)  to  satisfy  the  following  condition;  if 
the  final  Voronoi  region  V{si)  of  element  Si  £  S  inter¬ 
sects  Vfi(e)  or  Vfi{e),  then  we  would  like  at  least  all  such 
Si  to  be  part  of  the  input  associated  with  the  edge  e. 

For  every  element  Sj  of  S,  consider  the  set  of  points  that 
are  closer  to  Sj  than  to  any  other  element  of  R.  In  other 
words,  consider  the  Voronoi  region  of  s*  in  the  Voronoi 
diagram  of  the  set  R  U  {si}.  Denote  this  by  V''^(sj). 
Clearly,  the  final  Voronoi  region  of  s,  will  be  a  region 


Figure  2;  Region  Cfi{e)  (shaded)  associated  with 
edge  e. 

smaller  than  (or  equal  to)  V^{si).  An  element  Si  will 
belong  to  the  subproblem  associated  with  edge  e  if  and 
only  if  V^(sj)  has  a  non-empty  intersection  with  ■P^(e) 
and  'Pfl(e).  Observe  that  determining  input  for  each  sub- 
problem  in  this  manner  satisfies  the  desired  condition  that 
we  stated  in  the  remark  above.  V^{si)  intersects  'P^(e) 
and  V\{e)  if  and  only  if  the  element  Si  has  a  non-empty 
intersection  with  CR{e). 

Let  A(Ci{(e))  denote  the  set  of  all  elements  s;  G  S 
such  that  Si  has  a  non-empty  intersection  with  Cij(e). 
Thns,  the  input  set  for  the  subproblem  associated  with  e 
will  be  nothing  but  A(C/i(e)).  Each  of  the  Vbr(A(Ci?(e))) 
will  be  computed  recursively.  The  number  of  edges 
in  Vor{R)  will  be  at  most  3r  —  5  (Theorem  3.3), 
and,  hence,  so  will  the  number  of  regions  CR{e).  Let 
Cr  =  {CR{e)  I  e  is  a  Voronoi  edge  from  Vor(i2)}.  We 
can  now  use  Clarkson  and  Shor’s  result  (Corollary  3.8, 
[6])  to  obtain  the  following.  Note  that  it  takes  four  ele¬ 
ments  of  R  to  define  each  region  CR(e):  the  two  elements 
that  e  bisects,  and  two  other  elements  that  determine  the 
two  vertices  of  e  (referring  now  to  the  notation  used  in  [6], 
clearly  6  =  4  and  the  “ranges”  identified  by  the  function 
S  are  precisely  the  regions  Ci?(e)  defined  here). 

Lemma  4.1  Given  a  random  sample  R  of  size  r  from  a  set 
of  objects  S  of  size  n  and  using  the  notation  established  above, 
both  of  the  following  conditions  hold  with  probability  at  least 
1/2: 

(a)  max  {  |A(CR(e))|  }  <  kmax  {n/r)  log  r 
CR{e)  e  Cr 

(b)  Y.  ^  *=tot  in/r)  E(  |C«|  ) 

Cn(e)  6  Cr 

where  kmax  and  kf^f;  are  constants  (obtained  from  Corollary 
3.8  in  [6]). 

Since  E{  |Ci{|  )  is  0(r),  the  above  conditions  guarantee 
that  with  probability  at  least  1/2,  the  total  size  of  the 
subproblems  is  almost  equal  to  the  size  of  the  original 
problem  and  the  subproblems  are  of  roughly  equal  size. 
As  in  [6,  21],  if  a  sample  R  satisfies  these  conditions  then 
it  is  called  a  good  sample  and  bad  otherwise.  In  order  to 
solve  the  subproblems  in  parallel  and  obtain  an  optimal 
run-time,  we  want  a  good  sample  with  high  probability. 
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4.3  Two  Stages  of  Sampling 

As  mentioned  in  Section  2,  Reif  and  Sen  [21]  describe 
the  novel  technique  of  polling  in  order  to  obtain  a  good 
sample  with  high  probability  from  samples  that  are  only 
expected  to  be  good.  If  (9(logn)  samples  are  chosen,  in¬ 
stead  of  just  one,  then  it  follows  from  Lemma  4.1  that 
with  high  probability  one  of  these  samples  will  be  good. 
Let  Ri,  ■•■)  Ra]ogn  be  these  samples  (where  the 

value  of  a  is  fixed  according  to  the  desired  success  prob¬ 
ability  of  the  algorithm)  of  size  0{n'^)  each.  Let  us  as¬ 
sume  for  now  that  we  have  an  efficient  (n  processors  and 
(9(logn)  time)  procedure  to  compute  the  total  subprob¬ 
lem  size  J2cji  (e)  e  Cr  ^  A'(Cfl.(e))  |  for  each  Vor{Ri). 
However,  in  order  to  determine  which  of  these  Ri  is  good, 
this  procedure  will  have  to  be  repeated  for  each  of  the 
O(logn)  samples.  This  will  mean  an  increase  in  the  pro¬ 
cessor  bound  of  0{n)  which  we  want  to  avoid.  Reif  and 
Sen  [21]  introduce  the  idea  of  polling  in  order  to  overcome 
this  problem.  They  determine  the  goodness  of  sample  Ri 
by  using  only  0{n/\og^n)  randomly  chosen  elements  from 
the  input  set,  where  6  >  2  is  some  constant.  The  pro¬ 
cedure  to  determine  the  goodness  of  the  alogn  samples 
can  be  run  in  parallel  for  all  the  Ri. 

The  randomized  parallel  algorithm  is  then  run  on  the 
good  sample  found  in  the  above  manner.  However  the 
technique  outlined  so  far  still  does  not  guarantee  that  we 
stay  within  the  desired  processor  bound  of  0{n).  The 
best  bound  that  Lemma  4.1  can  give  us  is  that  the  total 
size  of  the  subproblems  at  level  (« +  1)  of  the  process  tree 
is  kiof  times  the  total  size  of  the  subproblems  at  level  i. 
This  implies  that  after  O(loglogn)  levels,  the  total  size 
could  increase  by  a  polylogarithmic  factor.  Suppose  the 
total  size  at  the  leaf  level  of  the  process  tree  is  at  most 
n.log^n,  for  some  constant  c.  If  the  input  to  the  divide- 
and-conquer  algorithm  were  to  be  of  size  n/\og^n,  then 
the  total  problem  size  at  the  leaf  level  of  the  process  tree 
would  be  0{n).  This  observation  (along  with  the  results 
developed  in  the  following  sections)  yields  the  following. 

Theorem  4.2  The  Voronoi  diagram  of  a  set  of  n  line 
segments  can  be  constructed  in  0(log77.)  time  with  high 
probability  using  nlog'^n  processors. 

The  above  theorem  actually  enables  us  to  choose  samples 
of  size  much  larger  than  O(n^).  In  particular,  such  a 
sample  S'  could  be  of  size  O(n/log®n),  q  being  a  constant 
>  c.  If  S'  is  a  good  sample,  then  it  would  again  divide  the 
original  input  into  smaller  problems  of  roughly  equal  size 
(i.e.  A^(C5'(e))  would  be  of  size  roughly  O(log^n)  for  all 
Voronoi  edges  e  in  Vor(5''))  and  ^  |X(C5/(e))]  would  be 
0(n).  In  order  to  compute  the  Voronoi  diagram  of  these 
subproblems,  we  can  then  use  any  non-optimal  algorithm 
like  the  one  stated  in  Lemma  3.7  (Lemma  3.8  for  points 
in  the  plane).  We  would,  however,  like  to  be  able  to  find 
such  a  good  sample  S'  with  high  probability.  As  we  know 
from  Lemma  4.1,  there  is  a  constant  probability  that  S' 


is  a  good  sample.  As  before,  if  we  choose  O(logn)  such 
samples,  we  know  that  at  least  one  of  them  will  be  good 
with  very  high  probability. 

Let  N  —  n/log‘'n.  Let  Si,  S2,  •••,  5diogn  be  the 
O(logn)  samples  of  size  N  each.  Since  the  size  of  Si  is 
large,  we  obviously  cannot  afford  to  construct  Vor{Si) 
using  a  brute  force  technique  (as  we  can  do  with  sam¬ 
ples  of  size  0{n^)).  We  will  have  to  run  the  randomized 
parallel  algorithm  using  each  of  these  Si  as  input.  Let 
R\,  R’2,  ...,  R'aiogN  t’®  O(logV)  random  samples, 
each  of  size  N' ,  chosen  from  Si.  Thus  the  skeleton  of  our 
algorithm  will  now  be  as  follows.  Note  that  the  testing  of 
the  samples  Si  is  done  with  respect  to  a  restricted  input 
set  (polling). 

Algorithm  VORONOI_DlAGRAM; 

»N  :=  n/\ogfn. 

•  Pick  rflogn  random  samples  Si ,  S2, .  ■  .  ,  Sd\agn  of  size  N 
each . 

•  Let  7  be  a  random  subset  of  the  input  set  S  such  that 
|/|  =  n/log'^n,  q  being  a  constant  <  q. 

•  S'  :=  P!CK-THE-RlGHT-SAMPLE(Si,52,...,5diogn,7)- 

•  Partition  the  entire  input  5  according  to  the  good  sample 
5';  such  a  method  is  given  in  Section  4.4. 

•  Solve  each  subproblem  using  a  non-optimal  technique 
(Lemma  3.7  or  Lemma  3.8), 

•  Merge  the  results  (see  Section  4,5). 

Function  PicK-THE_Right_Sample(5i  ,  52, . . . ,  5diojn,  7). 

•  Do  the  following  in  parallel  for  each  5i  (1  <  *  <  dlogn). 

1.  (a)  Choose  a  log  7?  random  samples  7?) ,  7?2,  •••,  7J], log  „ 

each  of  size  from  the  set  Si. 

(b)  Construct  the  Voronoi  diagram  of  each  7?) 
(1  <  j  <  0  log  vV)  using  the  brute  force  technique 
described  in  Lemma  3.5  of  Section  3  (Lemma  3.6 
for  the  Voronoi  diagram  of  points  in  the  plane). 

(c)  Determine  which  of  these  7?'  is  a  good  sample  for 
S,'. Hence  the  inputs  to  the  method  in  Section  4.4 
will  be  Rj  and  Si.  Suppose  Rf  is  one  such  good 
sample;  with  high  probability,  there  will  be  such  a 
]'■ 

(d)  Use  7?y  to  divide  5,  into  smaller  subproblems. 

(e)  Recursively  compute  (in  parallel)  the  Voronoi  dia¬ 
gram  of  each  subproblem. 

(f)  Obtain  the  final  Voronoi  diagram  Vbr(5,)  from 
these  recursively  computed  Voronoi  diagrams. 

2.  Compute  the  total  subproblems  size  when  restricted  to 
7  (this  is  polling).  Hence  the  inputs  to  the  method  in 
Section  4.4  will  be  5,  and  7. 

•  Return  the  best  5, ;  with  high  probability  there  will  be  such 
an  Si. 

Observe  that  in  the  above  function,  it  will  not  be  neces¬ 
sary  to  use  polling  in  step  1(c)  because  of  the  smaller  size 
of  the  Si.  The  whole  point  of  polling  is  to  ensure  that  the 
processor  bound  of  0{n)  is  not  exceeded.  However,  we 
can  afford  to  use,  for  all  1  <  *  <  dlogn,  the  whole  set 
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Si  to  determine  the  goodness  of  R)  {I  <  3  <  alogn) 
because  (dlogn). (alogn). is  o(n)  as  long  as  g  >  3. 

We  know  that  each  of  the  Vor{Si)  can  be  constructed 
in  O(logn)  time  with  very  high  probability  (more  ac¬ 
curately,  we  will  know  this  for  sure  after  the  next  two 
sections).  But  we  want  to  be  sure  that  every  one  of 
the  VoifSi)  will  be  constructed  in  O(logn)  time  with 
high  probability.  This  follows  immediately  from  the  fact 
that  for  events  Ai,  A2,  ...  (not  necessarily  disjoint), 
Pr[[JiAi]  <  Suppose  the  probability  that  the 

construction  of  Vor{Si)  takes  more  than  /?  logn  steps  is 
<  n“"  for  some  constants  a  and  /?.  Then  it  follows 

from  the  stated  inequality  that  the  probability  that  the 
construction  of  one  or  more  of  the  Vor{Si)  takes  more 
than  /?  logn  time  is  <  (n~“).(dlogn).  Consequently, 

the  probability  that  all  the  Voronoi  diagrams  Vbr(5i), 
1^07^52),  . . .,  Vor{Sdiogn)  are  constructed  in  O(logn) 

time  is  >  (1  —  (n"“).(dlogn)),  which  is  very  high. 

It  will  be  necessary  to  process  the  Voronoi  diagram  of 
the  random  sample  appropriately  so  that  we  have  a  fast 
method  to  find  the  subproblems  defined  by  it.  Note  that 
in  our  scheme  it  is  imperative  that  we  have  an  efficient 
algorithm  to  perform  this  processing.  In  other  words,  we 
cannot  afford  to  have  a  parallel  algorithm  that  uses  a 
polynomial  number  of  processors.  Whereas  in  [21],  since 
the  sample  size  is  always  O(n^),  they  can  choose  an  appro¬ 
priate  £  such  that  the  processor  bound  of  0(n)  is  main¬ 
tained,  we  do  not  have  this  flexibility.  This  is  because  of 
the  large  sample  size  during  the  first  stage  of  sampling. 
In  the  following  section,  we  give  an  efficient  method  that 
satisfies  our  requirement. 

4.4  Finding  X{CR{e))  for  each  CR{e) 

Let  il  be  a  random  sample  chosen  from  a  set  S,  and  let 
|i?|  =  r.  (Hence  R  could  either  be  one  of  the  5,  cho¬ 
sen  from  the  original  input  S  {r  =  N  and  S  =  S)  01 
one  of  the  chosen  from  5,-  (r  =  and  S  =  Si).) 
Let  us  assume  Vor(R)  is  available  to  us;  Vor(R)  would 
have  been  constructed  either  through  a  brute-force  tech¬ 
nique  or  the  divide-and-conquer  method  outlined  in  the 
previous  section.  We  now  describe  a  method  to  process 
Vor(R)  appropriately  so  that  we  can  find  V(Cfi(e))  (C  S) 
for  all  Voronoi  edges  e  of  Vor(R)  in  O(logn)  time  with 
high  probability  using  0(n)  processors.  Note  that  we  ob¬ 
viously  cannot  sequentially  determine  all  the  subproblems 
that  an  element  of  S  lies  in  because  this  could  be  as  large 
as  0(r). 

4.4.1  Processing  Vor{R)  to  Form  the  Search 
Data  Structure  VorDS(i?) 

Let  U  he  a  triangulated  subdivision  with  u  vertices.  A 
subdivision  hierarchy  is  a  sequence  (7i,  172,...,  Uh(u)  of 
triangulated  subdivisions  satisfying  the  following  condi¬ 
tions:  (1)  17i  =  U,  (2)  |17/i(u)|  =  3,  and  (3)  each  region 
of  Ui+i  intersects  at  most  r  regions  of  Ui  for  some  con¬ 


stant  r.  The  idea  of  using  fractional  independent  sets^ 
to  build  a  subdivision  hierarchy  was  first  proposed  by 
Kirkpatrick  [16].  Subdivision  hierarchies  can  be  used  to 
construct  efficient  search  data  structures.  Randomized 
parallel  algorithms  to  solve  this  problem  efficiently  were 
given  independently  by  Dadoun  and  Kirkpatrick  [9]  and 
by  Reif  and  Sen  [22].  We  can  clearly  use  the  same  idea  of 
fractional  independent  sets  to  build  a  hierarchy  of  Voronoi 
diagrams  that  will  be  very  useful  for  our  purposes. 

The  dual  VV{R)  of  the  Voronoi  diagram  VoifR)  of 
a  set  of  elements  R  is  the  graph  which  has  a  node  for 
every  element  in  R  and  an  edge  between  two  nodes  if 
their  corresponding  Voronoi  regions  share  a  Voronoi  edge 
(for  the  case  when  i?  is  a  set  of  points,  the  dual  is  the 
well-known  Delaunay  triangulation  of  the  set  of  points). 
X>V(J?)  is  planar,  and  can  be  used  to  build  a  hierarchy  of 
Voronoi  diagrams.  By  a  slight  abuse  of  notation,  for  each 
segment  s  £  R,  we  will  use  s  to  refer  to  its  corresponding 
node  in  the  graph  W{R)  as  well  (hence  R  will  refer  to 
the  set  of  nodes  when  we  talk  about  VV{R)).  Observe 
that  when  R  is  a  set  of  line  segments  in  the  plane,  VV{R) 
could  be  a  multigraph  (i.e.  there  could  be  more  than  one 
edge  between  two  nodes):  this  is  because  a  bisector  can  be 
split  into  two  or  more  pieces  in  Vor(R).  It  can  be  shown 
that  every  interior  face  of  'DV{R)  must  be  triangular.  In 
a  triangulated  subdivision,  the  exterior  face  is  assumed  to 
be  a  triangle  as  well.  If  W{R)  does  not  have  a  triangular 
exterior  face,  we  can  assume  that  R  has  been  augmented 
appropriately  so  that  we  have  this  property  (this  can  be 
done  in  O(logr)  time  using  r  processors). 

In  the  remainder  of  this  paper,  we  use  Vr(s)  to  denote 
the  Voronoi  region  of  s  G  R  in  the  Voronoi  diagram 
Vor{R).  Also,  as  in  [16],  we  will  assume  that  every  node 
of  a  fractional  independent  set  is  an  internal  node  VV{R), 
and  the  degree  of  each  such  node  is  less  than  or  equal 
to  r,  where  r  is  a  fixed  constant  (for  our  purposes,  it 
suffices  to  assume  that  r  =  6).  Let  Ri  =  R.  Consider  a 
fractional  independent  set  W  of  Ri  in  the  graph  I>V(Ri), 
and  let  R2  =  Ri  —  W.  Let  s  be  a  node  in  W.  Let  r'^'(s) 
represent  the  set  of  neighbors  of  s  in  PV(Ri)  (obviously 
|r«‘(5)|  <  t).  Since  W  is  an  independent  set,  we  know 
that  r^‘(s)  C  R-i-  Then  we  have  the  following. 

Lemma  4.3  For  an  element  s'  G  R2,  Vn^i^')  contains 
a  part  of  the  region  Vr^{s)  ( i.e.  Vr^{s')  fl  Vri(s) 
if  and  only  if  s'  G  r'^‘(s). 

It  is  now  easy  to  construct  Vor(R2)  from  Vor{Ri). 
First,  we  know  from  [9,  22]  that  with  very  high  proba¬ 
bility,  we  can  find  such  an  independent  set  W  in  constant 
time  using  r  processors  (the  idea  is  to  use  randomized 

^If  G  =  (V,  E)  is  a  planar  graph,  then  a  set  X  C  V  is  a 
fractional  independent  set  oi  G  if  (1)  the  degree  of  each  vertex 
n  G  X  is  less  than  or  equal  to  some  constant  d  (2)  no  two 
vertices  of  X  are  connected  by  an  edge  (i.e.  the  set  X  is  an 
independent  set)  and  (3)  |X|  >  c  \V\  for  some  fraction  c. 
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symmetry  breaking;  see  [9,  22]  for  details).  It  follows  from 
Lemma  4.3  that  VV{R2)  can  be  obtained  from  W{Ri) 
in  constant  time  using  r  processors;  For  all  s  G  W,  (a) 
delete  the  node  s  and  the  edges  between  s  and  members  of 
r'^*(s)  and  (b)  add  edges  between  pairs  of  elements  from 
r'^*(s)  if  they  share  a  Voronoi  edge  in  Vor{R2)  (there  will 
be  at  most  3  such  new  edges  for  each  s).  Note  that  for 
each  new  prim  of  V'or(/?2),  we  can  determine  in  constant 
time  the  prims  of  Vor[Ri)  that  it  intersects. 

We  can  repeat  on  Ro  the  steps  described  above  and 
obtain  a  new  set  i?3  and  the  graph  VV{Rz),  and  we  can 
continue  the  process  until  we  have  a  set  of  size  3.  In  other 
words,  we  can  build  a  hierarchy  (analogous  to  the  subdi¬ 
vision  hierarchy  of  Kirkpatrick  [16])  of  Voronoi  diagrams 
Vor{Ri),  Vor{R2),  . . Vor{Rh)  where  h  =  O(logr).  This 
will  take  O(logr)  time  and  r  processors  with  high  prob¬ 
ability.  We  build  the  search  data  structure  as  we  con¬ 
struct  this  hierarchy  of  Voronoi  diagrams.  The  data  struc¬ 
ture  is  a  directed  acyclic  graph  VorDS(i?)  that  contains 
a  node  for  each  primitive  region  in  the  diagrams  Vor{Ri) 
(1  <  i  <  h).  Each  prim  of  Vor{Ri^i)  is  a  node  in 
the  (i  +  l)-th  level  of  VorDS(fi)  and  has  a  directed  edge 
to  those  nodes  at  level  i  with  which  it  has  a  non-empty 
intersection.  Obviously  the  degree  of  each  node  in  the 
data  structure  is  less  than  or  equal  to  r.  Thus  we  have 
the  following. 

Lemma  4.4  The  search  data  structure  VorDS(/?)  can  be 
built  in  O(logr)  time  with  very  high  probability  using  r 
processors,  where  r  =  |i?|. 

4.4.2  Searching  VorDS(i?)  to  find  A’(C/j(e))  for 
all  CRie) 

Once  VorDS(i?)  has  been  constructed,  we  want  to  search 
it  efficiently  in  order  to  find  the  input  set  to  each  subprob¬ 
lem  determined  by  R  i.e.  V(Cij(e))  for  C/j(e)  G  Cr.  Es¬ 
sentially,  each  element  s  £  S  searches  through  VorDS(/?) 
to  determine  all  the  subproblems  that  it  lies  in,  and  this 
is  done  in  parallel  for  all  such  s.  Note  that  this  is  also 
the  step  that  allows  us  to  determine  if  i?  is  a  good  sample 
or  not.  (Hence  in  our  algorithm,  S  may  sometimes  be  a 
polling  set  that  is  a  subset  of  the  original  input  S.)  Let 
I  5  I  =  Af.  We  first  state  the  basic  idea  behind  how  the 
search  proceeds. 

For  each  element  s  G  5,  we  start  off  the  search  by 
determining  the  prims  at  level  h  that  V^'*(s)  intersects^. 
Obviously  V'^'*  (s)  is  of  constant  size  and  hence  this  step 
can  be  carried  out  in  constant  time  using  0{Af)  proces¬ 
sors.  Our  algorithm  works  in  phases.  We  describe  the 
steps  during  one  phase:  Suppose  that  for  s  £  S,  a  pro¬ 
cessor  n  has  reached  the  node  corresponding  to  prim  T 
at  level  (f  +  1)  of  VorDS(i?).  As  we  know  from  the  pre¬ 
vious  section,  P  intersects  a  constant  number  of  prims  at 

^Recall  that  we  use  V^{s)  to  denote  the  region  that  is 
closer  to  s  than  to  any  other  element  in  R  i.e.  the  Voronoi 
region  of  s  in  the  Voronoi  diagram  of  i?  U  {s}. 


level  i.  Let  these  be  called  Vi,V2,  ■  ■  ■  ,Vp  (note  that  p 
will  be  less  than  or  equal  to  r).  Without  loss  of  gener¬ 
ality,  let  Si ,  So,  ■■■,Sp  be  the  elements  of  Ri  such  that 
for  1  <  i  <  p,  Pj  belongs  to  Vr,  (sj),  respectively.  If 
B{s,  Sj)  intersects  prim  Pj ,  then  H  generates  a  request 
for  a  new  processor  in  the  following  manner:  it  randomly 
generates  a  processor  label  and  writes  into  that  proces¬ 
sor  to  indicate  that  it  is  being  requested.  There  could  be 
write  conflicts  during  this  step  and  an  arbitrary  processor 
will  succeed  (in  the  case  of  a  success,  the  new  processor- 
will  need  to  know  Pj  and  s  so  as  to  continue  the  search 
from  the  node  corresponding  to  Pj).  Obviously  H  can 
carry  out  this  step  in  constant  time  (p  steps).  In  order  to 
be  sure  that  this  scheme  works,  we  need  to  know  that  for 
each  s  £  S,  every  prim  that  should  be  reached  at  level 
i  is  reachable  from  level  (f  -|-  1)  in  the  above  described 
manner.  This  follows  from  the  lemma  given  below  (we 
omit  the  proof,  which  is  quite  straightforward). 

Lemma  4.5  If  V^‘{s)  intersects  a  prim  P  at  level  i, 
then  V^'+'(s)  must  intersect  at  least  one  of  the  prims 
that  P  intersects  at  level  {i  +  1)  o/ VorDS(ii). 

We  described  one  phase  of  the  algorithm  above.  During 
the  next  phase,  another  set  of  requests  are  generated  in  a 
similar  manner  (including  the  ones  that  weren’t  satisfied 
in  the  previous  phase),  and  so  on.  It  follows  from  the 
above  lemma  that  by  searching  through  VorDS(i?)  in  the 
described  manner,  we  can  find  V'^(s)  and  hence  all  the 
subproblems  that  s  lies  in. 

There  is  an  important  fact  that  we  have  ignored  so  far. 
In  order  to  carry  out  this  search  step  efficiently,  we  have 
to  be  sure  that  the  total  number  of  requests  generated  is 
0{AflogAf)  over  all  levels  of  VorDS(i?).  Observe  that  it  is 
possible  for  the  total  problem  size  to  increase  from  level 
z  =  1  to  level  i—h  of  VorDS(f?)  because  an  element  s  £  S 
might  lie  in  a  greater  number  of  subproblems  at  level  f  + 1 
than  at  level  i.  However,  we  can  prove  that  this  increase 
is  such  that  the  total  subproblems  size  is  still  0{N)  at 
every  level  of  VorDS(/?).  We  will  not  go  into  the  proof, 
for  lack  of  space  and  appeal  to  intuition  instead.  Consider 
an  element  s'  £  Ri  such  that  s'  belongs  to  the  indepen¬ 
dent  set  W  of  PV(i?i).  For  every  element  that  belongs  to 
r^'(s')  (the  neighbors  of  s'  inX>V(i?r)),  the  total  problem 
size  associated  with  that  element  at  level  2  of  VorDS(/?) 
could  increase.  This  is  because  elements  that  belong  to 
the  subproblems  associated  with  s'  (there  can  be  at  most 
a  constant  number  of  such  subproblems)  could,  upon  the 
removal  of  s',  now  belong  to  the  neighbors  as  well;  by  an 
abuse  of  notation,  call  this  set  of  elements  X{Cr^{s')). 
Hence  the  maximum  possible  increase  in  size  at  level  2  of 
VorDS(i2)  will  be  given  by  («'))!■ 

In  a  similar  manner,  we  can  compute  the  total  increase  at 
level  3  of  the  data  structure,  while  taking  into  account  the 
possible  increase  in  the  problem  size  associated  with  an 
independent  set  element  at  level  2.  We  are  able  to  prove 
that  this  increase  at  each  level  of  VorDS(7?)  is  given  by  a 
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geometric  series  of  ratio  less  than  1,  as  long  as  the  frac¬ 
tion  of  independent  set  elements  at  each  level  of  the  data 
structure  is  less  than  fi,  where  //  <  1  is  fixed.  We  can 
show  that  this  must  be  the  case  with  very  high  probabil¬ 
ity.  The  intuition  behind  this  result  is  as  follows:  Those 
independent  set  elements  that  cause  a  large  increase  in 
problem  size  must  have  high  degree  in  'DV{R),  and  those 
that  cause  a  smaller  increase  will  have  a  smaller  degree. 
Hence  there  will  be  fewer  elements  that  cause  a  large  in¬ 
crease  than  those  that  cause  a  smaller  increase  (since  in 
a  planar  graph  there  will  be  fewer  vertices  of  large  degree 
than  vertices  of  smaller  degree). 

The  search  through  this  data  structure  is  carried  out 
by  the  dynamic  allocation  of  processors  to  each  segment 
s  £  S.  The  idea  is  to  use  a  number  of  processors  pro¬ 
portional  to  |T'^i(s)l  for  s.  We  can  use  randomized  tech¬ 
niques  for  dynamic  processor  allocation  (such  methods 
are  standard,  as  given,  for  example,  in  [20]).  For  good 
samples,  all  the  requests  will  be  satisfied  in  (9(logW) 
phases  with  very  high  likelihood  using  0(Af)  processors. 
If  the  search  through  VorDS(/?)  is  incomplete  after  these 
OilogM)  phases  (i.e.  there  are  requests  that  haven’t  been 
satisified),  then  we  can  reject  R  with  high  confidence. 

Note  that  since  the  data  structure  that  we  are  search¬ 
ing  through  is  a  directed  acyclic  graph,  it  is  possible  that 
an  s  G  <S  may  arrive  at  the  same  node  through  two  differ¬ 
ent  paths.  In  such  instances,  we  want  to  avoid  repeating 
searches  that  have  already  commenced.  We  would  like 
to  use  0{M\ogM)  space  in  order  to  solve  this  problem; 
we  can  use  randomized  hashing  techniques  to  achieve  this 
goal.  This  will  use  0{M)  space  for  each  level  of  VorDS(I?). 
From  Gil,  Matias  and  Vishkin  [11],  we  know  that  we  can 
certainly  do  this  in  0(log*Af)  time  (nearly  constant)  with 
high  probability  by  doing  an  optimal  amount  of  work  at 
each  phase.  But  we  show  that  over  all  the  levels  of  the 
data  structure,  the  total  processing  time  is  0(logA/’)  with 
high  probability  (omitted  for  lack  of  space).  Thus  we  have 
the  following. 

Lemma  4.6  IVe  can  find  X(Cfl(e))  for  all  CR{e)  G  Cr 
in  d(log Af)  time  using  0{Af)  processors  and  0{AflogAf) 
space,  where  R  is  a  random  sample  of  S  and  |5|  =  Af. 

4.5  Merging  Recursively  Computed 
Voronoi  Diagrams 

In  the  previous  section,  we  described  the  method  to  find 
the  subproblems  determined  by  a  random  sample  from 
the  input  set  S.  So  assume  now  that  for  a  good  sample 
R,  X(Cfl(e))  has  been  found  for  all  Cfi(e)  G  Cr.  The 
diagrams  Vor{X{CR{e)))  will  be  computed  recursively,  in 
parallel  for  each  Cr^o).  We  want  to  merge  these  Voronoi 
diagrams  to  form  Vorf^^).  In  the  remainder  of  the  sec¬ 
tion,  we  describe  the  method  to  perform  this  merge  step 
efficiently. 

The  final  Voronoi  diagram  of  S  can  be  constructed 
by  computing  the  final  Voronoi  region  V(s)  for  each  el¬ 


ement  s  G  <5.  V(s)  can  be  determined  by  finding  the 
intersection  of  all  the  Voronoi  regions  of  s  that  have  been 
computed  recursively  in  the  subproblems.  As  before,  let 
\R\  =  r  and  let  |5|  =  Af.  Since  i?  is  a  good  sample  of 
S,  we  know  that  the  total  size  of  the  subproblems  is  less 
than  or  equal  to  kfgf  Af.  It  follows  therefore  that  the  total 
number  of  Voronoi  edges  in  all  the  recursively  computed 
Voronoi  diagrams  must  be  less  than  Af  (since  the 

number  of  Voronoi  edges  in  Vor{X{CR{e)))  is  less  than 
3.|A(Cfl(e))|). 

Consider  the  set  of  subproblems  in  which  s  G  <?  lies. 
Let  Ti{(s)  be  the  set  of  recursively  computed  Voronoi 
regions  of  s  in  these  subprobles.  That  is'*,  T/j(s)  = 
{Vy(Cfi(e))(^)  I  e  The  Voronoi  re¬ 

gions  will  be  represented  as  the  collection  of  their  Voronoi 
edges.  Note  that  the  total  size  of  all  the  Tii(s)  for  all 
s  ^  S  will  also  be  0{Af)  since  every  Voronoi  edge  is 
counted  exactly  two  times  over  the  regions  X r{s)  for  all 
s  G  S  (once  for  each  of  the  two  Voronoi  regions  that  it 
borders).  Every  Voronoi  edge  in  Tfl(s)  is  part  of  some 
bisector  B(s,  s')  (s'  G  S).  In  the  case  of  line  segments, 
computing  the  intersection  of  these  bisectors  to  find  V(s) 
is  not  an  efficient  strategy  because  of  the  fact  that  two 
such  bisectors  can  intersect  twice  (see  Lemma  3.4).  The 
following  lemma  provides  us  with  a  solution.  (We  omit 
the  proof  for  lack  of  space.) 

Lemma  4.7  Every  Voronoi  vertex  of  the  final  Voronoi 
region  V(s)  appears  as  a  vertex  in  at  least  one  of  the 
Voronoi  regions  in  the  set  Xr(s). 

Thus  we  just  have  to  search  the  set  of  vertices  that  ap¬ 
pear  in  the  Voronoi  regions  in  T/{(s)  and  eliminate  those 
vertices  that  are  not  part  of  V(s).  We  do  this  as  fol¬ 
lows.  The  vertices  of  any  Voronoi  region  of  an  element  s 
have  well-defined  sorted  order  defined  by  the  projection 
of  the  vertices  on  s  (this  is  given  as  the  function  proj  in 
Section  3).  Assume  that  the  vertices  appear  in  clockwise 
order  (with  respect  to  some  axis  that  is  fixed  on  s).  We 
merge  in  parallel  the  Voronoi  vertices  of  the  regions  in 
Tfi(s)  according  to  this  ordering  by  using  the  cascading 
divide-and-conquer  technique  that  sorts  optimally  [Cole 
[7],Atallah,  Cole  and  Goodrich  [3]],  until  finally  we  are 
left  with  the  vertices  of  V(s).  Each  leaf  of  the  merge  tree 
will  contain  a  Voronoi  vertex  from  the  regions  in  Tii(s) 
along  with  the  Voronoi  edge  to  its  right.  Let  xjs  be  the 
total  number  of  Voronoi  edges  in  Tii(s). 

We  say  that  a  Voronoi  vertex  v  is  hidden  by  a  Voronoi 
edge  e  if  the  segment  from  v  to  proj(v,  s)  intersects  e  and 
V  does  not  lie  on  e.  The  idea  is  to  maintain,  at  each  node 
K  of  the  merge  tree  and  each  phase  i  of  the  cascade,  a  list 
that  consists  of  vertices  that  are  not  hidden  by  any  of  the 
Voronoi  edges  encountered  at  k  until  phase  i.  After  the 
final  phase,  the  root  of  the  merge  tree  will  have  the  list 

^Recall  that  we  use  Va(u)  to  denote  the  Voronoi  region  of 
a  G  A  in  the  diagram  Vor(A). 
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of  Voronoi  vertices  that  belong  to  V^(s).  This  step  runs 
in  0(log  j/s)  time  with  0{ys)  processors.  We  execute  this 
step  in  parallel  for  all  s  G  S  and  since  ^  ^  7/s  is 
0(Af),  we  have  the  following. 

Lemma  4.8  The  Voronoi  diagram  Vor(iS)  can  be  con¬ 
structed  from  the  recursively  computed  Voronoi  diagrams 
Vor(Ar(C/7(e)))  G  Cr)  in  0{\ogJ\f)  time  using 

0{M)  processors. 

5  Conclusions  and  Applications 

It  follows  from  the  previous  section  that  we  have  an  op¬ 
timal  algorithm  for  the  Voronoi  diagram  of  a  set  of  line 
segments  in  the  plane.  We  state  this  below. 

Theorem  5.1  The  Voronoi  diagram  of  a  set  of  n  non¬ 
intersecting  line  segments  in  the  plane  can  be  computed  in 
O(logn)  time  using  0{n)  processors  and  0{n\ogn)  space. 

The  above  theorem  immediately  gives  us  optimal  paral¬ 
lel  randomized  algorithms  for  computing  the  minimum 
weight  spanning  tree,  the  all-nearest  neighbor  for  the  set 
of  segments  and  also  an  optimal  parallel  algorithm  to  plan 
the  motion  of  an  object  from  one  point  to  another  while 
avoiding  polygonal  obstacles  (see  [19]  for  details). 

Clarkson  and  Shor’s  [6]  results  established  the  effec¬ 
tiveness  of  random  sampling  in  deriving  better  expected 
bounds  for  computational  geometry  problems  and  Reif 
and  Sen  [21]  demonstrate  how  to  obtain  high  probability 
results  from  expected  bounds,  which  is  crucial  for  par¬ 
allel  algorithms.  Our  result  offers  more  evidence  of  the 
usefulness  of  randomization  in  obtaining  more  efficient 
algorithms.  It  will  be  interesting  to  see  if  our  technique 
can  be  applied  to  other  problems  to  obtain  more  efficient 
results  and  perhaps  less  complicated  algorithms  since  we 
believe  that  our  approach  is  simpler  and  more  general. 
For  instance,  we  think  that  the  3-d  convex  hull  algorithm 
given  by  Reif  and  Sen  [21]  could  be  simplified,  especially 
since  the  surface  of  a  convex  polyhedron  is  topologically 
equivalent  to  a  bounded  planar  subdivision. 
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Abstract 

We  give  a  simple  lazy  randomized  incremental  algo¬ 
rithm  to  compute  <A:-levels  in  arrangements  of  x- 
monotone  Jordan  curves  in  the  plane,  and  in  arrange¬ 
ments  of  planes  in  three-dimensional  space.  If  eanh 
pair  of  curves  intersects  in  at  most  s  points,  the  ex¬ 
pected  running  time  of  the  algorithm  is  0{k^\s{n/k)  + 
min(As(n)log^n,  fc^A5(n/fc)logn)).  For  the  three- 
dimensional  case  the  expected  running  time  is  0{nk'^  ■+ 
min(nlog^n, n/c^logn)).  The  algorithm  also  works  for 
computing  the  <fc-level  in  a  set  of  discs,  with  an  expected 
running  time  of  0{nk  min(n  log^  n,  nk  log  n)).  Further¬ 
more,  we  give  a  simple  algorithm  for  computing  the  order- 
k  Voronoi  diagram  of  a  set  of  n  points  in  the  plane  that 
runs  in  expected  time  0{k{n  —  k)  logn  -t-  nlog^n). 

1  Introduction 

Arrangements  of  hyperplanes  have  been  studied  for 
a  long  time  in  combinatorial  and  computational  ge- 
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ometry  and  yet  they  have  kept  some  of  their  secrets. 
Some  of  the  intriguing  open  questions  are  related  to 
the  concept  of  levels.  We  say  that  a  point  p  is  at 
level  k  with  respect  to  a  set  H  of  non-vertical  hyper¬ 
planes  in  if  there  are  exactly  k  hyperplanes  in  H 
that  lie  strictly  above  p.  The  fc-level  of  an  arrange¬ 
ment  A{H)  of  hyperplanes  is  the  closure  of  all  facets 
of  A{H)  whose  interior  points  have  level  k  with  re¬ 
spect  to  H — see  Figure  1.  The  <A:-level  of  A{H)  is 


Figure  1:  The  2-level  in  an  arrangement  of  lines. 

the  union  of  all  i-levels  of  A{H),  for  0  <  f  <  A:.  The 
fc-level  and  the  <  fc-level  of  arrangements  of  monotone 
surfaces  are  defined  analogously.  In  fact,  one  can  give 
a  more  general  definition  of  levels.  Given  a  family  F 
of  subsets,  also  called  ranges,  of  R"^,  define  the  level  of 
a  point  p  with  respect  to  F  to  be  the  number  of  ranges 
that  contain  p  in  their  interior;  the  fc-level  and  <k- 
level  in  the  arrangement  of  F  are  defined  as  before. 
For  a  set  H  of  hyperplanes,  if  we  choose  ranges  to 
be  the  half-spaces  lying  above  the  hyperplanes  of  H, 
then  the  level  of  p  is  same  under  the  two  definitions. 

Although  many  researchers  have  studied  combina¬ 
torial  aspects  of  levels  in  arrangements,  the  maximum 
complexity  of  the  fc-level  is  still  unknown.  Even  in 
the  plane  there  is  a  large  gap  between  the  known 
upper  and  lower  bound  on  the  maximum  complex¬ 
ity  of  the  fc-level:  the  best  known  lower  bound  is 
O(nlog(fc  -f  1))  [Ede87],  whereas  the  best  known  up¬ 
per  bound  is  0(n\/fc/ log* (fc -F  1))  [PSS92].  However, 
exact  bounds  are  known  for  the  maximum  complex¬ 
ity  of  the  <fc-level  in  an  arrangement  of  hyperplanes 
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in  Using  a  probabilistic  counting  method,  Clark¬ 
son  and  Shor  [CS89]  proved  that  the  maximum  com¬ 
plexity  of  the  <fc-level  is  Following 

a  similar  technique,  Sharir  [Sha91]  proved  that  the 
maximum  complexity  of  the  <fc- level  in  a  set  of  x- 
monotone  Jordan  curves,  with  at  most  s  pairwise 
intersections,  is  Q{k^Xs{n/k)).  Here  As(to)  is  the 
maximum  length  of  an  (m,  s)-Davenport-Schinzel  se¬ 
quence;  for  any  constant  s,  Xs{m)  is  roughly  linear  in 
m  [ASS89].  For  discs  he  proved  a  bound  of  Q{nk)  on 
the  complexity  of  the  <A;-level. 

On  the  algorithmic  side,  Mulmuley  [Mul91b]  gave  a 
randomized  incremental  algorithm  for  computing  the 
<k-level  in  hyperplane  arrangements  in  any  dimen¬ 
sion.  For  d  >  4,  the  expected  running  time  of  his 
algorithm  is  which  is  optimal.  For 

d  —  2,3  the  expected  running  time  of  his  algorithm 
is  \og{n/k)),  which  is  suboptimal  by  a  log¬ 

arithmic  factor.  Recently,  Everett  et  al.  [ERvK93] 
gave  an  optimal  0(n  log  n  -b  nk)  expected  time  ran¬ 
domized  algorithm  for  computing  the  </c-level  in  an 
arrangement  of  lines  in  the  plane.  Mulmuley’s  al¬ 
gorithm  can  be  applied  to  compute  the  <fc-level  of 
arrangements  of  a:-monotone  Jordan  curves  in  the 
plane,  but  there  does  not  seem  to  be  an  easy  way 
to  generalize  it  for  computing  the  <fc-level  in  more 
general  ranges  like  discs.  Sharir  [Sha91]  presented  a 
divide-and-conquer  algorithm  for  computing  the  <k- 
level  of  rather  general  ranges  in  the  plane;  its  worst- 
case  running  time  is  roughly  log^  n  times  the  maxi¬ 
mum  size  of  the  <fc-level. 

In  this  paper  we  show  that  by  modifying  Mul¬ 
muley’s  algorithm  slightly,  we  can  obtain  a  sim¬ 
pler  and  faster  randomized  algorithm  for  com¬ 
puting  the  <A;- level  in  a  hyperplane  arrange¬ 
ment  whose  expected  running  time  is  + 

nlognmin(/L:l''^/^^ ,  log^'’*'^^^  n)),  which  is  optimal  for 
k  >  log^n  if  d  =  2,  for  k  >  log^'^^n  if  d  =  3,  and 
for  all  values  of  A;  if  d  >  4.  Moreover,  the  modified 
algorithm  works  for  more  general  ranges  like  discs  or 
pseudo-discs  as  well. 

Let  us  have  a  brief  look  at  Mulmuley’s  algo¬ 
rithm  [Mul91b]  (see  also  his  book  [Mul93]).  The  n 
hyperplanes  are  added  one  by  one  in  random  order 
and  during  this  process  the  <fe-level  of  the  current 
arrangement  is  maintained.  For  all  faces  of  the  cur¬ 
rent  structure  a  conflict  list  (containing  the  not  yet 
inserted  hyperplanes  that  intersect  the  face)  is  main¬ 
tained.  When  adding  a  new  hyperplane  h,  the  algo¬ 
rithm  first  updates  the  arrangement  locally  at  faces 
that  are  intersected  by  h.  Then  it  removes  cells  that 
have  “fallen  off”  because  they  are  now  on  level  fc  -b  1; 
Mulmuley  calls  this  the  peeling  step. 

We  observe  that  this  algorithm  maintains  too  much 


information.  After  adding  the  first  n/2  hyperplanes 
we  expect  the  fc-level  of  A{H)  to  be  located  near  the 
A:/2-level  of  A{R),  where  R  is  the  set  of  the  first  n/2 
hyperplanes;  most  of  the  <A:-level  of  A{R)  lying  be¬ 
low  this  fc/2-level  will  be  peeled  off  in  later  stages. 
Hence,  we  only  maintain  the  <re(r)-level  of  the  first 
r  hyperplanes,  where  K{r)  is  approximately  k{r/n). 
However,  now  we  have  to  be  a  bit  more  careful  about 
discarding  layers  of  cells.  It  turns  out  that  the  easi¬ 
est  way  to  do  this  is  to  remove  the  peeling  step  alto¬ 
gether,  and  to  replace  it  by  regular  clean-up  phases. 
This  idea  was  inspired  by  the  lazy  randomized  incre¬ 
mental  algorithms  of  de  Berg  et  al.  [dBDS94] .  At  the 
same  time,  this  makes  it  possible  to  use  the  algorithm 
in  situations  where  it  is  not  obvious  how  to  access  the 
parts  of  the  current  cell  complex  that  must  be  peeled 
off.  One  such  situation  is  for  the  <A;-level  for  discs. 


As  stated  earlier,  not  much  is  known  about  the 
complexity  of  the  exact  A;-level.  There  is  one  special 
case  where  the  maximum  complexity  of  the  A:-level 
of  a  three-dimensional  arrangement  is  known  to  be 
Q{k{n  —  k)),  namely  when  all  the  hyperplanes  are 
tangent  to  the  unit  paraboloid.  This  situation  arises 
when  the  planes  are  the  images  of  a  set  of  points  in 
the  plane  under  the  transformation  that  maps  the 
order-A;  Voronoi  diagram  of  these  points  to  the  k- 
level  of  the  planes.  Most  known  algorithms  for  com¬ 
puting  the  A-levels  in  three-dimensional  space  actu¬ 
ally  compute  the  <A:-level  [Mul91b,  CE87,  BDT93]. 
Since  the  complexity  of  the  <fc- level  is  ©(n/c^)  in 
the  situation  sketched  above,  the  running  time  of 
these  algorithms  is  ^{nk^  +  nlogn).  The  random¬ 
ized  incremental  algorithm  by  Aurenhammer  and 
Schwarzkopf  [AS92]  maintains  only  the  /c-level,  but 
it  can  be  shown  that  any  randomized  incremental  al¬ 
gorithm  that  maintains  the  fc-level  of  the  interme¬ 
diate  arrangements  must  take  time  Q{nk^)  as  well, 
since  the  expected  number  of  structural  changes  in 
the  fc-level  is  Q.{nk‘^)  [AS92].  The  only  algorithm  that 
approaches  the  desired  0{k[n  —  k))  time  bound  was 
presented  by  Clarkson  [Cla87].  His  algorithm  runs  in 
time  0{n^'^’^k),  where  e  >  0  is  an  arbitrarily  small 
constant.  (The  algorithm  can  probably  be  improved 
somewhat  by  using  more  recent  results  on  geometric 
cuttings) . 

We  present  a  simple  randomized  incremental  algo¬ 
rithm  that  runs  in  0{k{n  -  k)logn  +  n  log^  n)  time. 
We  obtain  this  result  by  maintaining  neither  the  en¬ 
tire  </!;-level  nor  the  exact  fc-level  of  the  current  ar¬ 
rangement,  but  some  suitably  chosen  region  that  is 
guaranteed  to  contain  the  /c-level  of  the  full  set. 
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2  Computing  the  <fc-level 

Let  H  he  a.  set  of  n  hyperplanes  in  d-dimensional 
space.  We  denote  the  arrangement  formed  by  fd  as 
A{H)  and  the  level  of  a  point  p  with  respect  to  H 
as  inip)-  We  define  the  level  of  a  face  of  A{H)  as 
the  level  of  any  of  its  interior  points.  For  a  subset 
i?  C  id,  we  denote  the  bottom-vertex  triangulation  of 
the  <fc- level  of  A{R)  by  Ck{R)-  This  triangulation  is 
defined  as  follows.  Let  C  be  a  cell  of  the  </c- level  and 
let  V  be  the  bottom  vertex  of  C  (that  is,  the  lexico¬ 
graphically  smallest  vertex  of  the  cell).  If  d  =  1  then 
C  is  a  segment  and  it  is  already  a  (1-dimensional) 
simplex.  If  d  >  1,  then  we  recursively  triangulate  the 
(d— l)-dimensional  facets  of  C  and  extend  each  (d— 1)- 
simplex  into  a  d-simplex  using  vertex  v.  The  bottom- 
vertex  triangulation  Ck{R)  of  the  <A:- level  of  A{R)  is 
obtained  by  triangulating  every  cell  of  the  <A:-level 
in  this  manner.  (Unbounded  cells  of  A{R)  require 
some  care  in  this  definition  [Cla88].)  Observe  that  the 
bottom-vertex  triangulation  is  a  cell-complex:  any 
facet  of  a  simplex  is  shared  with  exactly  one  other 
simplex.  In  this  section  we  give  an  algorithm  to  com¬ 
pute  Ck  (H)  for  a  given  parameter  k  <  n. 

The  algorithm.  To  compute  Ck{H)  we  first  gen¬ 
erate  a  random  permutation  hi^h2,  ■  ■  ■  ,hn  of  id.  Let 
Hr  {hi,...,  hr},  and  let  K(r)  be  an  appropriate 
non-decreasing  function  with  «:(n)  >  k  to  he  de¬ 
fined  below.  The  idea  of  the  algorithm  is  to  main¬ 
tain  CK{r){Rr)  while  adding  the  hyperplanes  one  by 
one.  However,  it  turns  out  to  be  easier  not  to  dis¬ 
card  cells  whose  level  (with  respect  to  Hr)  has  grown 
too  large  after  every  step.  Instead  we  get  rid  of  these 
cells  at  regular  clean-up-phases,  namely  after  insert¬ 
ing  the  2*-th  hyperplane,  for  1  <  f  <  [lognj,  and 
after  inserting  the  last  hyperplane.  This  type  of  lazy 
randomized  incremental  algorithm  was  introduced  re¬ 
cently  by  de  Berg  et  al.  [dBDS94].  Below  we  give  a 
detailed  description  of  the  algorithm. 

We  maintain  a  collection  of  d-simplices  that  forms 
a  d-dimensional  cell  complex,  defined  as  follows.  Sup¬ 
pose  we  have  added  the  first  r  hyperplanes.  Then  the 
collection  Sr  forms  the  bottom-vertex  triangulation  of 
the  cells  of  A{Hr)  whose  level  is  at  most  «(p)  with 
respect  to  Hp,  where  p  is  the  time  when  the  latest 
clean-up  was  performed.  Our  schedule  for  perform¬ 
ing  clean-ups  implies  that  p  is  the  largest  power  of 
two  that  is  less  than  r.  With  every  simplex  A  E  Sr 
we  keep  a  conflict  list  H{A)  that  contains  the  hyper¬ 
planes  in  H  intersecting  the  interior  of  A,  and  with 
each  hyperplane  h  e  H\Hr  vfe  keep  a  list  of  simplices 
of  Sr  that  h  intersects.  We  also  keep  an  adjacency 
graph  Q  on  the  simplices.  Two  simplices  Ai  and  A2 


are  connected  in  this  graph  if  they  share  a  facet.  We 
also  mark  an  arc  of  this  graph  if  the  shared  facet  of 
the  two  simplices  connected  by  this  arc  is  contained 
in  a  hyperplane.  The  level  of  points  changes  if  and 
only  if  we  pass  from  one  simplex  to  an  adjacent  one 
connected  by  a  marked  arc. 

Now  consider  the  addition  of  hyperplane  hr-i-i.  In 
a  generic  step  of  the  algorithm  (that  is,  when  r  -f- 1  is 
not  a  power  of  2)  we  must  perform  two  tasks:  update 
our  collection  of  simplices  (that  means,  re-triangulate 
the  cells  that  are  split  by  hr)  and  their  adjacency 
relations,  and  compute  the  conflict  lists  for  the  new 
simplices. 

Let  us  start  with  the  first  task.  Using  the  conflict 
lists  we  identify  all  the  simplices  in  Sr  that  are  in¬ 
tersected  by  hr+i;  those  are  the  only  ones  influenced 
by  the  insertion  of  hr+i.  Because  we  know  the  adja¬ 
cency  relations  among  the  simplices,  we  can  get  the 
intersected  simplices  in  a  number  of  groups,  one  for 
each  cell  that  is  intersected.  Consider  such  a  group, 
and  let  C  be  the  corresponding  cell.  All  the  simplices 
in  the  group  have  to  be  deleted  and  replaced  by  a 
number  of  new  simplices.  Let  us  first  take  a  look 
at  the  situation  when  d  =  2 — see  Figure  2.  To  deal 
with  the  part  of  C  that  lies  on  the  same  side  of  hr+i 
as  the  bottom  vertex  v  of  C,  we  draw  new  diagonals 
from  V  to  the  two  intersection  points  of  hr+i  with  the 
boundary  of  C;  this  creates  three  new  simplices.  The 
part  of  C  lying  on  the  opposite  side  of  hr+i  is  sim¬ 
ply  triangulated  from  scratch.  This  way  the  bottom- 
vertex  triangulations  of  the  two  cells  that  result  from 
the  splitting  of  C  are  constructed  in  time  that  is  lin¬ 
ear  in  the  number  of  new  simplices  that  are  created. 
The  adjacency  relations  among  the  simplices  can  eas¬ 
ily  be  determined  during  the  process.  In  dimensions 
greater  than  two  things  are  not  very  different.  Con¬ 
sider  a  3-dimensional  cell  C.  We  first  re-triangulate 
the  2-dimensional  facets  that  are  intersected,  in  the 
way  we  just  described.  We  also  triangulate  C  PI  hr+i- 
Next  we  construct  the  3-dimensional  simplices.  Given 
the  triangulation  of  the  facets  of  the  cell,  the  proce¬ 
dure  for  this  is  analogous  to  the  planar  procedure. 
In  general,  in  dimension  d  we  first  treat  the  parts  of 
the  {d  —  l)-facets  incident  to  intersected  simplices  re¬ 
cursively,  and  then  we  extend  the  {d  —  l)-simplices 
that  we  get  to  d-simplices  by  connecting  them  to  the 
correct  bottom  vertex. 

Our  second  task  is  to  compute  the  conflict  lists  of 
the  new  simplices.  Let  C  be  a  cell  that  has  been  split 
by  hr+i,  and  let  C'  be  one  of  the  two  new  cells  con¬ 
tained  in  C.  Some  of  the  simplices  of  C  may  already 
have  existed  in  C;  these  simplices  already  have  the 
correct  conflict  lists.  To  find  the  conflict  lists  for  the 
new  simplices  we  collect  all  the  hyperplanes  from  old 
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Figure  2:  Re-triangulation  of  a  cell  that  is  split. 


simplices  that  intersect  C .  We  remove  the  duplicates 
from  our  collection,  and  for  each  hyperplane  we  de¬ 
termine  one  new  simplex  that  it  intersects  (its  initial 
simplex).  This  can  be  done  in  time  proportional  to 
the  total  size  of  the  conflict  lists  involved.  Next,  we 
perform  for  each  hyperplane  a  traversal  of  the  adja¬ 
cency  graph  on  the  new  simplices,  starting  from  its 
initial  simplex.  This  way  we  determine  all  the  new 
simplices  intersected  by  the  hyperplane.  After  having 
done  this  for  each  hyperplane,  we  have  constructed  all 
the  new  conflict  lists.  The  time  we  have  spent  is  lin¬ 
ear  in  the  size  of  these  lists  plus  the  size  of  the  old 
lists  that  disappear. 


In  a  generic  step  of  the  algorithm  this  is  all  we  do. 
(It  would  be  difficult  to  discard  simplices  at  every 
step.  The  reason  for  this  is  the  following.  Since  our 
“threshold”  K{r)  depends  on  r,  it  is  not  necessarily 
the  case  that  all  simplices  lying  below  the  new  hy¬ 
perplane  hr+i  can  be  removed.  Therefore  we  would 
have  to  maintain  the  levels  of  the  simplices,  so  that 
we  can  decide  in  later  stages  whether  or  not  to  dis¬ 
card  them.  But  maintaining  the  level  of  the  simplices 
explicitly  is  too  costly,  because  it  involves  simplices 
not  intersected  by  flr+i-)  For  r  =  2, 4, . . . ,  n, 

however,  we  perform  a  clean-up  step.  We  traverse  the 
entire  current  cell  complex  using  the  adjacency  graph 
Q.  While  visiting  each  simplex  we  compute  its  level 
and  discard  it  if  its  level  is  larger  than  nir),  where  r 
is  the  current  step.  There  will  be  a  slight  chance  that 
by  doing  so  we  discard  a  cell  that  later  turns  out  to 
contain  a  part  of  the  <A;-level  of  A{H),  but  we  will 
argue  that  the  probability  that  this  happens  is  small. 
If  it  happens,  we  will  notice  so  in  the  final  clean-up 
phase  and  we  will  restart  the  algorithm  with  a  new 
random  permutation.  Since  the  probability  of  pick¬ 
ing  such  an  unfortunate  permutation  is  small,  this 
procedure  changes  the  expected  running  time  of  the 
algorithm  by  a  constant  factor  only. 


The  analysis.  Consider  the  situation  after  step  r 
of  the  algorithm.  Let  p  be  the  largest  power  of  two 
that  is  less  than  r,  and  define  k*  :=  k(p).  From  now 
on  we  assume  that  r  and,  hence,  p  and  k*  are  fixed. 

We  use  the  abstract  framework  of  de  Berg  et 
al.  [dBDS94]  to  analyze  our  algorithm.  Let  T  be  the 
set  of  all  possible  simplices  A,  that  is,  all  A  that  arise 
in  the  bottom-vertex  triangulation  of  A{R)  for  some 
subset  R  C  H.  We  denote  the  defining  set  of  a  sim¬ 
plex  A  e  .F  by  'L’(A);  this  is  the  inclusion-minimal 
subset  R  for  which  A  appears  in  the  bottom-vertex 
triangulation  of  ^(R).  (In  degenerate  situations,  dif¬ 
ferent  defining  sets  can  correspond  to  simplices  with 
the  same  set  of  vertices;  these  simplices  are  consid¬ 
ered  to  be  distinct.)  We  define  the  killing  set  of  A, 
denoted  ^(A),  as  the  set  of  hyperplanes  in  H  that 
intersect  the  interior  of  A.  Furthermore,  for  a  subset 
R  C  H  we  define  T{R)  as  the  set  of  simplices  in  the 
bottom-vertex  triangulation  of  the  full  arrangement 
A{R),  and  we  define  M{R)  as  the  set  of  all  simplices 
in  F  that  have  level  at  most  k*  with  respect  to  R. 
Notice  that  the  simplices  in  M{R)  do  not  have  to  be 
present  in  T{R).  Finally,  for  R'  C  R  C  ff  we  define 
C{R,R')  :=  T[R)  r\M[R').  The  idea  is  that  the  set 
£(R,  R')  corresponds  to  the  structure  maintained  by 
the  lazy  algorithm:  in  our  analysis  R  will  be  the  cur¬ 
rent  subset  and  R'  will  be  the  subset  for  which  the 
latest  clean-up  was  performed.  Observe  that  £(R,  R) 
is  the  same  as  (R),  the  set  of  simplices  of  the 
bottom- vertex  triangulation  of  the  <fc*-level  of  ^(R). 

Let  6(A)  :=  |'R(A)|  and  w(A)  :=  |/C(A)|.  It  is 
straightforward  to  verify  that  the  following  three  con¬ 
ditions  hold: 

(i)  there  is  a  constant  6  >  0  depending  on  d  such 
that  6(A)  <  6  for  all  A  G  R", 

(a)  a  region  A  G  R  is  in  T(R)  if  and  only  if  V(A)  C 
R  and  /C(A)  0  R  =  0, 

(in)  for  R'  C  R  C  iL  we  have  A4(R)  C  A4(R'). 
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Let  /ii, . . . ,  be  a  random  permutation  of  H,  and 
let  Hr  :=  {hi, . . . ,  hr}.  Define  Cr  :=  C{Hr,Hp).  (Re¬ 
call  that  p  is  the  largest  power  of  two  that  is  less 
than  r.)  Furthermore,  for  1  <  g  <  r  <  n,  we  define 
the  function 

T{r,q):=  m^xE[\£{HuH,)\].  (1) 

q<t<r 

De  Berg  et  al.  [dBDS94]  proved  that  under  conditions 
(i)-(iii)  the  following  theorem  holds. 

Theorem  1  Let  3  <  r  <  n  and  let  p  be  the  largest 
power  of  two  that  is  less  than  r.  The  expected  size 
of  Cr  is  at  most  T{r,p).  The  expected  number  of 
simplices  in  Cr  \  Cr-i  is  bounded  by  0(4r(r,p/2)). 
The  expected  total  conflict  size  of  these  simplices  is 
bounded  as 

To  apply  this  theorem  we  must  bound  the  func¬ 
tion  T{r,q)  for  r/4  <  q  <  r.  In  other  words,  we 
must  bound  the  expected  number  of  simplices  in  the 
bottom- vertex  triangulation  T{Ht)  that  have  level  at 
most  k*  with  respect  to  Hq,  iov  r/A  <  q  <  t  <  r.  It 
is  sufficient  to  bound  the  expected  number  of  vertices 
of  A{Ht)  at  level  at  most  k*  with  respect  to  Hq;  the 
number  of  simplices  is  linear  in  this  number.  Observe 
that  Hq  is  a  random  sample  of  Ht-  We  shall  refor¬ 
mulate  our  problem  in  an  abstract  setting  so  that  we 
can  use  results  on  random  sampling. 

Recall  that  C{Hq,Hq)  equals  Ck-{Hq),  the  set  of 
simplices  of  the  bottom-vertex  triangulation  of  the 
<&*-level  of  A{Hq).  As  before,  let  T  denote  the  set  of 
all  possible  simplices,  and  let  11(A)  denote  the  defin¬ 
ing  set  of  a  simplex  A.  Again,  the  following  condition 
holds: 

(i’)  there  is  a  constant  b  >  0  such  that  6(A)  <  6  for 
all  A&E, 

Let  /C(A,  Ht)  be  the  subset  of  hyperplanes  in  Ht  that 
intersect  the  interior  of  A.  It  is  not  difficult  to  check 
that  the  following  conditions  hold  for  every  subset 
RCHt: 

(ii’)  if  A  e  C{R,  R),  then  P(A)  C  R, 

(in’)  if  A  e  C{R,  R),  then  /C(A,  Ht)nR  =  0, 

(iv’)  if  A  e  C{R,  R)  and  R'  is  a  subset  of  R  with 
T>{A)  CR’  CR,  then  A  e  CiR',R'). 

Define  uj{A,Ht)  :=  \lC{A,Ht)\.  The  number  of  ver¬ 
tices  we  have  to  consider  is 

0{  (l  +  ^(A,^^^t))"). 


To  bound  the  expression  above  we  now  use  the  fact 
that  Hq  is  a  random  sample  of  Ht  of  size  q  with  q  > 
t/A.  Because  conditions  (i’)-(iv’)  hold  we  can  use 
a  theorem  by  de  Berg  et  al.  [dBDS94]  for  bounding 
higher  moments.  This  theorem  tells  us  that 

E[  =  0{{t/q)'^\Ck.{Hq)\). 

The  complexity  of  Ck-{Hq),  the  <A:*-level  of  A{Hq), 
is  0{qA/'^l(k*)A/‘^'\).  We  can  conclude  that  T{r,  q)  = 
for  7-/4  <  q  <  r.  Combining  this 
result  with  Theorem  1  gives  us  the  following  result. 
(To  simplify  the  expression  we  have  replaced  occur¬ 
rences  of  p  and  p/2  by  r;  this  does  not  influence  our 
bounds  asymptotically  because  r/2  <  p  <  r.) 

Lemma  2  The  expected  running  time  of  the  lazy 
randomized  algorithm  is 

0(n^rl-‘^/^J"^(«;(r)  -f  1)1^'^/^^). 

T—\ 

For  dimensions  larger  than  three,  we  simply  choose 
K(r)  ;=  k.  In  this  case,  a  cell  that  we  discard  can¬ 
not  contain  any  part  of  the  <A:-level  later  on,  so  our 
algorithm  always  succeeds.  We  obtain  an  algorithm 
for  the  computation  of  <fc- levels  with  expected  run¬ 
ning  time  The  algorithm  is  nearly 

identical  to  Mulmuley’s  algorithm  and  has  the  same 
optimal  running  time;  the  only  difference  is  that  we 
replaced  his  peeling  steps  by  our  clean-up  steps. 

For  dimensions  two  and  three,  however,  this 
choice  of  K(r)  gives  us  an  expected  running  time  of 
0(nkf^^'^'^  \og{n/k)),  which  is  suboptimal  by  a  factor 
of  log(n/fc).  We  now  show  that  a  slight  twist  in  the 
algorithm,  namely  a  different  choice  of  iz{r),  improves 
its  expected  running  time  for  sufficiently  large  values 
of  k. 

In  the  following,  the  dimension  d  is  2  or  3.  To 
minimize  the  running  time  we  would  like  to  choose 
K{r)  as  small  as  possible.  As  remarked  earlier,  how¬ 
ever,  there  is  a  slight  possibility  that  the  algorithm 
does  not  give  us  the  entire  <fc- level,  that  is,  that  we 
have  discarded  cells  that  contain  a  vertex  of  the  <k- 
level.  We  must  choose  K{r)  large  enough  so  that  the 
probability  of  this  happening  is  small.  To  bound  this 
probability  we  need  the  following  lemma,  which  can 
be  proven  using  tail  estimates  for  the  binomial  distri¬ 
bution  [AS93]. 

Lemma  3  Let  v  be  a  vertex  in  an  arrangement  A{H) 
of  n  hyperplanes  at  level  iniv)  <  k,  and  let  r  <  n 
and  fl  >  1  be  parameters.  The  probability  that  for 
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a  random  sample  R  C  H  of  r  hyperplanes  the  level 
£r{v)  of  V  is  more  than  is  bounded  by 

For  (3  >  e^ ,  this  is  bounded  by 

2  exp{—2pk{r/n)). 

In  the  remainder  we  shall  need  that  the  probability 
that  a  vertex  v  at  level  at  most  k  in  A{H)  has  a  level 
greater  than  K(r)  in  A{Hr)  is  bounded  by  2/n^.  From 
the  lemma  above  it  readily  follows  that  this  is  true  if 
we  choose 

K(r)  max  (2e^  A;-,  4  Inn). 

n 

Now  let  us  consider  the  probability  that  the  algo¬ 
rithm  fails.  Our  choice  of  r{t)  implies  that  the  prob¬ 
ability  that  a  vertex  v  is  mistakenly  discarded  in  any 
given  clean-up  step  is  at  most  2/n^.  There  are  [lognj 
clean-up  phases,  and  there  are  at  most  vertices  to 
consider  (with  d  =  2  or  d  =  3),  so  the  probability 
that  we  ever  discard  a  vertex  unjustified  is  at  most 
21ogn/n.  In  other  words,  we  expect  to  succeed  af¬ 
ter  we  have  run  the  algorithm  a  constant  number  of 
times. 

According  to  Lemma  2  the  expected  running  time 
of  the  algorithm  with  the  given  choice  of  /t(r)  will  be 


such  arrangements  into  constant  complexity  pieces, 
or  boxes.  We  use  the  vertical  decomposition;  we  ex¬ 
tend  a  vertical  segment  upward  from  every  intersec¬ 
tion  point  of  two  curves  until  it  hits  another  curve 
and  extend  a  vertical  segment  downward  until  it  hits 
another  curve;  if  a  vertical  segment  does  not  intersect 
any  curve,  it  is  extended  to  infinity.  The  same  tech¬ 
nique  can  be  applied  to  the  cells  in  the  </v(r)-level 
of  the  arrangement  A{Hr).  The  number  of  resulting 
boxes  is  0{R{r)'^\g{r / nfr))) .  We  can  now  use  the  lazy 
randomized  incremental  algorithm  described  above  to 
compute  the  <A;-level  in  an  arrangement  of  Jordan 
curves.  We  only  need  to  change  the  way  the  decom¬ 
position  is  updated  after  an  insertion,  since  we  are 
now  using  vertical  decompositions  instead  of  bottom- 
vertex  triangulations.  We  omit  the  details  of  how  the 
vertical  decomposition  is  updated;  they  are  the  same 
as  in  randomized  incremental  algorithms  for  comput¬ 
ing  full  planar  arrangements  [MulQla]. 

The  analysis  of  the  algorithm  follows  the  analysis 
given  earlier,  with  the  maximum  complexity  of  the 
<A;-level  in  a  set  of  hyperplanes  replaced  by  the  max¬ 
imum  complexity  of  the  <A;-level  in  a  set  of  Jordan 
curves.  Thus,  ifp  is  the  largest  power  of  two  less  than 
r  then  the  expected  running  time  is 


0{n)Y, 


r=l 


< 


0{n)Y, 


r=l 


+ 


=  -t- n(logn)^‘^/^^‘'"^). 


For  A;  <  4  In  n,  this  is  no  improvement  over  the  trivial 
choice  of  nlr).  Hence,  our  final  choice  for  K{r)  is 

,  ,  if  A;  <4 Inn, 

'  '  I  max(2e^A:^,41nn),  otherwise. 


This  gives  us  an  expected  running  time  of  0{nk^‘^£'^^  4- 
n  log  n  min(A;  ^  Jog  ^ ^ ^  ^  which  is  optimal  for 

k  >  logn  in  the  planar  case,  and  for  k  >  log^^^n 
in  the  three-dimensional  case. 

Before  we  state  our  theorem  on  computing  the  <k- 
level  we  discuss  a  generalization  of  the  planar  algo¬ 
rithm. 

Let  LA  be  a  set  of  n  x-monotone  Jordan  curves  in 
the  plane  with  each  pair  of  curves  intersecting  at  most 
s  times.  To  apply  our  algorithm  to  arrangements  of 
Jordan  curves  we  need  a  technique  for  decomposing 


where  r(r,p/2)  =  0{K{r)'^\s{r/K{r))).  The  same 
choice  of  K(r)  as  before  gives  us  a  running  time 
of  0(A;^As(n/A:)  -f  min(As(n)  log^  n,  k^Xs{n/k)  logn)). 
This  bound  subsumes  the  case  of  line  arrangements, 
since  two  lines  intersect  at  most  once  and  Ai(n)  =  n. 

The  same  algorithm  also  works  for  a  set  of  discs. 
Using  the  fact  that  the  complexity  of  the  <A;-level  of  a 
set  of  n  discs  is  0{nk),  it  can  shown  that  the  expected 
running  time  of  the  algorithm,  as  in  the  case  of  lines, 
is  0{nk  F  mm{n\o^  n,nk\ogn)).  We  omit  the  easy 
details  from  here. 

Theorem  4 

(i)  The  <k-level  of  an  arrangement  of  n  x-monotone 
Jordan  curves  in  the  plane  such  that  any  pair 
intersects  in  at  most  s  points  can  he  computed 
in  0{k'^\s{n/k)  -f  min(As(n)  log^n,  k‘^Xs{n/k)  logn)) 
time  by  a  lazy  randomized  incremental  algorithm. 

(ii)  The  <k-level  of  an  arrangement  of  n  planes  in 
three-dimensional  space  can  be  computed  in  expected 
time  0{nk'^  +  min(n  log^  n,  nk^  logn))  by  a  lazy  ran¬ 
domized  incremental  algorithm. 

(in)  The  <k-level  of  an  arrangement  of  n  discs  in 
the  plane  can  be  computed  in  expected  time  0{nk 
min(n  log^  n,  nk  log  n)) . 
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3  Computing  order-k  Voronoi  dia¬ 
grams 

Let  P  be  a  set  of  n  points  in  the  plane.  The  order- 
k  Voronoi  diagram  of  P  is  the  subdivision  of  the 
plane  into  cells  such  that  the  k  closest  points  in  P 
are  uniquely  determined  inside  each  cell.  In  other 
words,  two  points  p,  q  are  in  the  same  cell  if  the  k 
points  of  P  that  are  closest  to  p  are  also  the  k  points 
of  P  that  are  closest  to  q.  P  can  be  mapped  to  a  set 
H  =  H{P)  of  planes  in  three-dimensional  space  as 
follows.  Lift  each  point  of  P  to  the  unit  paraboloid 
[/  ;  2  =  and  take  the  plane  that  is  tangent 

to  U  at  that  point.  The  order-A;  Voronoi  diagram  of 
P  corresponds  to  the  fc-level  of  H — see  for  example 
Edelsbrunner’s  book  [Ede87].  The  maximum  com¬ 
plexity  of  the  order-/c  Voronoi  diagram  and,  hence, 
the  maximum  complexity  of  the  k-level  in  the  situa¬ 
tion  above,  is  Q{k{n  -  k)). 

We  give  a  simple  algorithm  that  computes  the  fc- 
level  for  this  case  in  0{k{n  —  fc)  log  n  +  n  log^  n)  time. 
The  algorithm  is  randomized  incremental,  like  the  one 
in  the  previous  section.  This  time,  however,  we  do 
not  use  the  lazy  paradigm:  we  clean  up  the  structure 
after  every  step.  Another  difference  with  the  previous 
algorithm  is  the  following. 

In  the  algorithm  for  computing  the  <  fc-level  we  ob¬ 
served  that  at  time  r  the  <fc-level  of  the  full  set  is 
expected  to  be  contained  in  the  <fcr/n-level  of  the 
sample.  Stated  differently,  we  made  an  estimation  of 
the  real  level  of  a  vertex  in  the  arrangement  of  sam¬ 
ple  planes  based  on  its  level  in  the  sample.  We  then 
decided  whether  or  not  to  discard  the  vertex  based  on 
this  estimation.  By  making  our  estimations  conser¬ 
vatively  we  ensured  that  the  probability  of  discarding 
a  “good”  vertex  was  small. 

This  line  of  attack  turns  out  not  to  work  for  com¬ 
puting  the  exact  fc-level.  So  we  maintain  a  portion  of 
the  arrangement  which  is  guaranteed  to  contain  the 
fc-level.  In  particular,  we  do  not  make  an  estimate 
of  the  real  level  of  a  vertex  based  on  its  level  in  the 
sample.  Instead  we  maintain  for  each  vertex  in  the 
current  arrangement  a  range  of  values  that  is  guaran¬ 
teed  to  contain  its  real  level.  We  only  discard  a  vertex 
when  fc  is  not  in  this  range.  Thus  we  are  sure  that 
the  algorithm  produces  the  right  answer.  To  bound 
the  expected  running  time  of  our  algorithm,  however, 
we  have  to  use  a  probabilistic  argument. 

As  before,  we  denote  by  H  the  set  of  planes  and 
by  Hr  the  first  r  elements  in  a  random  permutation 
of  H.  Let  A  be  a  simplex  in  the  bottom-vertex  tri¬ 
angulation  of  the  arrangement  A{Hr)-  The  conflict 
list  H{A)  of  A  is  the  set  of  planes  in  H  that  inter¬ 


sect  the  interior  of  A.  Let  ^(A)  be  the  level  of  any 
point  in  the  interior  of  A  with  respect  to  H  \  H{A) 
and  let  a;(A)  :=  \H{A)\.  We  call  A  interesting  if 
^(A)  <  fc  <  ^(A)  -t-  a;(A),  and  we  call  a  cell  of  A{Hr) 
active  if  its  bottom-vertex  triangulation  has  at  least 
one  interesting  simplex. 

Our  algorithm  maintains  the  simplices  of  the 
bottom-vertex  triangulation  of  all  active  cells  of 
A{Hr).  Since  the  fc-level  of  A{H)  can  intersect  only 
interesting  simplices,  we  are  sure  that  the  fc-level  is 
contained  in  the  region  that  we  maintain.  With  every 
simplex  A  we  maintain  the  conflict  list  Lf(A)  and  the 
level  £(A).  We  also  store  with  every  vertex  of  each 
active  cell  its  real  level,  that  is,  its  level  with  respect 
to  H.  This  will  help  us  to  compute  the  level  i(A) 
when  we  create  a  new  simplex  A.  Finally,  we  main¬ 
tain  the  adjacency  graph  on  the  simplices;  there  is  an 
arc  between  two  simplices  if  they  share  a  facet. 

To  add  a  new  plane  /i^+i  we  proceed  as  follows. 
First  we  identify  all  active  cells  intersected  by  hr+i', 
using  the  conflict  lists  this  can  be  done  in  time  linear 
in  the  number  of  simplices  intersected  by  /ir+i-  Then 
we  re-triangulate  these  cells  and  we  compute  the  con¬ 
flict  lists  for  the  new  simplices,  as  described  in  the 
previous  section.  Every  new  simplex  A  contains  at 
least  one  old  vertex;  using  the  level  of  this  vertex  we 
can  compute  the  levels  of  the  new  vertices  of  A  by  ex¬ 
amining  the  planes  in  H{A).  Hence,  we  can  compute 
the  real  levels  of  all  new  vertices  in  time  that  is  linear 
in  the  size  of  the  new  conflict  lists.  Next  we  compute 
the  level  ^(A)  of  each  new  simplex  A;  using  the  level 
of  any  of  the  vertices  of  A  this  can  again  be  done  by 
examining  the  planes  in  H[A).  (Note  that  ^(A)  does 
not  change  for  the  old  simplices.)  Finally,  we  check 
which  of  the  new  simplices  is  interesting,  and  we  dis¬ 
card  the  cells  that  have  no  interesting  simplex.  These 
two  steps  can  be  done  in  time  linear  in  the  total  size 
of  the  conflict  lists  of  all  deleted  simplices. 

When  the  last  hyperplane  has  been  added, 
we  have  H{A)  =  0  for  every  A.  In  other  words, 
ci;(A)  =  0  and  we  are  left  with  the  simplices  of  the 
bottom-vertex  triangulation  of  A{H)  that  have  level 
fc.  The  facets  of  these  simplices  that  bound  their  sim¬ 
plex  from  below  and  are  contained  in  one  of  the  planes 
in  H  form  the  fc-level  of  A{H). 

It  remains  to  analyze  the  algorithm. 

Lemma  5  The  expected  number  of  simplices  present 
at  time  r  is  0[rk^  log  r  +  r  log^  r) . 

Proof:  We  use  an  argument  based  on  the  e-net  the¬ 
ory.  Haussler  and  Welzl  [HW87]  proved  that  a  ran¬ 
dom  sample  R  ot  r  planes  from  a  set  H  ot  n  planes 
has  the  following  property  with  probability  at  least 
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1  -  1/n^:  Any  line  segment  s,  which  do  not  intersect 
a  plane  in  R,  intersects  at  most  Cy  logr  planes  in  H, 
where  C  is  a  constant. 

The  set  Hr  is  a  random  sample  of  H  of  size  r. 
Let  us  therefore  assume  for  a  moment  that  the  above 
mentioned  property  holds  for  Hr,  that  is,  every  seg¬ 
ment  s  that  lies  within  a  cell  of  A{Hr)  intersects  at 
most  C^logr  planes  of  H.  As  a  consequence,  for 
all  simplices  A  present  in  stage  r  we  have  cj(A)  < 
3C^  logr.  We  now  bound  the  total  complexity  of  all 
active  cells  (still  assuming  that  the  property  above 
holds  for  Hr).  By  definition,  every  active  cell  C  con¬ 
tains  at  least  one  interesting  simplex  A.  Let  p  be  an 
arbitrary  point  inside  A.  The  level  of  p  with  respect 
to  H  lies  between  i{A)  and  £(A)  -|-a;(A).  Because  A 
is  interesting  and  uj{A)  <  3C^  logr  we  know  that  the 
level  ofp  lies  between  k-3Cj  logr  and  k  +  3C^  logr. 
We  now  observe  that  every  vertex  u  of  C  can  be  con¬ 
nected  to  p  with  a  segment  s  =  ^  lying  inside  C. 
Therefore  s  crosses  no  more  than  Cj  logr  planes  in 
H,  and  we  can  conclude  that  the  level  of  v  lies  be¬ 
tween  k  —  4C^  logr  and  k  -I-  4(7^  logr.  The  number 
of  vertices  in  AiH)  with  this  level  is  bounded  by 

k+4C^  logr 

=  0(n/c— logr -h  n(— logr)^). 

e=k-4C^logr  ’’ 

The  probability  that  a  given  vertex  of  A{H)  appears 
in  A{Hr)  is  ©((^)^).  Hence,  the  total  complexity  of 
all  active  cells  (and  thus  also  the  number  of  simplices 
present)  at  time  r  is  0{rk^  log  r  -I-  r  log^  r). 

This  derivation  was  under  the  assumption  that  the 
e-net  property  mentioned  above  holds  for  Hr.  This  is 
true  with  probability  at  least  1-1/n^.  If  the  property 
is  not  true  for  Hr,  we  use  the  trivial  bound  of  0{r^) 
on  the  number  of  simplices  in  A{Hr).  The  expected 
number  of  simplices  is  therefore  bounded  by 

(1  -  1/n^)  •0(rA:^logr-hrlog^r)  4-  (l/n^)-0(r^) 

=  0(rfc^logr-f  rlog^r).  g 

As  in  the  previous  section,  the  expected  running 
time  of  the  algorithm  is  bounded  by  the  sum  of  the 
sizes  of  the  conflict  lists  of  all  simplices  ever  created 
during  the  algorithm.  Using  Theorem  1  again  (with 
the  appropriate  definitions  of  jC  and  r),  we  can  prove 
that  this  quantity  is  bounded  by 

n 

—0[rk-  logr-l-rlog^r)  =  0{nk\ogn+n\o^  n). 

r=l  ^  ^ 

For  k  <  n/2  this  is  equal  to  0{k{n  -  k)logn  -I- 
nlog^n).  For  k  >  n/2,  we  consider  the  fc-level  as 
the  (n  —  A;)-level  in  the  arrangement  turned  upside 
down,  leading  to  the  following  theorem. 


Theorem  6  The  k-level  in  an  arrangement  of  n 
planes  in  three-dimensional  space  that  are  all  tangent 
to  the  unit  paraboloid  can  be  computed  in  expected 
time  0(k(n- k)  log  n-|-n  log^  n)  with  a  randomized  in¬ 
cremental  algorithm,  using  0{k{n-k)  logn-hnlog^  n) 
storage. 

Remark.  We  only  used  the  fact  that  all  planes  in 
the  set  H  are  tangent  to  the  unit  paraboloid  in  the 
analysis  of  the  algorithm;  the  algorithm  itself  works 
for  arbitrary  sets  of  planes.  Hence,  if  the  complexity 
of  the  fc-level  for  arbitrary  sets  of  planes  is  about  the 
same  as  in  the  restricted  case  of  planes  tangent  to 
the  unit  paraboloid,  then  the  running  time  of  our 
algorithm  is  also  close  to  optimal  in  this  general  case. 

Corollary  7  The  order-k  Voronoi  diagram  of  n 
points  in  the  plane  can  be  computed  in  expected  time 
0{k{n  —  k)  log  n-\-n  log^  n)  with  a  randomized  incre¬ 
mental  algorithm  that  uses  0{k{n-k)  logn-|-n  log^  n) 
storage. 

4  Concluding  Remarks 

We  have  presented  algorithms  concerning  levels  in  ar¬ 
rangements.  The  first  algorithm  computes  the  <k- 
level  in  an  arrangement  of  curves  in  the  plane  or 
planes  in  three-dimensional  space.  The  second  algo¬ 
rithm  computes  the  exact  fc-level  of  a  set  of  planes  in 
three-dimensional  space  that  are  tangent  to  the  unit 
paraboloid;  the  A;-level  in  this  situation  corresponds 
to  the  order-fc  Voronoi  diagram  of  a  set  of  points  in 
the  plane.  The  algorithms  improve  and/or  simplify 
the  best  known  algorithms  for  these  problems. 

Both  algorithms  are  randomized  and  incremental. 
We  improve  upon  previous  randomized  incremental 
algorithms  for  these  problems  by  being  more  care¬ 
ful  in  what  to  maintain.  In  particular,  previous  ran¬ 
domized  incremental  algorithms  for  these  problems 
maintained  a  part  of  the  current  arrangement  based 
on  the  level  of  that  part  in  the  current  arrangement. 
The  part  that  we  maintain,  on  the  other  hand,  is 
based  on  the  level  of  that  part  in  the  full  arrangement. 
This  level  is,  of  course,  not  known  exactly  during  the 
algorithm.  For  the  computation  of  the  <A:-level  we 
therefore  made  an  approximation  of  the  level  in  the 
full  arrangement  based  on  the  level  in  the  current  ar¬ 
rangement.  For  the  computation  of  the  exact  fc-level 
we  maintained  a  range  of  values  that  is  guaranteed  to 
contain  the  level  in  the  full  arrangement. 

Our  algorithms  are  suboptimal  when  k  is  very 
small.  We  leave  it  as  an  open  problem  to  develop 
an  algorithm  that  is  optimal  for  all  values  of  k.  An¬ 
other  open  problem,  even  in  the  plane,  is  to  compute 
the  exact  A:-level  in  optimal  time. 
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Computing  Many  Faces  in  Arrangements  of  Lines  and  Segments  * 


Pankaj  K.  AgarwaP  Jin  MatouseF  Otfried  Schwarzkopf^ 


Abstract 

We  present  randomized  algorithms  for  computing 
many  faces  in  an  arrangement  of  lines  or  of  seg¬ 
ments  in  the  plane,  which  are  considerably  simpler 
and  slightly  faster  than  the  previously  known  ones. 
The  main  new  idea  is  a  simple  randomized  0(n  log  n) 
expected  time  algorithm  for  computing  yJi  cells  in  an 
arrangement  of  n  lines. 

1  Introduction 

Given  a  finite  set  of  lines,  L,  in  the  plane,  the  ar¬ 
rangement  of  L,  denoted  as  A{L),  is  the  cell  complex 
induced  by  L.  The  0-faces  (or  vertices)  of  A{L)  are 
the  intersection  points  of  L,  the  1-face  (or  edges)  are 
maximal  portions  of  lines  of  L  that  do  not  contain 
any  vertex,  and  the  2-faces  (called  cells)  are  the  con¬ 
nected  components  of  -  (JL.  For  a  finite  set  S 
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of  segments  we  define  the  arrangement,  A(S'),  in  an 
analogous  manner.  Notice  that  while  the  cells  in  a 
line  arrangement  are  convex,  in  an  arrangement  of 
segments  they  need  not  even  be  simply  connected. 

Line  and  segment  arrangements  have  been  exten¬ 
sively  studied  in  computational  geometry  (as  well 
as  in  some  other  areas),  as  a  wide  variety  of  com¬ 
putational  geometry  problems  can  be  formulated 
in  terms  of  computing  such  arrangements  or  their 
parts  [11,  14]. 

Given  a  set  L  of  n  lines  and  a  set  P  of  m  points 
in  the  plane,  we  define  A(L,  P)  to  be  the  collection 
of  all  cells  of  A{L)  containing  at  least  one  point  of 
P.  The  combinatorial  complexity  of  a  cell  C,  denoted 
by  \C\,  in  A{L)  is  the  number  of  edges  of  C.  Let 
i^{L,P)  =  ^ceA{L,P)  1^1  denote  the  total  combina¬ 
torial  complexity  of  all  cells  in  A{L,  P),  and  let 

K{n,  m)  =  max  /t(L,  P) , 

where  the  maximum  is  taken  over  all  sets  of  n  lines 
and  over  all  sets  of  m  points  in  the  plane.  It  is  known 
that 

K(n,  m)  =  +  n  +  m) . 

The  upper  bound  was  proven  by  Clarkson  et  al.  [9]; 
previous  results  and  related  work  can  be  found  in 
Canham  [4],  Edelsbrunner  and  Welzl  [13],  Szemeredi 
and  Trotter  [20] 

In  this  paper  we  study  the  problem  of  computing 
A{L,  P),  that  is,  for  each  cell  C  €  A{L,  P)  ,  we  want 
return  the  vertices  of  C  in,  say,  clockwise  order.  We 
will  refer  to  the  cells  of  A{L,  P)  as  the  marked  cells  of 
A{L).  Edelsbrunner  et  al.  [12]  presented  a  random¬ 
ized  algorithm,  based  on  the  random  sampling  tech¬ 
nique  [16],  for  computing  A(P,  P),  whose  expected 
running  time  was 

log  n-\-n  log  n  log  m) , 

for  any  fixed  e  >  0.  A  deterministic  algorithm  with 
running  time 

0[n?l^r{^l^  log*^^^^  n  +  n  log^  n  A-  m  log  n) 
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was  given  by  Agarwal  [1].  These  algorithms  thus 
are  nearly  worst-case  optimal,  but  both  of  them  are 
rather  involved. 

Recently  randomized  incremental  algorithms  have 
been  developed  for  a  wide  variety  of  geometric  prob¬ 
lems,  which  add  the  input  objects  one  by  one  in  a 
random  order  and  maintain  the  desired  structure;  see 
e.g.  [6,  10,  18,  19].  In  our  case,  we  can  add  the  lines 
of  L  one  by  one  in  a  random  order  and  maintain  the 
marked  cells  in  the  arrangement  of  lines  added  so  far. 
However,  this  approach  seems  to  yield  expected  run¬ 
ning  time  of  fl{n^/m+m  log  n)  in  the  worst  case.  We, 
therefore,  do  not  quite  follow  the  randomized  incre¬ 
mental  paradigm. 

We  begin  by  presenting  an  expected  0(m^+n  log  n) 
time  randomized  algorithm  for  computing  A{L,P). 
Notice  that  for  m  <  v^nlogn,  this  algorithm  is  op¬ 
timal.  We  then  apply  the  random  sampling  tech¬ 
nique  in  a  standard  way,  obtaining  an  expected 
log^^^  ;^  +  (m-l-n)  logn)  time  algorithm. 

We  also  study  a  similar  but  more  complicated  prob¬ 
lem  of  computing  the  marked  cells  in  an  arrangement 
of  n  segments.  Let  5  be  a  set  of  n  segments  in  the 
plane.  We  use  an  analogous  notation  A(5,  P)  to  de¬ 
note  the  set  of  the  cells  in  A{S)  containing  at  least 
one  point  of  P,  and  7?(n,  m)  to  denote  the  maximum 
combinatorial  complexity  of  A{S,  P)  over  all  sets  S 
of  n  segments  and  sets  P  of  m  points  in  the  plane. 
Aronov  et  al.  [2]  proved  that 

77(n, m)  =  O  +  n\ogm  +  n a(n)J  . 

A  randomized  algorithm  with  expected  running  time 
logn  -t-  na(n)  log^n log m)  is  de¬ 
scribed  by  Edelsbrunner  et  al.  [12],  and  a  slightly 
faster  deterministic  algorithm  is  presented  by  Agar¬ 
wal  [1]. 

Following  a  similar  strategy  as  for  the  case  of 
lines,  we  first  develop  a  randomized  algorithm  with 
0((m^  -1-  nlogm  -H  nQ(n))logn)  expected  running 
time.  Let’s  remark  that  the  above  upper  bound 
for  T]{n,m)  is  not  known  to  be  tight,  and  a  bound 
like  r]{n,y/n)  =  0{na(n))  (which  is  conjectured  to 
be  the  complexity  of  i/n  cells)  will  immediately  im¬ 
prove  the  expected  running  time  of  our  algorithm 
to  0(nlogna(n)).  Plugging  this  algorithm  to  the 
standard  random  sampling  technique,  as  in  the  case 
of  lines,  we  obtain  a  randomized  algorithm  for  com¬ 
puting  A{S,  P)  in  0{rn?/^n'^l^  log^/^  ^  + 

(m  +  nlogm  -I-  nQ;(n))logn)  expected  time.  If 
the  segments  of  S  have  only  k  —  o(n^)  intersec¬ 
tion  points,  the  expected  running  time  of  the  algo¬ 
rithm  is  log^''^  (m-|-nlogm-b 

na(n))  log  n). 


For  the  analysis  of  the  expected  running  time  of  our 
algorithms  we  will  use  a  generalization  of  a  lemma 
due  to  Chazelle  and  Friedman  [7].  (An  alternative 
analysis  could  probably  be  obtained  using  a  method 
similar  to  that  of  Chazelle  et  al.  [6],  but  we  hope  that 
our  approach  is  somewhat  more  intuitive). 


2  A  generalization  of  Chazelle 
and  Friedman’s  lemma 


Let  5  be  a  set  of  lines  or  segments,  and  P  a  set 
of  points  in  the  plane.  For  a  cell  C  of  the  collec¬ 
tion  A{S,  P),  let  C"  denote  the  collection  of  trape¬ 
zoids  in  the  vertical  decomposition  of  C,^  and  let 
A"{S,P)  =  Uce.4(S,f’)  ^ the  set  of  trape¬ 
zoids  in  the  vertical  decomposition  oiA{S,P).  Abus¬ 
ing  the  notation  slightly,  we  will  use  A"  (5,  P)  to  de¬ 
note  the  corresponding  planar  subdivision  as  well. 

Let  i?  be  a  subset  of  S.  For  a  trapezoid  A  € 
A"{R,P),  let  zi)(A)  denote  the  number  of  elements 
of  S  intersecting  the  interior  of  A. 

Let  n  =  |5|,  and  suppose  i?  is  a  random  subset  of 
S  of  size  r.  For  the  analysis  of  our  algorithms,  we 
are  interested  in  estimating  the  expectation,  over  all 
random  choices  of  R, 


F  [  w{Ar 

AeA"iR,P) 


(2.1) 


where  c  is  a  small  constant  like  c  =  2.  Well-known 
results  concerning  the  so-called  e-nets  (  Haussler  and 
Welzl  [16])  imply  that,  for  every  A  G  A"  (i?),  u;(A)  < 
C{n/r)  logr  with  high  probability,  where  C  is  a  suit¬ 
able  constant.  From  this  one  can  derive  a  bound  for 
(2.1).  We  are,  however,  interested  in  the  following, 
slightly  stronger  bound  (better  by  a  factor  of  log*’  r): 


Proposition  2.1  (i)  Let  L  be  a  set  ofn  lines  and  P 
a  set  of  m  points  in  the  plane.  If  R  Q  L  is  random 
subset  of  size  r,  where  each  subset  of  size  r  is  chosen 
with  equal  probability,  then  for  any  constant  c  >  1 . 

F  [  Y  ■  0{{n/rY) . 

(a)  Let  S  be  a  set  ofn  segments  and  P  a  set  of 
m  points  in  the  plane.  If  R  C  S  is  random  subset 

^The  vertical  decomposition  c"  of  a  cell  C  in  an  arrange¬ 
ments  of  segments  (or  of  lines)  is  obtained  by  drawing  a  vertical 
line  from  each  vertex  of  C  in  both  directions  (within  C)  until 
it  hits  another  edge  of  C. 
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of  size  r,  where  each  subset  of  size  r  is  chosen  with 
equal  probability,  then  for  any  constant  c  >  1. 

E[  ^  w(A)^]  =  77(r,  m)  •  0((n/r)‘^) . 

These  bounds  essentially  say  that  the  cth  moment 
of  the  quantities  w{^)  behaves  as  if  they  were  0{n/r). 
If  we  sum  u;(A)  over  all  cells  in  A{R) — the  case 
where  every  cell  of  .4(1?)  contains  a  point  of  P — then 
Proposition  2.1  follows  from  a  result  of  Clarkson  and 
Shor  [10].  In  our  situation,  where  the  sum  is  taken 
over  only  some  of  the  cells,  the  Clarkson-Shor  frame¬ 
work  does  not  apply  directly  anymore  (the  main  dis¬ 
tinction  between  these  two  situations  will  be  outlined 
below).  We  give  a  proof  based  on  a  generalization 
of  an  approach  due  to  Chazelle  and  Friedman  [7], 
which  is  somewhat  different  from  the  Clarkson-Shor 
method.  Recently,  de  Berg  et  al.  [3]  gave  an  alterna¬ 
tive  proof  of  Proposition  2.1. 

We  derive  a  key  lemma  in  a  somewhat  abstract 
framework;  see  also  [6,  7,  10]  for  various  approaches 
to  axiomatize  similar  situations. 

Let  5  be  a  set  of  objects.  For  a  subset  i?  C  5,  we 
define  a  collection  of  ‘regions’  called  CT{R)-,  in  the 
situation  of  Proposition  2.1  the  objects  are  segments, 
the  regions  are  trapezoids  and  CT(R)  =  A"{R,P). 
Let  T  =  T{S)  =  denote  the  set  of  re¬ 

gions  defined  by  all  possible  subsets  of  5.  We  asso¬ 
ciate  two  subsets  D{A),K{A)  C  S  with  each  region 
AST. 

D{A),  called  the  defining  set,  is  a  subset  of  S  defin¬ 
ing  the  region  A  in  a  suitable  geometric  sense.^  We 
assume  that  for  every  A  e  T,  |£l(A)|  <  d  for  a 
(small)  constant  d.  In  Proposition  2.1,  each  trape¬ 
zoid  A  is  defined  by  at  most  4  segments  (or  lines) 
of  S,  which  constitute  the  set  £>(A);  details  can  be 
found  in  Chazelle  et  al.  [6]. 

K{A),  called  the  killing  set,  is  a  set  of  objects  of 
S,  such  that  including  any  object  of  K{A)  into  R 
prevents  A  from  appearing  in  CT{R).  In  many  ap¬ 
plications  K{A)  is  the  set  of  objects  intersecting  the 
cell  A;  this  is  also  the  case  in  Proposition  2.1.  Set 
w{A)  =  |ir(A)|. 

Let  S,  CT{R),  D{A),  K{A)  be  such  that  for  any 
subset  R  C  S,  CT[R)  satisfies  the  following  axioms: 

(i)  For  any  A  e  CT{R),  D{A)  C  R  and  RnR'(A)  = 
0,  and 

(ii)  If  A  e  CT{R)  and  R'  is  a  subset  of  R  with 
D{A)  C  R',  then  A  G  CT{R'). 

^We  need  not  make  this  precise  here,  as  this  is  only  an 
intuitive  meaning  of  D{A).  The  analysis  depends  only  on  the 
axioms  involving  D{A)  given  below,  and  these  will  be  satisfied 
in  our  specific  examples. 


It  is  easily  checked  that  these  axioms  hold  in  the  sit¬ 
uations  of  Proposition  2.1. 

For  any  natural  number  t,  let  us  denote 

CTtiR)  =  {A  G  CT{R)  I  w{A)  >  tn/r}  . 

We  establish  the  following: 

Lemma  2.2  Given  a  set  S  of  objects,  let  R  be  a  ran¬ 
dom  sample  of  size  r  <  n  =  [S']  drawn  from  S,  and  let 
t  be  a  parameter,  I  <t  <  r/d,  where  d  =  max  |L>(A)|. 
Assuming  that  CT{R),  D{A)  and  K{A)  satisfy  Ax¬ 
ioms  (i)  and  (ii)  above,  we  have 

E I  CTtiR)  I  =  0(2“*)  ■  E  I  CTiR')\ ,  (2.2) 

where  R'  C  S  denotes  a  random  sample  of  size  r'  = 
[r/tj . 

Roughly  speaking,  Lemma  2.2  says  that  the  expected 
number  of  “large”  trapezoids  in  CT{R),  that  is  trape¬ 
zoids  which  the  value  of  tii(A)  exceeds  the  “right" 
value  n/r  more  than  t  times,  decreases  exponentially 
with  t. 

Chazelle  and  Friedman  [7]  proved  a  result  analo¬ 
gous  to  Lemma  2.2  under  the  following  stronger  ax¬ 
iom  replacing  (ii): 

(ii')  If  D(A)  C  R  and  /C(A)  n  i?  =  0,  then  A  G 
CTiR). 

This  assumption  implies  that  the  presence  of  A  in 
CTiR)  depends  only  on  Z?(A)  and  KiA),  thus  it  is 
determined  purely  “locally.”  Notice  that  (ii')  may 
fail  in  the  situation  of  Proposition  2.1.  However,  (ii') 
holds  in  the  special  case,  when  CTiR)  is  the  vertical 
decomposition  of  all  cells  in  .4(7?). 

Proof  of  Lemma  2.2:  Let  R  =  Ui?cs  CTtiR). 
We  have 

E|C'r*(7?)|  =  ^  Pr[A  G  cr(7?)] ,  (2.3) 

AeTt 

E\CTiR')\  =  ^  Pr[A  G  ^(7?')] 

AeT 

>  Pr[A  G  Cr(7?')] .  (2.4) 

ASTt 

We  will  prove  that,  for  each  A  G  Tt, 

Pr[A  G  Cr(7?)]  =  0(2“*)  •  Pr[A  G  Cr(7?')] ,  (2.5) 

which  in  conjunction  with  (2.3)  and  (2.4)  implies 

(2.2). 

Let  4a  denote  the  event  that  71>(A)  C  7?  and 
7f(A)  n  7?  =  0,  and  let  A'^  denote  the  event  7)(A)  C 
7?'  and  77 (A)  n  7?'  =  0. 
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We  rewrite  Pr[A  G  CT{R)]  using  the  definition  of 
conditional  probability; 

Pr[A  S  CT{R)]  =  Pr[^A]  •  Pr[A  G  CT{R)  \  Aa] 
and  analogously 

Pr[A  G  CT[R')]  =  Pr[A;^]  •  Pr[A  G  CT{R')  \  . 

We  observe  that  by  Axiom  (ii),  we  have 

Pr[A  G  CT{R)  I  Aa]  <  Pr[A  G  CT{R')  \  A'a]  .  (2.6) 

Indeed,  Pr[A  G  CT{R')  \  A'^  is  the  probability  that 
A  appears  in  CT{R'),  where  R'  is  created  as  follows: 
Include  all  elements  of  D{A),  and  then  choose  the 
remaining  r'  -  |D(A)|  elements  randomly  among  the 
elements  of  S\(D{A)UK{A)).  We  may  continue  this 
experiment  by  choosing  R  to  be  R'  plus  a  random 
subset  of  r  —  r'  elements  of  5  \  {R'  U  A'(A)).  Clearly 
for  such  subsets  R'  and  R,  Pr[A  G  CT(R)]  <  Pr[A  G 
CT{R')].  Moreover,  the  subset  R  selected  by  this 
experiment  contains  D{A)  plus  r  —  |f5(A)|  random 
elements  of  S  \  {D{A)  U  A’(A)),  so  Pr[A  G  CT{R)]  is 
the  same  as  the  left  hand  side  of  (2.6). 

Therefore 

Pr[A  €  CT{R)]  ^  Pr[AA] 

Pr[A  €  CT{R')]  -  PrjA'^i  ‘ 

(Note  that  r'  =  [r/tj  >  d,  and  hence  both  denomi¬ 
nators  are  nonzero.) 

It  remains  to  estimate  the  latter  ratio,  which  can  be 
done  in  the  same  way  as  by  Chazelle  and  Friedman. 
Let  d  =  |D(A)|,  w  =  'w(A),  and  for  two  non-negative 
integers  a  <  x,  let  x-  =  x{x  —  1)  •  •  •  (x  —  a  -1- 1).  Then 

p.[/ia|  ^  e.) 

C)  ("kkj'O 

r-  {n  —  w  —  r')- 
r'd  (n  —  r')- 

By  our  assumption  r'  >  d,  so  we  obtain 

— — -  <  dt  for  i  =  0, 1, . . . ,  d  —  1. 
r  —  i 

Thus,  the  first  factor  in  the  above  expression  is  0(t‘^). 
To  bound  the  second  factor,  we  observe  that,  for  i  = 
r',r'-Hl,...,r-  1, 


<  1  - 


w 

n 


<  exp{—w/n) . 


Since  w  >  tn/r,  we  have  w/n  >  t/r,  and  therefore 


Pr[^A]  < 
Pr[A'A]  - 


0(t‘^)exp(-(t  -  1))  =  0(2  *), 


as  desired.  n 

We  now  prove  Proposition  2.1. 

Proof  of  Proposition  2.1:  We  will  only  prove  the 
first  part,  the  second  part  is  identical.  For  any  subset 
R  Q  L  oi  size  r,  let  CT{R)  denote  the  set  of  trape¬ 
zoids  in  the  vertical  decomposition  of  the  marked 
cells  of  A{R),  i.e.,  CT{R)  —  A"(i?,  P).  Obviously, 
|OT(i?)|  <  K{r,m).  Now 

E[  ^  MA)^] 

A6.4"  (ii,p) 


=  E 


^(t^)"(|CTt(P)i-icrt_i(i?)i) 


^  (-)'D^  +  ir-E|CT,(i?)l 

t>o 

=  y~^PO(2~*)  •  K,{r/t,m) 

^  t>o 

<  «(r,m)(-)'^0(t'=.2-‘) 

^  t>o 

=  «(r,m)-0((^)‘=)  . 


□ 


3  Computing  cells  in  line  ar¬ 
rangements 

In  this  section  we  describe  a  randomized  algorithm  for 
computing  A(L,  P),  where  L  is  a  set  of  n  lines  and 
P  a  set  of  m  points  in  the  plane.  In  fact,  our  algo¬ 
rithm  computes  the  vertical  decomposition  A  {L,  P). 
Each  face  of  A"  (L,  P)  is  a  trapezoid,  bounded  by  at 
most  two  vertical  segments  and  portions  of  at  most 
two  edges  of  a  cell  of  a"(L,  P).  We  first  present  a 
randomized  algorithm  for  computing  A"(L,  P)  with 
0{m?  -h  nlogn)  expected  time,  which  is  optimal  for 
m  <  y/n  log  n.  We  assume  that  the  points  of  P  are 
sorted  in  nondecreasing  order  of  their  x-coordinates, 
and  that  the  lines  of  L  are  sorted  by  their  slopes.  We 
first  describe  the  outline  of  the  overall  algorithm,  and 
then  discuss  each  of  the  steps  in  detail. 

1.  Let  t  be  some  sufficiently  large  constant.  Choose 
a  random  subset  RC  L  oi  r  =  [n/t\  lines. 

2.  Partition  P  into  q  =  \Vi]  subsets  Pi,  ...  ,P,, 
each  of  size  at  most  k  =  [m/\/tJ,  where 

Pi  ^  {P(i-i)fc+i-  •  ■  ■  ^Pik}  fori  <  q, 

Pq  ~  {P(i3  — l)fc+l!  iPm}' 


79 


3.  For  each  i  <  q,  compute  Pj)  recursively. 

If  a  cell  C  of  A{R)  is  computed  more  than  once, 
retain  only  one  copy  of  C.  (Note  that  multiple 
copies  of  a  cell  C  are  computed  if  C  contains  the 
points  of  more  than  one  Pi’s.)  Since  P  is  sorted 
in  the  x-direction,  it  is  easy  to  detect  multiple 
copies  of  a  cell.  In  this  way,  obtain  a'\r,P). 

4.  For  each  line  £  G  L  \  R,  compute  the  cells  of 
A{R,  P)  that  £  intersects. 

5.  For  each  trapezoid  A  of  yl"  (P,  P),  compute  the 
set  La  C  L  \  P  of  lines  that  intersect  the  interior 
of  A. 

6.  For  each  trapezoid  A  G  .A"(P,  P),  compute  the 

arrangement  of  lines  of  La  within  A,  and  the 
vertical  decompositions  of  its  cells.  For  each  cell 
C  G  ^(P,  P),  perform  a  graph  search  on  trape¬ 
zoids  of  these  vertical  decomposition  to  merge 
appropriate  trapezoids  and  to  discard  superflu¬ 
ous  ones,  thus  forming  the  portion  of  P) 

within  the  cell  C. 

Steps  1-3  are  trivial,  so  we  now  describe  Steps  4-6 
in  more  detail. 

Step  4.  We  want  to  compute  the  cells  of  ^(P,  P) 
intersected  by  each  line  in  L\P.  The  situation  can  be 
viewed  as  follows:  we  have  a  collection  C  of  disjoint 
convex  polygons  (the  cells  of  A{R,  P)),  and  a  set  L\P 
of  lines.  The  collection  C  has  at  most  m  polygons 
with  a  total  of  0{n+w?)  edges^.  For  each  cell  C  £C, 
consider  C*,  the  set  of  points  that  are  dual  to  the  lines 
intersecting  C.  C*  is  a  polygonal  region,  bounded 
by  an  infinite  convex  chain  from  above  and  by  an 
infinite  concave  chain  from  below.  Each  vertex  of  C* 
is  dual  to  the  line  supporting  an  edge  of  C.  For  a 
pair  of  polygons  Ci,C2  G  C,  an  intersection  point  of 
the  edges  of  Cl ,  is  dual  to  a  common  tangent  of 
Cl  and  C2-  Since  Ci,C2  are  disjoint,  the  boundaries 
of  Cl,  Cl  intersect  in  at  most  4  points. 

Let  us  consider  the  arrangement  A{C*)  of  the 
polygonal  chains  bounding  the  regions  C* ,  for  all 
C  G  C.  It  has  Oin  +  TV?)  complexity,  and  can  be  com¬ 
puted  in  time  O(m^-l-nlogn),  for  instance  by  Mulmu- 
ley’s  randomized  incremental  algorithm  [18,  6].  This 
algorithm  actually  computes  the  vertical  decomposi¬ 
tion  A  (C*)  of  the  arrangement,  together  with  a  point 
location  data  structure  with  O(logn)  expected  query 
time.  We  use  this  data  structure  to  locate  the  points 
£*  dual  to  all  lines  £  €  L  \  R.  From  this  we  can  de¬ 
termine,  for  every  £,  the  regions  of  C*  containing  £*, 

®The  latter  estimate  follows  from  the  bound  for  K(n,Tn) 
mentioned  in  Section  1,  in  fact  it  is  the  weaker  bound  proved 
by  Canham  [4]. 


or  in  other  words,  the  polygons  of  C  intersecting  £. 
Indeed,  after  having  located  all  points  of  the  form  £* , 
we  traverse  the  adjacency  graph  of  the  trapezoids  in 
A  (C*).  At  each  trapezoid  r  G  A"(C*)  we  compute 
C*(t),  the  set  of  regions  that  contain  the  trapezoid 
T  G  A^'{C*),  and  output  the  pairs  (£,  C)  for  £*  G  t 
and  C*  G  C*{t).  Suppose  we  arrive  at  r  from  r',  then 
C*{t)  and  C*(t')  differ  by  at  most  one  region  (the  re¬ 
gion  whose  boundary  separates  r  from  r'),  and  thus 
C*(t)  can  he  obtained  from  C*(r')  in  0(1)  time. 

The  total  time  spent  in  this  step  is  0(Tn^  -bnlogn) 
plus  the  number  of  polygon/line  incidences.  The 
expected  number  of  these  incidences  is  bounded  by 
0{K{r,m)-{n/r))  =  O(n-l-m^),  using  Proposition  2.1 
with  r  =  n/t  and  c  =  1. 


Figure  1:  Finding  cr. 

Step  5.  Let  O  be  a  cell  in  A{R,P),  and  let 
Lc  Q  L  \  R  he  the  set  of  lines  intersecting  the  in¬ 
terior  of  C.  For  each  line  £  G  Lc,  we  compute  the 
trapezoids  of  C  intersected  by  £,  as  follows.  Since 
the  lines  in  L  are  sorted  by  their  slopes,  by  being  care¬ 
ful  in  Step  4,  we  can  ensure  that  the  lines  of  Lc  are 
also  sorted  by  their  slopes.  For  each  line  £  G  Lc  we 
compute  the  two  vertices  Vi  ,V2  of  C  that  support  the 
lines  parallel  to  £  (see  Figure  1).  This  can  be  done, 
over  all  lines  of  Lc,  in  0(|Lc|)  time  by  merging  the 
slopes  of  Lc  with  the  slopes  of  the  edges  of  C;  we 
leave  out  the  easy  details  for  the  reader.  Next,  we 
traverse  dC  in  clockwise  as  well  as  counter-clockwise 
order  in  a  lock-step  fashion,  starting  from  both  Vi 
and  V2  simultaneously  (so  we  preform  4  traversals  in 
a  lock-step  fashion,  as  depicted  in  Figure  1),  until 
we  reach  an  intersection  point  a  of  £  and  C.  Since 
£  intersects  C,  we  will  eventually  find  such  an  in¬ 
tersection  point.  Finally,  by  tracing  £  through  C", 
starting  from  a,  we  compute  all  k  trapezoids  of  C" 
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that  £  intersects.  The  time  spent  in  finding  a  and 
tracing  £  is  easily  seen  to  be  0{k).  Summing  over  all 
cells  C  G  A{R,  P)  and  over  all  lines  of  Lq,  the  total 
time  spent  is  ^(^)))  whose  expected 

value,  by  Proposition  2.1  (i),  is  0(m^  +nlogn). 

Step  6.  Let  A  be  a  trapezoid  of  P).  Af¬ 

ter  having  computed  La,  we  compute  the  arrange¬ 
ment  A(La)  using,  say,  a  randomized  incremental 
algorithm.  We  clip  A(La)  within  A,  and  compute 
the  vertical  decomposition  of  the  clipped  arrange¬ 
ment.  For  each  point  p  G  P  n  A,  we  also  compute 
the  trapezoid  of  this  vertical  decomposition  contain¬ 
ing  p.  The  time  spent  in  this  step  is  easily  seen 
to  be  0(w;(A)^  -f-  |P  D  A|logw;(A))  per  trapezoid 
A  G  A"(P,P). 

For  a  cell  C  G  A(P,  P),  let  Ac  be  the  set  of  the 
resulting  trapezoids  that  lie  in  C.  We  now  define  a 
graph  Qc  on  the  trapezoids  of  Ac-  The  vertices  of 
Qc  are  the  trapezoids  of  Ac,  and  two  trapezoids  are 
connected  by  an  edge  if  they  share  a  vertical  edge. 
By  performing  a  depth  first  search  on  Qc,  we  can 
extract  all  connected  components  of  Qc  whose  trape¬ 
zoids  contain  any  point  of  P.  That  is,  we  pick  a  point 
p  G  P  n  C.  Let  Tp  G  Ac  be  the  trapezoid  containing 
p.  We  perform  a  depth  first  search  in  Qc  starting  from 
Tp  until  we  find  the  entire  connected  component  of  Qc 
containing  Tp.  Let  Ac{p)  be  the  set  of  trapezoids  in 
this  component;  then  the  union  of  these  trapezoids  is 
exactly  the  cell  of  A{L,  {p}).  The  vertices  of  the  cell, 
sorted  in  the  clockwise  order,  can  be  easily  obtained 
by  merging  the  trapezoids  of  Ac(p)  in  an  obvious 
manner. 

If  there  is  a  point  of  P  fl  C  that  does  not  lie  in 
Ac{p),  we  repeat  the  same  procedure.  We  continue 
this  step  until  we  have  extracted  all  component  of  Qc 
that  contain  any  point  of  Pfl  C.  This  gives  A{L,Pr\ 
C). 

Repeating  this  step  for  all  cells  of  A{R,  P),  we  ob¬ 
tain  all  cells  of  A{L,  P).  Finally,  we  compute  the  ver¬ 
tical  decomposition  of  all  the  cells.  The  total  running 
time  for  Step  6  is  0(mlogn)-t-X]Ae^(fl  P)  0(u?(A)^) , 
and  its  expected  value  is 

0(m  log  n  -t-  K(r,  m)(n/r)^)  =  0{m?  -f  n) . 

Putting  all  the  pieces  together,  the  total  expected 
running  time  of  Steps  4-6  is  0(m^  +  nlogrz).  Let 
r(n,m)  denote  the  maximum  expected  time  of  the 
entire  algorithm,  then  we  obtain  the  following  recur¬ 
rence. 

9 

T(n,m)  <  T  [\n/t\  ,  mj)  +  C{w?  -|-  nlogn) , 

i=l 


where  rrii  <  ml\ft  for  i  <  q  =  [\/t  ],  Yli=i  ~ 
and  C  is  an  appropriate  constant.  The  solution  of 
this  recurrence  is 

T{n,  m)  =  0{m^  +  n  log  n) . 

If  m  >  a/ti  logn,  we  can  divide  the  points  of  P  into 
groups  of  size  i/n logn,  and  solve  the  subproblems 
separately.  This  standard  batching  technique  yields 
a  more  convenient  bound  for  the  expected  running 
time,  namely  0{m\/n\ogn  +  nlogn).  Hence,  we  can 
conclude 

Lemma  3.1  Given  a  set  L  of  n  lines  and  a  set  P  of 
m  <  n^  points  in  the  plane,  the  cells  of  A{L)  contain¬ 
ing  the  points  of  P  can  be  computed  by  a  randomized 
algorithm  in  expected  time  0{m\/n  logn  -f  nlogn). 

We  now  present  another  randomized  algorithm 
whose  running  time  is  significantly  better  for  larger 
values  of  m.  Although  the  basic  idea  is  the  same  as  in 
[1],  the  algorithm  presented  here  is  simpler  because 
we  allow  randomization. 

We  choose  a  random  subset  P  C  L  of  size  r,  where 

^  r  m^/3 

nV3  log^^^  (n/^/fn) 

Using  a  randomized  incremental  algorithm,  we  con¬ 
struct  a"  (P)  plus  a  point-location  data  structure  for 
a"(P)  in  expected  time  O(r^)  [6].  For  each  trape¬ 
zoid  A  €  a"  (P),  let  La  C  L  \  P  be  the  set  of  lines 
that  intersect  the  interior  of  A  and  Pa  Q  P  the  set  of 
points  that  are  contained  in  A.  La  can  be  computed 
in  time  0{nr)  by  tracing  each  line  through  Ai'  (P)  and 
Pa  can  be  computed  in  expected  time  O(mlogn)  by 
locating  each  point  of  P  in  A"(P).  Set  ua  =  |La| 
and  rriA  =  I  Pa  I-  Let  Za  be  the  set  of  trapezoids  in 
the  vertical  decomposition  of  cells  of  A(La  U  {9A}) 
that  intersect  the  boundary  of  A  and  lie  inside  A. 
Clarkson  et  al.  [9]  proved  that  \Za\  =  0{nA). 

For  each  trapezoid  A  G  A  (P),  it  is  sufficient 
to  compute  A"(La,Pa)  and  Za-  We  compute 
A(La,Pa)  in  expected  time  0(mA\/nA  log  ua  + 
Ha  lognA)  using  Lemma  3.1.  Za  can  be  computed  by 
a  randomized  incremental  algorithm.  Roughly  speak¬ 
ing,  we  clip  the  lines  of  La  \  P  within  A,  add  the 
clipped  segments  one  by  one  in  a  random  order,  and 
maintain  the  vertical  decomposition  of  the  cells  of 
the  segments  added  so  far  which  intersect  dA.  Fol¬ 
lowing  the  same  analysis  as  in  [6],  it  can  be  shown 
that  the  expected  running  time  of  the  algorithm  is 
O(nAlognA).  Hence,  the  expected  running  time  of 
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the  algorithm  is 

E  ^  O  ^rriA \/nAlognA  +  log riA^  + 
0{nr)  +  0(771  log  n) . 

By  a  result  of  Clarkson  and  Shor  [10]  (or  also  by 
Proposition  2.1),  we  have 

E  ^  77a 

A£A"{R) 


AeA'^(R) 

Thus,  the  expected  running  time  of  the  algorithm 
is  bounded  by  O {m \og[n/r)  +  nr  log  y + 777  log  n) . 
Substituting  the  value  r  in  the  above  expression,  we 
obtain 

Theorem  3.2  Given  a  set  L  of  n  lines  and  a  set  P 
of  777  points,  the  faces  of  A{L)  containing  the  points 
of  P  can  be  computed  by  a  randomized  algorithm  in 
expected  +  (tti  +  ti)  logTi)  time. 

4  Computing  cells  in  segment 
arrangements 

Next,  we  present  an  algorithm  for  computing  marked 
cells  in  arrangements  of  segments.  Let  5  be  a  set 
of  77  segments  and  P  a  set  of  m  points  in  the  plane. 
The  goal  is  to  compute  ^(5,  P)  and  its  vertical  de¬ 
composition  (5,  P).  Again,  we  begin  by  a  simpler 
algorithm  which  is  effective  for  few  cells,  and  then 
plug  the  random  sampling  technique  to  handle  larger 
values  of  m. 

The  outline  of  the  first  algorithm  is  the  same  as 
in  the  previous  section,  except  that  we  must  now  in¬ 
terpret  the  operations  in  terms  of  segments.  Since 
the  cells  of  (P,  P)  are  not  necessarily  simply  con¬ 
nected,  we  may  have  to  deal  with  m  +  n  polygons 
even  though  there  are  only  777  cells.  Consequently, 
the  computation  of  the  sets  of  cells  intersected  by 
each  segment  of  S'  \  P  in  Step  4  and  the  computation 
of  Sa  for  each  trapezoid  A  £  A"  (P,  P)  in  Step  5  now 
become  considerably  more  complicated.  Another  dif¬ 
ficulty  in  computing  Sa  is  that  we  now  have  to  detect 
intersections  between  simple  polygons  and  segments 
rather  than  between  convex  polygons  and  lines.  In 
the  remainder  of  this  section  we  will  describe  how  to 
compute  the  sets  Sa- 

The  boundary  of  each  cell  C,  dC,  of  A{R,P)  is 
composed  of  (at  most)  one  outer  component  and  a 


family  of  inner  components  such  that  C  lies  in  the 
interior  of  the  outer  component  and  in  the  exterior  of 
each  inner  component.  Each  component  of  dC  can 
be  regarded  as  a  simple  polygonal  chain.  Let  O  be 
the  set  of  outer  boundary  components  of  the  cells  in 
A{R,  P),  and  let  I  be  the  set  of  the  inner  boundary 
components  of  these  cells.  We  have  \0\  <  m  and 
|I|  <  777  -1-  77.  Let  p  be  the  total  number  of  edges  of 
all  polygons  in  O  Ul;  obviously,  p  <  r]{n/t,  m). 

We  first  decompose  each  segment  g  G  S  \  R  into 
maximal  subsegments,  so  that  each  subsegment  lies 
in  the  interior  of  some  outer  component  O,  i.e.  we 
cut  each  segment  at  the  intersection  points  of  O  and 
S  and  discard  the  subsegments  that  lie  in  the  exte¬ 
rior  of  O.  Let  S  be  the  set  of  resulting  subsegments. 
Next,  for  each  subsegment  cr  G  S,  we  compute  the 
trapezoids  of  a"  (P,  P)  intersected  by  a. 

Suppose  that  we  have  already  computed  S  in 
Step  4.  Then  in  Step  5  we  compute  Sa,  for  all 
A  G  a"  (P,  P),  as  follows.  We  preprocess  each  polyg¬ 
onal  chain  I  £  I,  in  linear  time,  for  ray  shooting 
queries,  so  that  the  first  intersection  point  of  a  query 
ray  and  I  can  be  computed  in  logarithmic  time,  see 
[5,  15].  The  total  time  spent  in  preprocessing  I  is 
0{p)  0{ri{n,m)). 

Let  cr  be  a  segment  of  E  that  lies  in  the  interior  of 
the  outer  component  O  G  O  of  dC.  Let  a,  b  be  the 
endpoints  of  a,  and  let  A(o)  be  the  trapezoid  of  C" 
containing  a.  If  a  is  not  an  endpoint  of  a  segment  of 
S\R,  then  a  lies  on  the  boundary  of  A(a).  We  check 
whether  b  G  A(o).  If  the  answer  is  ‘yes’,  then  A(a) 
is  the  only  trapezoid  of  C*'  intersected  by  a,  and  we 
stop.  If  6  ^  A(o),  we  compute  the  other  intersection 
point,  Or,  of  cr  and  A  (a).  If  ui  lies  on  a  vertical  edge 
of  A(a),  we  also  compute,  in  constant  time,  the  next 
trapezoid  A(ai)  of  c  "  intersected  by  a.  We  then  re¬ 
peat  the  same  step  with  oi  and  A(ai).  if  oi,  on  the 
other  hand,  lies  on  an  edge  of  the  cell  C,  then  ai  lies 
on  the  boundary  of  some  inner  component  /  G  I  of 
C,  and  the  portion  of  the  segment  a  immediately  fol¬ 
lowing  oi  lies  outside  C.  Using  the  ray  shooting  data 
structure,  we  compute  the  next  intersection  point  02 

of  the  polygonal  chain  I  and  the  segment  aib.  Once 
we  know  02,  we  can  also  compute  the  trapezoid  of 
C"  containing  02,  a-nd  we  continue  tracing  a  through 

c". 

For  each  trapezoid  intersected  by  cr,  we  spend 
O(log77)  time,  so  the  total  time  spent  in  comput¬ 
ing  the  fco-  trapezoids  intersected  by  cr  is  0{ka  log  77). 
Summing  over  all  segments  of  E,  the  total  time  spent 
is  E,.eS<^(^<rlog77)  =  O  ’ 

where  77  a  =  |5'a|- 
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Next,  we  describe  how  to  compute  the  set  S.  No¬ 
tice  that  it  is  sufficient  to  compute  all  intersection 
points  between  the  segments  of  S  \  R  and  the  outer 
polygonal  chains  in  O. 

Let  Jo  be  an  interval  corresponding  to  the  projec¬ 
tion  of  the  polygoal  chain  O  e  O  onto  the  x-axis,  and 
let  J  =  {Jo  |OgO}. 

We  construct  an  interval  tree  T  on  J-  see  Mehlhorn 
[17]  for  details  on  interval  trees.  T  is  a  minimum 
height  binary  tree  with  at  most  2m  leaves.  Each  node 
n  of  T  is  associated  with  an  interval  Uy,  and  a  point 
Xy.  Let  Wy  =  UyX  [— oo,  -|-oo]  be  a  vertical  strip,  and 
let  hy  be  the  vertical  line  passing  through  Xy .  For  the 
root  u,  Wy  is  the  entire  plane  and  hy  is  the  vertical 
line  passing  the  middle  endpoint  of  the  intervals  of 
J.  Each  interval  J  G  J  is  stored  at  the  highest  node 
n  of  T  such  that  Xy  G  J. 

Let  Jy  be  the  set  of  intervals  stored  at  v.  We  as¬ 
sociate  two  subsets  Oy,Zy  of  O  with  v.  Let  Oy  = 
{O  I  Jo  e  Jv]  and  Zy  =  [jy^Oy,,  where  the  union 
is  taken  over  all  descendants  of  v,  including  v;  set 
ruy  =  \Oy\  and  Zy  —  \Zy\.  Finally,  let  izy  (resp.  Cu)  de¬ 
note  the  total  number  of  edges  in  Oy  (resp.  Zy).  Since 
each  polygonal  chain  of  O  appears  in  exactly  one 
Oy,  we  have  M  and  Y^y^xC^  ^  2^1ogm. 

Moreover,  it  can  be  shown  that  if  vi ,  V2  are  the  chil¬ 
dren  of  V  then  Zy^,Zy.^  <  Zy/2,  which  implies  that 
Yyer  4  =  0{w?). 

Since  the  polygonal  chains  in  Oy  are  pairwise  dis¬ 
joint  and  all  of  them  intersect  a  vertical  line,  we  can 
regard  Oy  along  with  appropriate  portions  of  the  ver¬ 
tical  line  hy  as  a  simple  polygon  IIi,,  and  prepro¬ 
cess  IIu  in  0(/i„)  time  for  answering  ray  shooting 
queries.  Using  this  data  structure,  one  can  report 
all  k  intersection  points  of  a  segment  g  and  Oy  in 
time  0{{k  +  1)  log  jZy). 

Next,  we  take  the  convex  hull  of  each  polygonal 
chain  in  Zy,  and  preprocess  the  resulting  convex  poly¬ 
gons  into  a  data  structure,  as  described  in  the  previ¬ 
ous  section,  so  that  all  convex  polygons  intersected 
by  a  query  line  can  be  reported  quickly.  Since  any 
two  polygonal  chains  of  O  are  disjoint,  the  bound¬ 
aries  of  their  convex  hulls  intersect  in  at  most  two 
points,  and  so  they  have  at  most  4  common  tan¬ 
gents.  Consequently,  the  line  intersection  searching 
structure  has  size  0{zl  +  (^y).  Moreover,  it  can  be 
computed  in  time  0{zl  -t-  z^jlogCt;  +  Ci;))  using  the 
algorithm  of  [19].  We  also  preprocess  each  O  G  O 
in  linear  time  for  ray  shooting  queries  as  in  [15].  It 
can  be  shown  that  the  total  preprocessing  time  is 
Oim^  +  Yy  (-Zi^logCt,  +  Ci;))  =  0(m2+m  log m  log  n-k 
glogm).  We  omit  the  details. 

Let  g  G  S\Rhe  a,  segment.  All  intersection  points 
of  g  and  O  can  be  computed  as  follows.  We  search  the 


tree  T  with  g  starting  from  the  root.  Let  n  be  a  node 
visited  by  the  query  procedure.  If  the  endpoints  of  g 
do  not  lie  in  the  vertical  strip  Wy,  i.e.,  g  completely 
crosses  Wy,  then  g  intersects  O  G  Zy  if  and  only  if 
the  line  supporting  g  intersects  the  convex  hull  of  O. 
Thus,  we  first  compute  all  polygonal  chains  of  Zy 
intersected  by  g,  using  the  line  intersection  searching 
structure,  and  then,  for  each  O  G  Zy  intersected  by  g, 
we  compute  the  intersection  points  of  g  and  O  using 
the  ray  shooting  data  structure.  If  kg  is  the  number 
of  intersection  points  between  g  and  the  polygonal 
chains  of  Zy,  then  the  total  time  in  reporting  these 
intersections  is  0{{kg  -t-  IjlogCi;)- 

If  one  of  the  endpoints  of  g  lies  in  Wy,  we  can  com¬ 
pute  all  Ug  intersection  points  between  Oy  and  g  in 
time  0{{ag  -f  1)  log/i^),  using  the  ray  shooting  data 
structure  for  Oy .  Let  vi ,  V2  be  the  children  of  the 
node  V.  If  g  intersects  Wy^  (resp.  we  visit  vi 

(resp.  V2).  It  is  easily  shown  that  the  query  proce¬ 
dure  visits  O(logm)  nodes,  and  the  query  time  is 
0((logm  -f  kg)\ogn),  where  kg  is  the  total  number 
of  intersection  points  reported. 

We  repeat  this  procedure  for  all  segments  g  G  S\R. 
Since  fx  <  ri{n,m)  =  0(m^  -h  n(logm  -1-  a{n))) 
and  Yges\Rh  ^  the  total  cost 

of  computing  the  intersection  points  is  0{{m^  -k 
nlogm  -f  na(n))logn  -k  (p.p) As 

in  the  previous  section,  the  time  spent  in  Step  6 
(refining  the  cells  of  Jl'*(i?, P))  is  0(Ya'’^'a)-  Us¬ 
ing  Proposition  2.1  (ii),  we  obtain  that  the  to¬ 
tal  expected  time  spent  in  the  merge  step  is 
O  [{nn?  -k  nlogm  -k  na{n))  logn). 

Following  the  same  analysis  as  earlier,  we 
can  conclude  that  the  total  running  time  of 
the  first  algorithm  for  computing  A{S,  P)  is 
O  ( (m^  -k  n  log  m  +  na{n))  log n) .  We  can  again  use 
the  batching  technique  if  m  is  large.  Omitting  the 
details,  we  obtain 


Lemma  4.1  Given  a  set  S  of  n  segments  and  a  set 
P  of  m  points,  the  faces  of  A{S)  that  contain  a  point 
of  P  can  be  computed  hy  a  randomized  algorithm  in 
expected  time  0{{my/n\ogm+n(\ogm-[-a{n)))  log n). 


For  larger  values  of  m,  we  again  use  the  ran¬ 
dom  sampling  technique  as  in  the  previous  section. 
That  is,  we  choose  a  random  subset  P  C  S'  of  size 

77i2/3  \o^/^{n/ ^Jm) 


r  = 


^1/3  cP-A{nj  ^Jm) 


,  and  compute  A  (R). 


For  each  A  G  Jl''(P),  we  compute  Pa  =  P  H  A 
and  Sa,  the  set  of  segments  that  intersect  A.  We 
clip  the  segments  within  A.  The  total  time  spent 
in  this  step  is  0(r^  +  (m  +  nr)logr).  Let  ^  be  a 
point  lying  in  the  unbounded  face  of  M(S).  For  each 
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A  6  we  compute  A'\Sa,Pa  U  {2}),  in  time 

0((mA\/nA  log ruA  +  nA(logTOA  +  Q(nA)))  logriA), 
using  Lemma  4.1,  and  then  glue  them  together.  We 
omit  the  rather  routine  details  from  here.  The  overall 
expected  running  time  of  the  algorithm  is 


0{{nAia{nA) +logmA)+ 

AeA"{R) 

niA^/nA  log tua  +  0((m  +  nr)  logr) . 


[6]  B.  Chazelle,  H.  Edelsbrunner,  L.  Guibas,  M.  Sharir 
and  J.  Snoeyink,  Computing  a  face  in  an  arrange¬ 
ment  of  line  segments,  SIAM  J.  Computing  22 
(1993),  1286-1302. 

[7]  B.  Chazelle  and  J.  Friedman,  A  deterministic  view 
of  random  sampling  and  its  use  in  geometry,  Com- 
binatorica  10  (1990),  229-249. 

[8]  K.  Clarkson,  Computing  a  single  face  in  an  arrange¬ 
ment  of  segments,  1990,  manuscript. 


Again,  using  the  results  by  Clarkson-Shor  [10]  and 
substituting  the  value  of  r,  we  obtain 


Theorem  4.2  Given  a  set  S  of  n  segments  and  a  set 
P  of  m  points,  the  faces  of  A{S)  that  contain  a  point 
of  P  can  be  computed  by  a  randomized  algorithm  in 
expected  time  log'*^^  + 

nlogm  +  na{n))  logn). 


Finally,  let  us  remark  that  if  .4(5)  is  sparse,  that 
is,  if  it  has  only  k  =  o{n^)  vertices,  then  using 
the  fact  that  the  expected  number  of  trapezoids  in 
.4"(i?)  is  0{kr‘^/n^  +  r),  we  can  do  a  more  care¬ 
ful  analysis,  choose  r  =  n  (f 
can  show  that  the  expected  running  time  of  the  algo¬ 
rithm  is  log^'^®  ^/^(^)-f  (m-l-nlogm-l- 

na(n))  logn) . 


Acknowledgments.  The  authors  thank  Mark  de 
Berg,  Mark  Overmars,  and  Micha  Sharir  for  several 
useful  discussions. 


References 

[1]  P.  Agarwal,  Partitioning  arrangements  of  lines:  II. 
Applications,  Discrete  and  Computational  Geome¬ 
try  5  (1990),  533-573. 


[9]  K.  Clarkson,  H.  Edelsbrunner,  L.  Guibas,  M.  Sharir 
and  E.  Welzl,  Combinatorial  complexity  bounds  for 
arrangements  of  curves  and  spheres.  Discrete  and 
Computational  Geometry  5  (1990),  99-160. 

[10]  K.  Clarkson  and  P.  Shor,  Applications  of  random 
sampling  in  computational  geometry  II,  Discrete 
and  Computational  Geometry  A  (1989),  387-421. 

[11]  H.  Edelsbrunner,  Algorithms  in  Combinatorial  Ge¬ 
ometry,  Springer- Verlag,  Berlin,  1987. 

[12]  H.  Edelsbrunner,  L.  Guibas  and  M.  Sharir,  The 
complexity  of  many  faces  in  arrangements  of  lines 
and  of  segments,  Discrete  and  Computational  Ge¬ 
ometry  5  (1990),  161-196. 

[13]  H.  Edelsbrunner  and  E.  Welzl,  On  the  maximal 
number  of  edges  of  many  faces  in  an  arrange¬ 
ment,  Journal  of  Combinatorial  Theory,  Series  A 
41  (1986),  159-166. 

[14]  L.  Guibas  and  M.  Sharir,  Combinatorial  and 
algorithms  of  arrangements,  in  New  Trends  in 
Discrete  and  Computational  Geometry  (J.  Pach, 
ed.),  Springer- Verlag,  New  York-Berlin-Heidelberg, 
1993,  9-36. 

[15]  J.  Hershberger  and  S.  Suri,  A  pedestrian  approach 
to  ray  shooting:  Shoot  a  ray,  take  a  walk,  Proc.  4th 
ACM-SIAM  Symp.  Discrete  Algorithms,  1993,  pp. 
54-63. 


[2]  B.  Aronov,  H.  Edelsbrunner,  L.  Guibas  and  M. 
Sharir,  Improved  bounds  on  the  complexity  of 
many  faces  in  arrangements  of  segments,  Combi- 
natorica,  12  (1992),  261-274. 

[3]  M.  de  Berg,  K.  Dobrindt  and  O.  Schwarzkopf,  On 
lazy  randomized  incremental  construction,  to  ap¬ 
pear  in  Proceedings  26th  Annual  ACM  Symposium 
on  Theory  of  Computing,  1994. 

[4]  R.  Canham,  A  theorem  on  arrangements  of  lines  in 
the  plane,  Israel  J.  Math.  7  (1969),  393-397. 

[5]  B.  Chazelle,  H.  Edelsbrunner,  M.  Grigni,  L. 
Guibas,  J.  Hershberger,  M.  Sharir  and  J.  Snoeyink, 
Ray  shooting  in  polygons  using  geodesic  triangula¬ 
tions,  Proc.  17th  Int.  Colloq.  Automata,  Languages 
and  Programming,  1991,  pp.  661-673. 


[16]  D.  Haussler  and  E.  Welzl,  Epsilon-nets  and  simplex 
range  queries.  Discrete  Comput.  Geom.  2  (1987), 
127-151. 

[17]  K.  Mehlhorn,  Data  Structures  and  Algorithms  3: 
Multi-dimensional  Searching  and  Computational 
Geometry,  Springer- Verlag,  Berlin,  1984. 

[18]  K.  Mulmuley,  A  fast  planar  partition  algorithm,  I, 
J.  Symbolic  Gomputation  10  (1990),  253-280. 

[19]  K.  Mulmuley,  A  fast  planar  partition  algorithm,  II, 
J.  ACM  38  (1991),  74-103. 

[20]  E.  Szemeredi  and  W.  Trotter  Jr.,  Extremal  prob¬ 
lems  in  discrete  geometry,  Combinatorica  3  (1983), 
381-392. 


84 


Matching  Shapes  with  a  Reference  Point 


Helmut  Alt* 

Freie  Universitat  Berlin 
Fachbereich  Mathematik  und  Informatik 
Takustrafie  9,  D-14195  Berlin,  Germany 
e-mail:  altfflinf  .  fu-berlin.  de 


Oswin  AichholzeP 

Institut  fiir  Grundlagen  der  Informationsverarbeitung 
Technische  Universitat  Graz 
Schiefistattgasse  4,  A-8010  Graz,  Austria 
e-mail:  oaichQigi .  tu-graz .  ac .  at 


Giinter  Rote 

Institut  fiir  Mathematik,  Technische  Universitat  Graz 
Steyrergasse  30,  A-8010  Graz,  Austria 
e-mail:  roteQftug. dnet .  tu-graz . ac .  at 


Abstract 

For  two  given  point  sets,  we  present  a  very  simple 
(almost  trivial)  algorithm  to  translate  one  set  so 
that  the  Hausdortf  distance  between  the  two  sets 
is  not  larger  than  a  constant  factor  times  the  min¬ 
imum  Hausdortf  distance  which  can  be  achieved  in 
this  way.  The  algorithm  just  matches  the  so-called 
Steiner  points  of  the  two  sets. 

The  focus  of  our  paper  is  the  general  study  of 
reference  points  (like  the  Steiner  point)  and  their 
properties  with  respect  to  shape  matching. 

For  more  general  transformations  than  transla¬ 
tions,  our  method  eliminates  several  degrees  of  free¬ 
dom  from  the  problem  and  thus  yields  good  match¬ 
ings  with  improved  time  bounds. 

1  Introduction 

This  work  is  motivated  by  a  problem  that  is  typ¬ 
ical  in  application  areas  such  as  computer  vision 
or  pattern  recognition,  namely,  given  two  figures 
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A,B,  to  determine  how  much  they  “resemble  each 
other”. 

Here,  a  “figure”  will  be  a  union  of  finitely  many 
points  and  line  segments  in  or  triangles  in  E^. 
Note  that  sets  of  curves  in  E^  and  E^  or  surfaces 
in  E^  can  be  approximated  arbitrarily  closely  by 
these  objects.  As  a  measure  for  “resemblance”  we 
wiU  use  the  Hausdorff-metric  Sh,  which  is  a  some¬ 
how  natural  distance  measure  and  gives  reasonable 
results  in  practice  (see  [HKR]).  It  is  defined  as  fol¬ 
lows: 

Definition  1  Let  A,  B  be  compact  subsets  of  E^ 
or  E^.  Then 

SaiA,  B)  :=  max  min  ||a  -  &|1, 

a^A  b£B 

where  ||  j|  is  the  Euclidean  norm  and  the  Hausdorff- 
distance  is  defined  as 

^ff(A,R):=max{^(A,R),^(i?,A)}. 

If  A  and  B  consist  of  n  and  m  line  segments  in 
the  plane,  their  Hausdorff-distance  Sh{A,B)  can 
be  computed  in  time  0{{n  -f  m)log(n  +  m))  (cf. 
[ABB]).  However,  it  is  more  natural  to  assume 
that  A  and  B  are  not  fixed  but  can  be  moved  by  a 
translation,  by  a  rigid  motion  (translation  and  ro¬ 
tation)  or  even  transformed  by  a  similarity  (scal¬ 
ing  and  rigid  motion)  in  order  to  match  them  as 
well  as  possible  and  then  determine  the  minimal 
Hausdorff-distance.  So,  in  general,  we  have  a  set  T 
of  allowed  transformations  and  want  to  determine 
for  given  figures  A  and  B: 

min  S„{A,T{S)). 
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Note  that,  for  similarities,  it  makes  a  difference  if 
we  exchange  the  sets  A  and  B  in  this  problem. 

This  problem  of  finding  an  optimal  matching  has 
been  considered  for  the  two-dimensional  case  in 
several  previous  articles:  In  [ABB],  an  algorithm 
of  running  time  O  {(nm)log{nm)  log* (nm))  for  the 
case  that  T  is  the  set  of  translations  along  one 
fixed  direction  is  described.  An  0  {{nmY  log^ (nm)) 
algorithm  for  arbitrary  translations  is  presented 
in  the  paper  [AST],  which  can  be  improved  to 
O(nm(n-l-m)log(nm))  if  A  and  B  are  finite  sets  of 
points  [HKS].  The  paper  [CGHKKK]  gives  an  al¬ 
gorithm  of  time  complexity  0  ((nm)^  log^ (nm))  for 
arbitrary  rigid  motions.  The  latter  two  algorithms 
use  sophisticated  and  powerful  tools  like  paramet¬ 
ric  search  and  therefore  do  not  seem  to  be  applica¬ 
ble  in  practice. 

Here,  we  foUow  a  different  approach  which  was 
already  used  in  [ABB].  We  do  not  try  to  find  an 
optimal  solution  but  an  approximation  to  the  op¬ 
timal  one  by  simpler  algorithms.  More  precisely,  if 
the  optimal  matching  yields  Hausdorff-distance  8 
our  algorithms  will  find  a  matching  T  such  that 

6H(A,T(B))<a6 

for  some  constant  a  >  1.  We  call  such  a  solution  a 
pseudooptimal  matching  with  loss  factor  a. 

A  somehow  similar  approach  has  been  taken  in 
several  articles  by  Heffernan  and  Schirra  (  [H,HS]) 
considering  matching  of  finite  sets  A,B  of  points 
under  rigid  motions.  They  consider  both,  the  deci¬ 
sion  problem,  whether  for  a  given  £  A  (or  a  subset 
of  a  given  size  k)  can  be  mapped  in  a  1- 1-fashion 
into  the  £-neighborhoods  of  points  of  B,  and  the 
optimization  problem,  namely  to  find  the  smallest 
£  of  this  kind.  Both  problems  are  solved  very  effi¬ 
ciently  by  allowing  that  the  optimal  e  is  not  found 
exactly,  but  within  a  small  tolerance  interval  (ap¬ 
proximate  algorithms). 

The  aim  of  this  paper  is  to  work  out  the  general 
idea  of  using  “reference  points”  for  pseudooptimal 
algorithms.  Reference  points  were  first  introduced 
in  [ABB]  and  used  in  [CKS]  to  obtain  an  0(mn 
log(mn))  approximate  algorithm  for  the  decision 
problem  for  sets  of  line  segments.  Here  we  will  also 
present  a  reference  point  that  gives  better  bounds 
than  the  one  in  [ABB]  and  can  be  applied  to  simi¬ 
larities  and  problems  in  three  dimensions  as  well. 


2  Reference  Point  Methods 

Like  in  [ABB]  pseudooptimal  algorithms  use  suit¬ 
able  reference  points,  which  we  define  for  arbitrary 
dimension  d  as  follows: 

Definition  2  Let  denote  the  set  of  compact 
subsets  of  and  let  T  be  a  set  of  transforma¬ 
tions  on  E'*.  A  mapping  s:  ^  E'^  is  called  a 

reference  point  with  respect  to  T  if  the  following 
two  conditions  are  satisfied: 

(a)  s  is  equivariant  with  respect  to  T ,  i.e.,  for  all 
A,B  ^  C^and  T  £  T  we  have 

s(T(A))=T(s(A)). 

(b )  There  exists  some  constant  c  >  0  such  that  for 
allA,Be  C\ 

\\s(A)~s(B)\\<c-8h(A,B). 

In  other  words,  s  is  a  Lips  chit  z- continuous 
mapping  between  the  metric  spaces  (C‘‘,  8h) 
and  (E*^,  ||  •  ||)  with  Lipschitz  constant  c.  We 
call  c  the  quality  of  the  reference  point  s. 

Based  on  the  existence  of  a  reference  point  for  T 
we  obtain  the  following  algorithms  for  pseudoopti¬ 
mal  matchings  where  T  is  the  set  of  translations, 
rigid  motions,  and  similarity  transformations,  re¬ 
spectively: 

Algorithm  T 

1.  Compute  s(A)  and  s(B)  and  translate  B  by 
s(A)  — s(H)  (so  that  s(B)  is  mapped  onto  5(A)). 
Let  B'  be  the  image  of  B. 

2.  Output  B'  as  the  pseudooptimal  solution  (to¬ 
gether  with  the  Hausdorff-distance  8h(A,B')). 

Algorithm  R 

1.  as  in  Algorithm  T. 

2.  Find  an  optimal  matching  of  A  and  B'  under 
rotations  of  B'  around  s(A). 

3.  Output  the  solution  B"  and  the  Hausdorff- 
distance  8h(A,B"). 

Algorithm  S 

1.  as  in  Algorithm  T. 

2.  determine  the  diameters  d(A)  and  d(B)  and 
scale  B'  by  a  :=  d(A)/d(B)  around  the  cen¬ 
ter  s(A). 

3.  as  Step  2  in  algorithm  R  with  the  scaled  image 
of  B'. 

4.  as  Step  3  in  algorithm  R. 
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As  the  algorithms  R  and  S  are  formulated,  they 
look  only  for  proper  rigid  motions  and  positive  sim¬ 
ilarities,  respectively.  Reflections  can  be  included 
by  simply  running  the  algorithm  a  second  time  with 
a  reflected  copy  of  A. 

These  algorithms  are  simpler  than  the  ones  for 
finding  the  optimal  solutions,  since  after  Step  1  the 
matchings  are  restricted  to  ones  leaving  the  refer¬ 
ence  point  invariant.  In  d  dimensions  this  elimi¬ 
nates  d  degrees  of  freedom.  The  qualities  and  run¬ 
ning  times  of  these  algorithms  are  as  follows: 

Theorem  1  Suppose  that  we  can  determine  a  ref¬ 
erence  point  of  quality  c  for  the  sets  of  transfor¬ 
mations  T  in  the  Algorithms  T,  R,  and  S  in  linear 
time.  In  the  case  of  similarity  transformations  also 
assume  that  s(A)  always  lies  within  the  convex  hull 
conv(A). 

(a)  Algorithm  T  finds  a  pseudooptimal  matching 
for  translations  with  loss  factor  a  =  c  -f-  1 . 

(b)  Algorithm  R  finds  a  pseudooptimal  matching 
for  rigid  motions  with  loss  factor  a  =  c  -f  1 . 

(c)  Algorithm  S  finds  a  pseudooptimal  matching 
for  similarity  transformations  with  loss  factor 
a  =  c  -b  3. 

In  the  plane,  the  running  times  for  two  sets  of 
n  and  m  points  and  line  segments  are  0{fn 
m)  log(n-l-m))  for  Algorithm  T  and  0  {nm\og{nm) 
log*(nm))  for  Algorithms  R  and  S.  In  three  dimen¬ 
sions,  where  A  and  B  are  sets  of  triangles,  the 
running  times  become  0{mn)  for  Algorithm  T  and 
0{{nmyH{n,m))  for  Algorithms  R  and  S.  Here 
H{n,m)  is  the  time  to  compute  the  Hausdorff  dis¬ 
tance. 

Note  that  an  upper  bound  of  0{{n^m  +  nm?) 
log^{nm))  for  H{n,m)  is  known  [AG]. 

For  the  proof  of  the  theorem  we  need  the  fol¬ 
lowing  lemmas,  which  can  be  shown  by  elementary 
geometrical  considerations. 

Lemma  2  Let  B  C  be  a  compact  set  with  di¬ 
ameter  d{B),  and  let  p  be  a  point  in  its  convex  hull 
conv{B).  Let  Ti,T2  be  homotheties  (scalings)  with 
center  p  and  ratios  (scaling  factors)  Oi  and  a^,  re¬ 
spectively.  Then 

SH{r,(B),r2(B))  <  \(a,  -  a2)d(B)\ . 


Lemma  3  If  A,  B  CR'^  are  compact  sets  with  di¬ 
ameters  d(A)  and  d(B),  respectively,  then 

\d(A)-d(B)\  <  2Sh(A,B). 

Proof:  This  holds  because  B  is  contained  in  the 
^-neighborhood  of  A  and  vice  versa.  0 

Proof  of  Theorem  1:  We  prove  only  (c)  which 
implicitly  contains  the  proofs  for  (a)  and  (b).  Con¬ 
sider  an  optimal  similarity  transformation  5opf  If 
can  be  written  as  5opt  =  Topt  o  Topt,  where  Topt 
a  rigid  motion  and  Topt  is  a  homothety  with  ra¬ 
tio  a'.  Let  6  be  the  optimal  Hausdorff-distance 
S  =  SH(A,Sopt(B)).  Then 

\\s(A)  -  s(S,p,(B))\\  <  cS. 

Let  t  be  the  translation  by  s(A)  —  s(Sopt(B))]  then 
S  =  to  Sopt  is  a  similarity  transformation  mapping 
s(B)  onto  s(A)  and 

(1)  6h(A,S(B))  <(c+l)S. 

Write  5  as  5  =  r  0  T,  where  f  a  rigid  motion 
mapping  s(B)  onto  s(A)  and  r  is  a  homothety  with 
center  s(A)  and  ratio  a'.  Let  a  —  d(A)/d(B)  as  in 
Algorithm  S,  r  the  homothety  with  center  s(A)  and 
ratio  a,  and  S  =  t  oT.  Then 

(2)  6^  (A,  S(B))  <Sh(A,  S(B))  +  6h  (S(B),  S(B)) 


Now, 

(3)  6jj(S(B),S(B))  =  Sh  {r(f(B)),  r(f(P))) 


< 


(a' -  a)d(f(B))\,  by  Lemma  2 


=  \(a'  -  a)d(B)\  =  \a'd(B)  -  d(A)\ 

=  \d(Sopt(B))-d(A)\, 

(since  a'  was  the  ratio  of  ^opt) 

<2-SH(Sopt(B),A)  =  26, 

by  Lemma  3.  From  (1),  (2),  and  (3)  we  have 

(4)  <(c+3)^ 

for  a  similarity  transformation  S  composed  of  a 
rigid  motion  that  maps  s(B)  onto  s(A)  and  a  ho¬ 
mothety  with  center  s(A)  and  ratio  a.  Since  Algo¬ 
rithm  S  finds  the  optimum  among  these  similarity 
transformations  the  bound  (4)  holds  for  it,  as  well. 

For  the  time  bound  we  observe  that  Step  1  can 
be  done  in  linear  time,  and  Step  2  in  O(nlogn  -\- 
mlogm).  (In  three  dimensions,  the  diameters  can 
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Figure  1:  5(^4)  =  (1/3,0),  s{B()  —  (1/2,0),  where  s  is  either  the  center  of  the  smallest  enclosing  ellipse 
or  the  centroid  of  the  figure.  This  can  be  seen  by  applying  an  affine  transformation  which  maps  to  an 
equilateral  triangle  or  to  a  square. 


be  computed  trivially  in  0{v?  B'm?')  time.)  Step  3, 
finding  the  optimal  matching  under  rotations,  can 
be  done  in  time  O(nmlog(nm)log*(nm))  in  the 
plane  by  the  algorithm  of  [ABB],  whose  analysis 
relies  on  Davenport-Schinzel  sequences.  This  algo¬ 
rithm  explicitly  computes  the  Hausdorff-distance, 
so  Step  4  is  for  free.  In  3-space,  we  have  rotations 
in  around  a  fixed  center,  which  is  an  optimal 
matching  problem  with  2  degrees  of  freedom.  It 
can  be  solved  in  time  OiinmYHin.m))  by  meth¬ 
ods  of  [ABB].  □ 

3  The  Steiner  Point 

The  previous  section  would  be  useless  if  it  were  not 
possible  to  find  suitable  reference  points.  In  [ABB] 
it  was  observed  that  in  the  two-dimensional  case 
the  point  s(A)  =  (s^ax,  2/inax),  where  x^ax  and 
are  the  maximal  x-  and  ^/-coordinates  of  points  in 
A,  is  a  reference  point  of  quality  -\/2  for  transla¬ 
tions. 

For  rigid  motions  the  situation  is  not  as  easy. 
We  wiU  first  list  a  few  points  that  come  to  mind 
but  turn  out  not  to  be  reference  points.  In  fact, 
for  arbitrary  small  ^  >  0,  we  can  construct  figures 
Ai,  Bj  for  which  Sh{As,  B^)  <  but  ||s(A{)  - 
s(i?i)||  is  not  in  0{6)  or  does  not  even  converge  to 
0  for  ^  0.  Some  of  these  counterexamples  are 


from  [B],  some  are  new: 

s(A)  is  the  centroid  of  the 

||•s(A^)  —  5(i?i)|| 

a)  vertices  of  the  convex  huU 

fi(l) 

b)  convex  hull 

^  fi(l) 

c)  smallest  enclosing  circle 

fi(v^) 

d)  smallest  enclosing  ellipse 

17(1) 

e)  smallest  enclosing  rectangle 

0(1) 

Figure  1,  for  example,  shows  Ag,  Bg  for  cases  b) 
and  d).  In  [ABB]  also  a  positive  example  of  a  refer¬ 
ence  point  for  rigid  motions  in  two  dimensions  was 
given:  the  centroid  of  the  boundary  of  the  convex 
hull.  It  was  shown  that  this  reference  point  is  of 
quality  at  most  47r  +  3  w  15.57.  Here,  we  will  give 
a  reference  point  which  works  even  for  similarity 
transformations,  is  easy  to  compute,  can  be  gener¬ 
alized  to  higher  dimensions,  and  whose  quality  is 
4/7r  Si  1.27. 

First  we  observe  that  we  can  without  loss  of  gen¬ 
erality  restrict  our  attention  to  convex  figures.  In 
fact,  in  [ABB]  it  was  shown  that  for  any  two  com¬ 
pact  sets  A,B: 

Sniconv^A),  conv{B))  <  6h(A,B). 

From  that  it  follows  easily  that  a  reference  point  for 
the  convex  hull  of  a  compact  set  A  is  a  reference 
point  for  A,  as  well. 

Our  candidate  for  a  reference  point  is  the  so- 
called  Steiner  point,  which  has  been  investigated 
intensively  in  the  field  of  convex  geometry  [G,S, 
Sch]. 

Definition  3  We  denote  by  B^  the  d-dimensional 
unit  ball  and  by  its  boundary,  the  [d  —  1)- 

dimensional  unit  sphere  in  R'^. 

Let  A  be  a  convex  body  [convex  and  compact 
subset)  in  E*^.  The  support  function  E'^  E 
of  A  is  given  by 

/ia(u)  =  max  (a,  u) 

ag  A 

[see  Figure  2). 

The  Steiner  point  s(A)  of  A  is  defined  as 
d  f 

Vol(5^-i)  J  ^A[u)udoj[n), 
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Figure  2:  The  support  functions  hA(u),  hB(u)  of  two  convex  bodies  A,B. 


where  du>{u)  is  the  surface  element  of 

For  a  non-convex  compact  set  A  G  we  define 
the  support  function  and  hence  the  Steiner  point 
by  the  same  definitions.  They  coincide  with  the 
support  function  and  the  Steiner  point  of  the  convex 
hull  conv{A). 

For  the  Steiner  point  we  obtain 

Theorem  4  The  Steiner  point  is  a  reference  point 
for  similarity  transformations  in  arbitrary  dimen¬ 
sion  d.  For  d  =  2,  its  quality  is  A  ji: ,  for  d  =  3  it  is 
3/2,  for  arbitrary  d  it  is  less  than  -\/2/7r  •  ^/d  +  1. 

Proof:  The  equivariance  of  the  Steiner  point  under 
similarity  transformations  is  well  known  [G,Sch]. 
For  the  bound  on  the  quality,  we  observe  that  for 
two  convex  bodies  A  and  B,  ||/ix(u)  —  /iB(u)||  < 
6h{A,B)  =:  S  for  any  u  6  S‘^~^  (see  Figure  2). 

Now  let  p  =  s(A)  -  s{B),  and  consider  the 
inner  product  of  p  with  an  arbitrary  unit  vec¬ 
tor  e.  Without  loss  of  generality  we  assume  that 
e  =  (0, . . .  ,  0, 1)  is  the  unit  vector  in  the  d-th  co¬ 
ordinate  direction. 

(p,e)  =  (s(A)  -  s{B),e) 

=  Vol(5rf-r)  /(^^(")-^s(u))(u,e)da;(u) 

5<i-i 


Vol{S^-^) 


j  (+d)(u,e)dw(u) 


J {-S){u,e)  duj{u) 


Vol(5‘'-i)/2 


J  (u,e)dw(u) 


The  expression  in  brackets  in  the  last  line  is  noth¬ 
ing  but  the  d-th  coordinate  of  the  center  of  gravity 
of  the  upper  unit  half-sphere.  If  we  compute  the 
integral  by  projecting  away  the  d-th  coordinate  and 
integrate  over  (xi,X2,  ■  ■  ■  ,  Xd-i)  €  B‘‘~^  C  a 

straightforward  calculation  gives  that  the  surface 
element  is  transformed  by 

du;(u(a;i,  a;2, . . .  =  - - -dxidx2  ■  ■  -dx^^i. 

(u,  e) 

Thus  the  integral  turns  out  to  be  just  the  volume 
of  B‘^~^ ,  and  we  get  the  following  bound  on  (p,e): 

(P,e)  _  6d  diT’^/yV  (^) 

_A  A 
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using  the  formulas  for  the  volume  and  surface  area 
of  higher-dimensional  balls  and  spheres.  This  last 
expression  is  a  bound  on  the  length  of  p  since  e 
was  in  fact  an  arbitrary  unit  vector  and  therefore 
can  be  substituted  by  p/||p||.  The  values  r(3/2)  = 
a/^/2,  r(2)  =  1,  and  r(5/2)  =  3y^/4  give  the 
claimed  quaUty  bounds  for  d  =  2  and  d  =  3.  The 
quotient  of  the  two  F-functions  is  between  \/dj2 
and  \/{d-\-  l)/2,  and  this  gives  the  general  bound. 

□ 

From  this  proof  one  can  see  how  to  construct 
an  example  showing  that  the  bound  cannot  be 
improved:  must  be  divided  into  two  half¬ 

spheres,  and  /taIu)  —  /in(u)  will  ideally  always  be 
equal  to  or  — depending  on  the  half-sphere  in 
which  u  lies.  Figure  3  shows  two  two-dimensional 


Figure  3:  ||s(A5)  -  5(5{)||  is  close  to  4/7r  •  6. 

point  sets  and  B^.  is  just  a  circle  of  radius  r, 
and  Bi  consists  of  a  “distorted”  circle  and  an  ad¬ 
ditional  point.  If  we  allow  to  apply  any  similarity 
to  Bs  in  order  to  minimize  the  Hausdorff  distance 
from  Ai,  the  optimal  position  is  as  shown  in  Fig¬ 
ure  3,  and  the  Hausdorff  distance  is  6.  The  distance 
of  the  Steiner  point  s[Bi)  from  the  center  of 

the  circle  can  be  calculated  as  2/7r  •  ^/rS  arccos 
which  approaches  A/tt  ■  6  as  r  goes  to  oo.  If  one 
lets  the  two  Steiner  points  coincide,  the  Hausdorff 
distance  rises  by  this  amount,  showing  that  l-fd/yr 
is  indeed  the  loss  factor  of  Algorithm  T.  Since 
As  is  rotation-symmetric,  this  holds  also  for  Algo¬ 
rithm  R.  The  above  construction  generalizes  easily 
to  higher  dimensions. 

The  following  theorem  is  well-known  [G,S]. 

Theorem  5  The  Steiner  point  of  a  convex  poly¬ 
tope  is  the  weighted  sum  of  its  vertices,  where  the 
weight  of  vertex  v  is  that  fraction  of  the  surface  of 
the  unit  sphere  that  lies  between  the  unit  vectors 


normal  to  the  hyperplanes  meeting  at  v  (the  nor¬ 
malized  exterior  angle  at  v).  (See  Figure  f  for  a 
two-dimensional  example.) 

For  smooth  convex  bodies,  the  Steiner  point  can  al¬ 
so  be  defined  as  the  centroid  of  a  non-uniform  mass 
distribution  on  the  boundary,  where  the  density  is 
the  (Gaussian)  curvature. 


Figure  4:  The  weight  of  vertex  u  is 

Combining  Theorems  1,  4,  and  5  we  get: 

Theorem  6  Let  A  and  B  be  sets  of  n  and  m  line 
segments  in  d  =  2  dimensions  or  n  and  m  triangles 
in  d  =  3  dimensions.  Then  pseudooptimal  match¬ 
ings  can  be  found  for  A  and  B  applying  the  cor¬ 
responding  algorithms  of  Section  2  as  indicated  in 
the  following  table. 


T 

running  time 

loss  factor 

translations 

d  =  2 

0  ((n  +  m)  log(n  -f  m)) 

4/7r  -f  1 

d  =  3 

0(H(n,  m)) 

2.5 

rigid  motions 

d  =  2 

0  (nm  log(rim)  log*  (nm)) 

4/7r  -f  1 

d  =  3 

0  ((nm)^ H (n,  m)) 

2.5 

similarities 

d  =  2 

0  (nm  \og(nm)  log*  (nm)) 

4/^  +  3 

d  =  3 

0((nm)^H(n,  m)) 

4.5 

Proof:  For  the  proof  note  that  the  Steiner  point 
for  a  convex  polygon  or  polytope  can  be  comput¬ 
ed  in  linear  time  because  of  Theorem  5,  after  the 
convex  huUs  have  been  constructed  in  O(nlogn  -f- 
mlogm)  time.  The  bound  of  0(H(n,m))  (as  in 
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Figure  5:  A  lower  bound  for  the  quality  of  a  reference  point. 


Theorem  1)  for  translations  in  three  dimensions 
comes  from  the  final  computation  of  the  Hausdorff 
distance.  Just  finding  the  pseudooptimal  transla¬ 
tion  takes  only  O(nlogn  -|-  mlogm)  time.  EO 

4  Lower  Bounds 

For  translations  in  two  dimensions  the  following 
lower  bound  on  the  quality  of  reference  points  can 
be  shown: 

Theorem  7  The  quality  c  of  any  reference  point 
for  translations  (Algorithm  T)  in  two  (or  higher) 
dimensions  cannot  be  better  (i.e.,  smaller)  than 

V473. 

Proof:  We  consider  three  point  sets  (see  the  upper 
part  of  Figure  5):  A  circle  C  with  center  o  and  ra¬ 
dius  5;  a  keyhole- shaped  figure  K  with  two  circular 


arcs  of  radii  2  and  8  and  opening  angles  of  Stt/S  and 
7r/3,  respectively,  and  with  an  additional  point  in¬ 
side;  and  finally  an  equilateral  triangle  A  =  915293 
with  side  length  10.  The  optimal  matching  between 
C  and  K  and  between  K  and  A  is  shown  in  the 
lower  part  of  Figure  5.  It  is  obtained  by  super¬ 
imposing  the  points  o  and  p  (marked  by  crosses) 
or  the  points  p  and  91,  respectively.  The  dimen¬ 
sions  given  in  the  lower  middle  part  of  Figure  5 
also  exhibit  the  position  of  the  extra  point  in  the 
keyhole  K. 

Let  us  first  assume  that  the  reference  point  s(C') 
of  the  circle  is  its  center  o.  Since  6h(C,K)  =  3, 
the  reference  point  s(K)  of  K  must  lie  in  a  circle 
of  radius  3c  around  p.  Similarly,  s(A)  must  lie  in 
a  circle  of  radius  2c  around  s(K),  i.e.,  in  a  circle 
of  radius  5c  around  91.  If  we  turn  K  by  120“  in 
both  ways,  we  can  conclude  in  a  similar  way  that 
s(A)  must  lie  in  circles  of  radius  5c  around  92  and 


around  q^.  By  equivariance  with  respect  to  trans¬ 
lations,  the  triangle  A  has  only  a  single  reference 
point,  and  therefore  the  three  circles  must  inter¬ 
sect,  as  shown  in  the  lower  right  part  of  Figure  5. 
This  means  that  c  must  be  at  least  a/4/3  «  1.155. 

If  the  reference  point  s(C)  is  not  the  center  o,  the 
only  difference  is  that  the  centers  of  the  final  three 
circles  wiU  be  translated  by  the  respective  amount. 

□ 

The  above  construction  can  be  generaUzed  to  d 
dimensions  to  give  a  lower  bound  of  y/2d/{d  -f  1) 
on  the  quabty  c. 

Comparing  a/4/3  «  1.155  with  the  upper  bound 
of  4/7r  w  1.27  for  the  Steiner  point,  we  note  that 
the  gap  is  quite  small.  We  are  quite  sure  that  the 
lower  bound  can  be  improved,  and  we  believe  that 
the  Steiner  point  is  the  optimal  reference  point  with 
the  best  quality  guarantee. 
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Piecewise-Linear  Interpolation  between  Polygonal  Slices* 


Gill  Barequet^  Micha  Sharir^ 


Abstract 

In  this  paper  we  present  a  new  technique  for  piecewise- 
linear  surface  reconstruction  from  a  series  of  parallel 
polygonal  cross-sections.  This  is  an  important  mrob- 
fem'  in  medical  imaging,  surface  reconstruction  from 
topographic  data,  and  other  applications.  We  reduce 
the  problem,  as  in  most  previous  works,  to  a  series  of 
problems  of  piecewise-linear  interpoption  between  each 
pair  of  successive  slices.  Our  algorithm  uses  a  partial 
curve  matching  technique  for  matchiM  parts  of  the  con¬ 
tours,  an  optimal  triangulation  of  3-D  polygons  for  re¬ 
solving  the  unmatched  parts,  and  a  minimum  spanning 
tree  heuristic  for  interpolating  between  non  simply  con¬ 
nected  regions.  Unlike  previous  attempts  at  solving  this 
problem,  our  algorithm  seems  to  handle  successfully  any 
kind  of  data.  It  allows  multiple  contours  in  each  slice, 
with  any  hierarchy  of  contour  nesting,  and  avoids  the 
introduction  of  counter-intuitive  bridges  between  con- 
tours,  proposed  in  some  earlier  papers  to  handle  inter- 
polation  between  multiply  connected  regions.  Experi¬ 
mental  results  on  various  complex  examples,  involving 
actual  medical  imaging  data,  are  presented,  and  show 
the  good  and  robust  performance  of  our  algorithm. 

Keywords:  surface  reconstruction,  tiling,  surface  fit¬ 
ting,  Branching  surfaces,  slice  interpolation,  triangula- 
tion,  dynamic  programming,  geometric  hashing,  curve 
matching,  polyhedra. 


1  Introduction 


The  problem  of  reconstructing  the  boundary  of  a  solid 
object  from  a  series  of  parallel  planar  cross-sections  h^ 
attracted  much  attention  in  the  literature  during  the 
past  two  decades.  The  main  rriotivation  for  this  prob¬ 
lem  comes  from  medical  imaging  applications,  where 
cross-sections  of  human  organs,  such  as  bones,  tumors 
and  tissues,  are  obtained  by  CT  (Computerized  Tomog¬ 
raphy)  or  MRI  (Magnetic  Resonance  Imaging)  appa- 
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rata.  These  cross-sections,  hereafter  called  slices,  are 
the  basis  for  interpolating  the  boundary  surface  of  the 
organ.  The  interpolated  object  can  then  be  displayed 
in  graphics  applications,  or  (more  recently)  even  manu¬ 
factured  by  an  NC  (Numerically  Controlled)  or  an  RP 
(Rapid  Prototyping)  machine.  Another  motivation  for 
this  problem  is  the  non-destructive  digitization  of  ob¬ 
jects;  after  an  object  is  scanned  by  an  echo-graphic  or 
an  X-ray  apparatus,  the  obtained  slices  are  used  for  the 
reconstruction  of  the  original  object.  Yet  another  rno- 
tivation  is  the  reconstruction  of  a  3-dimensional  model 
of  a  terrain  from  topographic  elevation  contours. 

Many  solutions  were  suggested  for  the  pure  raster  in¬ 
terpolation.  These  usually  handle  two  raster  linages 
where  each  pixel  is  either  white  or  black,  or  assigned 
a  grey-level  taken  from  a  fixed  range.  The  interpola¬ 
tion  produces  one  or  more  intermediate  raster  images, 
which  smoothly  and  locally  turn  the  first  image  into  the 
second  one.  Then,  the  bounding  surface  is  detected  us¬ 
ing  other  methods,  such  as  edge  detection  techniques, 
for  identifying  places  of  transition  from  the  inside  to 
the  outside  of  the  object.  In  the  grey  level  case,  these 
methods  include  some  thresholding  mechamsrn  which 
decides  which  levels  are  ‘inside’  the  object  and  which 
are  not.  Cline  et  al.  [6]  attempted  to  convert  directly 
the  voxel  data  into  a  polyhedral  surface,  su^esting  the 
TTiarching  cubes  technique,  which  producecf  very  small 
triangles  whose  size  was  roughly  the  same  as  that  of  the 
input  voxels. 

Many  other  solutions,  including  the  approach  taken 
in  this  paper,  assume  that  the  interpolation  is  preceded 
by  an  edge-detection  process,  which  is  invoked  for  each 
of  the  slices.  Thus,  each  slice  is  then  assumed  to  be  rep¬ 
resented  by  a  hierarchy  of  non-crossing  contours,  each 
being  a  closed  simple  Jordan  curve,  which  represent  the 
boundaries  between  “material”  and  “non-inaterial  ar¬ 
eas;  in  general,  the  depth  and  breadth  of  this  hierarchy 
is  not  restricted,  and  a  contour  may  enclose  any  num¬ 
ber  of  other  contours,  which  themselves  may  enclose 
other  contours,  and  so  on.  In  practice,  each  contour  is 
given  as  a  discrete  circular  sequence  of  points  along  it 
and  we  can  thus  regard  it  as  a  simple  closed  polygonal 
curve,  whose  vertices  are  the  given  points.  Finally,  we 
may  also  assume  that  the  exterior,  unbounded  region  in 
each  planar  slice  represents  “non-material”  (the  model 
is  assumed  to  be  bounded). 

Thus  the  problem  that  we  face  is;  Given  a  series  of 
parallel  planar  slices  (which  we  will  assume  to  be  par¬ 
allel  to  the  aiy-plane),  each  consisting  of  a  collection 
of  non-crossing,  but  possibly  nested,  closed  and  simple 
polygonal  curves,  with  the  above  properties,  we  want 
to  reconstruct  a  polyhedral  solid  model  whose  cross 
sections  along  the  given  planes  coincide  with  the  in¬ 
put  slices.  A  natural  simplification  of  the  problem,  also 
taken  in  most  earlier  works,  is  to  consider  only  a  single 
pair  of  successive  parallel  slices,  and  to  construct  a  solid 
model  within  the  layer  delimited  by  the  planes  of  the 
slices,  which  interpolates  between  the  given  slices.  The 
union  (or,  rather,  concatenation)  of  these  models  will 
give  us  a  solution  model  for  the  full  problem. 

Before  continuing,  we  should  remark  that  the  solution 
is  not  uniquely  detoed,  and  the  measure  of  'goodness 


93 


(a)  Wrap-around 
tiling 


(b)  Forced  contour  (c)  Separata  treatment 
association  of  non-overlapping 

contours 


Figure  1:  Contour  association  and  tiling 


of  a  proposed  solution  is  rather  subjective  and  intu¬ 
itive.  Of  course,  if  each  of  the  two  slices  consists  of  a 
single  contour,  and  these  contours  roughly  ‘sit  above 
each  other’,  then  we  expect  the  solution  to  be  a  single 
'drum-like’  polytope  whose  boundary  consists  of  a  cir¬ 
cular  sequence  of  triangles  ‘wrapping  around’  the  two 
contours;  see  Figure  1(a).  However,  even  in  the  simple 
case  of  one  contour  in  each  slice,  if  the  sy-projections  of 
the  two  contours  are  far  away  from  each  other,  it  is  not 
clear  which  is  a  better  solution:  to  construct  a  highly 
slanted  'pipe’  that  connects  between  these  contours  (as 
in  Figure  1(b)),  or  to  regard  the  lower  contour  as  the 
top  cover  of  some  pillar-like  solid,  and  the  upper  con¬ 
tour  as  the  bottom  cover  of  another  pillar-like  solid  (as 
in  Figure  1(c)).  The  choice  of  a  solution  can  become 
much  more  arbitrary  in  more  involved  cases. 

We  first  briefly  review  the  fairly  extensive  literature 
on  this  problem.  Most  of  the  earlier  works  only  stud¬ 
ied  the  variant  where  each  slice  contains  only  one  con¬ 
tour.  In  the  seq^uel  we  denote  it  as  the  one-to-one  case, 
as  opposed  to  the  one-io-many  and  the  many-io-many 
cases.  These  studies  either  sought  a  global  optimization 
of  some  objective  function  or  settled  with  a  local  tiling- 
advancing  rule,  after  the  tiling  starting  points  at  the  two 
contours  were  somehow  determined,  e.g.  the  closest  pair 
of  vertices  between  the  contours. 

In  the  full  version  we  review  these  works  in  more  de¬ 
tail,  but  since  they  are  all  iiiferior  to  our  solution,  in 
that  they  cannot  handle  arbitrary  data,  we  omit  this 
review  here.  These  works  are  by  Keppel  [19],  Fuchs  et 
al.  [9],  Sloan  and  Painter  [28],  Cook  et  al.  [7],  Chris¬ 
tiansen  and  Sederberg  [5],  Shantz  [25],  Batnitzky  et  al. 
[2],  Sloan  and  Hrechanyk  [26],  Ganapathy  and  Dennehy 
[10],  and  by  Wang  and  Aggarwal  [29].  A  good  survey  on 
all  the  works  cited  so  far  is  given  by  Sloan  and  Painter 
[27].  They  decompose  each  method  into  its  building 
blocks,  and  compared  the  various  methods  accordingly. 
They  also  describe  a  testbed  for  evaluating  and  com¬ 
paring  these  techniques. 

Zyda,  Jones  and  Hogan  [32]  made  an  attempt  to  han¬ 
dle  the  many-to-many  case,  but  their  method  could  not 
handle  any  branching  cases,  and  produced  unsatisfac¬ 
tory  results  for  partially  overlapping  contours,  as  in  Fig¬ 
ure  4.  Other  limitations  involved  specific  geometries. 
However,  they  suggested  solutions  for  these  limitations, 
which  might  help  m  certain  cases,  but  which  required 
some  interaction  with  the  user. 

Boissonnat  [3]  presented  a  totally  different  approach. 
He  constructed  the  Delaunay  triangulation  for  each 
slice,  projected  one  triangulation  onto  the  other,  and 
obtained  a  collection  of  tetrahedra,  aiming  to  maximize 
the  sum  of  their  volumes.  This  was  a  considerable  step 
towards  handling  the  case  where  each  slice  has  multiple 


contours.  Boissonnat  mentioned  three  typical  examples 
where  his  standard  method  failed  to  produce  good  re¬ 
sults,  thus  requiring  special  treatment.  The  first  exam¬ 
ple  contained  two  overlapping  contours  but  with  con¬ 
siderable  differences  in  their  geometry.  The  second  ex¬ 
ample  consisted  of  two  similar  contours,  where  one  of 
them  also  contained  a  hole  polygon.  And  the  last  ex¬ 
ample  showed  a  branching  problem  but  without  contour 
overlaps.  Our  algorithm  handles  successfully  (and  eas¬ 
ily)  all  these  ‘bad’  examples.  Boissonnat  suggested  a 
correction  scheme,  which  either  changed  the  geometry 
of  one  of  the  slices,  or  constructed  one  or  two  interme¬ 
diate  slices  between  the  original  ones.  These  idea  were 
further  developed  by  Boissonnat  and  Geiger  ([4],  [11]). 

There  have  been  a  few  other  recent  works  that  also 
attempted  to  handle  the  more  general  cases;  these  are 
by  Kehtarnavaz  and  De  Figueiredo  [17],  Kehtarnavaz. 
Simar  and  De  Figueiredo  [18],  Ekoule,  Peyrin  and  Odet 
[8],  Meyers,  Skinner  and  Sloan  [22],  and  by  Welzl  and 
Wolfers  [30].  However,  all  these  treatments  suffered 
from  a  variety  of  problems,  treated  only  certain  re¬ 
stricted  cases,  and  worked  only  in  favorable  situations 
where  the  two  slices  closely  resemble  each  other. 

Finally,  Gitlin,  O’Rourke  and  Subramanian  [12]  prove 
that  it  is  not  always  possible  to  find  an  interpolating 
polyhedron  between  two  polygons,  that  lie  in  two  par¬ 
allel  planes.  That  is,  any  attempted  interpolation  pro¬ 
duces  a  self-intersecting  surface.  This  result  holds  only 
when  the  interpolating  triangles  are  all  assumed  to  con¬ 
nect  between  the  two  polygons  vertices,  i.e.  to  have  two 
vertices  taken  from  one  polygon  and  the  third  from  the 
other  polygon.  We  do  not  assume  this  in  our  approach, 
and  we  indeed  get  a  non-intersecting  interpolation  when 
we  apply  our  algorithm  on  their  example. 

Two  cornprehensive  reviews  of  many  of  the  works  on 
reconstructing  3D  objects  from  cross-sections  are  given 
by  Schumaker  [23]  and  by  Hagen,  Miiller  and  Nielson 
[14].  The  first  review  is  not  restricted  to  piecewise-linear 
constructions  based  on  polygonal  slices,  but  also  refers 
to  parametric  and  tensor  representations  of  contours, 
and  describes  volumetric  and  surface  approaches. 

We  propose  a  new  approach  to  the  interpolation  prob¬ 
lem.  Our  alMrithm  handles  well  slices  with  multiple 
contours,  ancfdoes  not  rely  on  any  resemblance  between 
the  slices.  We  accept  slices  which  contain  anv  num¬ 
ber  of  contours,  arbitrarily  different  in  their  geometries. 
The  xy-projections  of  contours  of  different  slices  may 
arbitrarily  overlap;  we  do  not  make  any  distinction  in 
the  treatment  of  contours  which  fully  overlap,  partiallv 
overlap  or  do  not  overlap  at  all.  Many  of  the  previous 
works,  such  as  [19],  [9],  [26]  and  [10],  either  prohibit 
the  creation  of  triangles  in  the  same  slice,  or  specifi¬ 
cally  define  steps  where  this  action  is  allowed.  We  do 
not  make  a  distinction  between  triangles  which  connect 
the  two  slices  and  those  which  totally  lie  wdthin  a  sin¬ 
gle  slice.  Generally  speaking,  our  algorithm  is  based  on 
an  analysis  of  the  resemblance  between  the  t\vo  slices. 
Thus,  we  separately  treat  similar  contour  portions  that 
are  matched  between  the  two  slices,  and  then  treat  the 
remaining  portions  that  do  not  match.  We  refrain  from 
creating  artificial  bridges  between  contours  that  might 
conflict  with  the  geometry  of  the  other  slice,  as  already 
noted  in  [25]  and  others.  In  the  only  case  where  we  con¬ 
struct  such  bridges,  they  are  guaranteed  not  to  conflict 
with  the  geometry  of  the  other  slice,  i.e.  not  to  intersect 
the  projection  of  any  other  contour.  We  do  not  have 
to  introduce  intermediate  slices.  To  recap,  our  algo¬ 
rithm  appears  to  overcome  all  the  technical  difficulties 
that  hampered  the  performance  of  previous  solutions, 
it  treats  data  in  full  generality,  and  the  extensive  ex¬ 
perimentation  that  we  have  conducted  indicates  that  it 
performs  very  well  on  complicated  large-size  data.  We 
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Figure  2:  Matching  contour  portions 


regard  our  algorithm  as  a  significant  step  in  the  solution 
of  this  problem.  For  an  illustration  of  the  performance 
of  our  algorithm  on  real-life  examples,  see  Figures  17 
and  18. 

Here  is  a  brief  overview  of  our  algorithm.  We  first 
match  similar  contour  portions  between  the  two  slices 
(e.g.  the  upper  arcs  pq  and  p'q'  in  Figure  2).  Then  we 
‘stitch’  (or  ‘tile’)  each  pair  of  matched  contour  portions 
by  a  sequence  of  adjacent  triangles  forming  a  ‘band’  be¬ 
tween  the  portions.  With  some  care,  if  we  take  the  union 
of  the  original  contours  and  new  bands,  and  cancel  out 
duplicate  edges,  we  obtain  a  collection  of  closed  spatial 
polygonal  curves,  each  of  which  may  be  composed  of 
pieces  of  contours  on  both  slices  and  of  some  edges  of 
the  connecting  triangles.  Moreover  our  matching  pro¬ 
cedure  essentially  guarantees  that  the  ay-projections  of 
these  curves  are  pairwise  disjoint,  although  they  may 
be  nested  within  each  other.  If  no  nesting  occurs,  we 
simply  triangulate  each  of  these  spatial  polygons,  us¬ 
ing  a  simple  dynamic  programming  approach,  which 
roughly  aims  to  minimize  the  total  area  of  the  tnangula- 
tion.  If  nesting  occurs,  we  take  one  polygon  P  with  all 
polygons  Pi, . .  .,Pk  whose  ay-proiections  are  directly 
nested  within  that  of  P,  and  apply  a  minimum  span¬ 
ning  tree  procedure  that  introduces  edges  coniiecting 
between  these  polygons  and  yielding  an  ay-projection 
which  is  simply  connected,  so  we  can  then  proceed  to 
triangulate  the' resulting  polygonal  curve,  as  above.  See 
Figure  3  for  an  illustration.  More  details  of  all  these 
steps  will  be  given  later  in  the  paper. 

For  the  purpose  of  identifying  matching  portions  of 
the  contours  we  use  a  partial  curve  matching  technique, 
which  was  first  suggested  by  Kalvin  et  al.  [16]  and 
by  Schwartz  and  Sharir  [24].  This  technique,  which 
uses  the  so-called  Geometric  Hashing  method,  originally 
solved  the  following  curve  matching  problem:  Given  two 
curves  in  the  plane,  such  that  one  is  a  (slight  deforma¬ 
tion  of  a)  proper  subcurve  of  the  other,  find  the  trans¬ 
lation  and  rotation  of  the  subcurve  that  yields  the  best 
least-squares  fit  to  the  appropriate  portion  of  the  longer 
curve. 

This  technique  was  extended  and  used  in  computer 
vision  for  automatic  identification  of  partially  obscured 
objects  in  two  or  three  dimensions,  an  important  prob¬ 
lem  in  robotics  applications  of  computer  vision,  which 
has  attracted  much  attention;  see  Hong  and  Wolfson 
[15],  Wolfson  [31],  and  Kishon,  Hastie  and  Wolfson  [20]. 

A  simplified  variant  of  this  technique  has  recently 
been  used  by  Barequet  and  Sharir  [l]  for  the  totally 
different  problem  of  detecting  and  repairing  gaps  in  the 
boundary  of  a  polyhedron,  a  problem  which  often  arises 
in  the  creation  of  polyhedral  approximations  of  CAD 
models.  In  the  variant  used  in  [1],  as  well  as  the  one 
used  in  this  paper,  no  motion  of  one  curve  relative  to 
the  other  is  allowed,  so  the  technique  becomes  consid¬ 
erably  simpler. 

The  paper  is  organized  as  follows.  In  Section  2  we  give 
a  more  precise  definition  of  the  problem  and  present 
an  overview  of  the  algorithm  (more  detailed  than  the 


one  given  above).  The  later  sections  describe  in  detail 
certain  phases  of  the  algorithm.  Section  3  describes  the 
matching  of  contour  portions,  and  Section  4  describes 
the  actual  surface  reconstruction.  In  Section  5  we  briefly 
analyze  the  complexity  of  the  algorithm,  and  present 
experimental  results.  We  end  in  Section  6  with  some 
concluding  remarks. 


2  Overview  of  the  Algorithm 

The  input  to  the  algorithm,  as  described  in  the  Intro¬ 
duction,  is  a  pair  of  parallel  planar  slices  parallel  to  the 
xy-plane,  each  slice  consisting  of  a  list  of  closed  and 
simple  polygonal  contours,  which  do  not  intersect  each 
other.  The  containment  hierarchy  of  the  contours  niay 
be  omitted;  in  this  case  we  compute  it  ourselves,  using 
a  simple  line-sweep  technique.  Contours  of  even  nest¬ 
ing  level  (starting  at  0,  which  labels  external  contours) 
are  such  that  their  interior,  in  a  sufficiently  small  neigh¬ 
borhood  of  the  contour,  is  the  “material”,  and  contours 
of  odd  level  are  those  whose  interior,  sufficiently  near 
them,  is  the  “non-material”.  We  orient  each  contour  so 
that,  when  viewing  the  contour  from  above,  the  mate¬ 
rial  lies  to  the  right  of  the  contour  (thus  all  even-level 
contours  are  oriented  in  the  clockwise  direction,  when 
viewed  from  above,  and  all  odd-level  contours  are  ori¬ 
ented  in  the  counterclockwise  direction).  If  necessary, 
we  re-orient  the  contours  in  these  consistent  directions. 
We  remark  that  we  need  to  compute  the  contour  hierar¬ 
chy  only  to  obtain  the  consistent  orientation  of  contours; 
the  hierarchy  itself  is  not  used  in  the  algorithm. 

Our  proposed  algorithm  consists  of  the  following 
steps: 

1.  Data  acquisition: 

(a)  Orient  all  the  contours  in  each  slice  in  consistent 
directions,  as  explained  above.  If  the  input  does  not 
include  this  information,  compute  the  contour  nesting 
hierarchy  in  each  slice,  and  use  it  to  obtain  the  desired 
orientations. 

2.  Matching  contour  portions: 

(a)  Discretize  each  contour  polygon  into  a  cyclic  se¬ 
quence  of  vertices,  so  that  the  arc  length  between  each 
pair  of  consecutive  vertices  is  equal  to  some  (small) 
given  parameter. 

(b)  Vote  for  contour  matches.  Each  pair  of  distinct  ver¬ 
tices  of  the  discretized  contours,  one  on  each  slice,  whose 
mutual  horizontal  distance  is  below  some  threshold  pa¬ 
rameter,  contributes  one  vote.  The  vote  is  for  the  match 
between  these  two  contours  with  the  appropriate  shift, 
which  maps  one  of  these  vertices  to  the  other. 

(c)  Transform  the  resulting  votes  into  a  collection  of 
candidates  of  partial  contour  matches. 

3.  Reconstructing  the  surface: 

(a)  Stitch  together  each  pair  of  contour  portions  that 
have  been  matched  in  the  above  step,  by  adding  trian¬ 
gles  which  connect  between  these  portions.  The  new 
triangles  are  oriented  consistently  with  the  contours. 

(b)  Combine  the  remaining  contour  edges  into  spatial 
cycles  (denoted  as  clefts),  obtained  by  taking  the  union 
of  the  contour  edges  in  both  slices  and  of  the  edges  of 
the  stitching  triangles,  and  by  canceling  out  duplicate, 
oppositely-oriented  edges.  When  projected  onto  the  xy- 
plane,  these  cycles  do  not  intersect  each  other.  Find 
their  nesting  hierarchy,  and,  for  each  cleft  C .  construct, 
if  necessary,  a  system  of  straight  ‘bridges’  that  connect 
between  C  and  its  holes  (immediate  children  in  the  hi¬ 
erarchy),  so  as  to  turn  them  into  a  single  cycle,  which 
now  replaces  the  cleft  C . 

(c)  Triangulate  the  resulting  3-D  clefts,  using  a  3-D  min- 
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(c)  Remaining  clefts  (d)  Simplified  clefts 


(e)  Triangulated  clefts  (f)  The  final 

reconstruction 


Figure  3:  The  different  steps  of  our  algorithm 


imum  area  triangulation  technique. 

The  various  steps  of  the  algorithm  are  illustrated  in 
Figure  3.  Figure  3(a)  shows  a  pair  of  slices  in  a  branch¬ 
ing  situation.  The  lower  slice  contains  one  contour  (de¬ 
noted  by  5i),  and  the  upper  slice  contains  three  con¬ 
tours  (denoted  by  52).  Figure  3(b)  shows  the  tiling  of 
the  three  matches  found  between  these  two  slices.  Fig¬ 
ure  3(c)  shows  the  remaining  clefts,  which  form  a  shal¬ 
low  hierarchy  of  nested  polygons.  Figure  3(d)  shows 
the  clefts  after  the  hole  elimination  step,  and  their 
minimum-area  triangulations  are  shown  in  Figure  3(e). 
The  final  surface  reconstruction  is  shown  in  Figure  3(f). 


3  Matching  Contour  Portions 

For  lack  of  space,  we  omit  here  a  detailed  description 
of  the  data  acquisition  step,  which  was  already  briefly 
described  above,  and  is  anyway  fairly  straightforward. 


3.1  Contour  Discretization 

Each  contour  polygon  is  refined  and  discretized  into  a 
cyclic  sequence  of  points.  This  is  done  by  choosing  some 
sufficiently  small  arc  length  parameter  s,  and  by  gen¬ 
erating  equally-spaced  points,  at  distance  s  apart  from 
each  other  (along  the  polygon  boundary).  We  need  to 
choose  s  so  that  it  is  much  smaller  than  (a)  the  length  of 
any  original  edge  of  any  contour,  and  (b)  the  minimum 
straight  distance  between  any  pair  of  contour  points 
which  lie  on  different  contours  (on  the  same  slice)  or 
lie  on  the  same  contour  and  their  distance  along  the 
contour  is  sufficiently  large. 

3.2  Voting  for  Contour  Matches 

We  first  try  to  match  pairs  of  portions  of  contours, 
where  each  portion  in  a  pair  belongs  to  a  different 


slice.  These  matches  aim  to  detect  regions  of  simi¬ 
larity  between  the  boundaries  of  the  interpolating  ob¬ 
ject  along  the  two  slices.  Naturally,  contour  portions 
which  are  similar  in  the  two  slices  must  have  similar  se¬ 
quences  of  footprints.  Thus,  our  next  goal  is  to  search 
for  pairs  of  sufficiently  long  subsequences  that  closely 
match  each  other.  In  our  approach,  two  subsequences 
and  [qj , .  . . ,  are  said  to  closely 

match  each  other,  if,  for  some  chosen  parameter  c  >  0. 
the  number  of  indices  k  for  which  Hpi+i  —  1j+k\\  <  s 
is  sufficiently  close  to  £.  (Here  the  norm  ||  •  ||  is  the 
Euclidean  distance  between  the  sp-projections  of  the 
points.)  We  perform  the  following  voting  process,  where 
votes  are  given  to  good  point-to-point  matches. 

The  contours  are  given  as  cyclic  ordered  sequences 
of  vertices,  and  we  break  each  of  them  arbitrarily  to 
make  it  into  a  linear  sequence.  All  the  vertices  of 
the  lower  slice  are  preprocessed  for  range- searching,  so 
that,  for  each  vertex  v  of  the  upper  slice,  we  can  ef¬ 
ficiently  locate  all  the  vertices  of  the  lower  slice  that 
lie  in  some  e-neighborhood  of  (the  xp-projections  of)  v. 
We  have  used  a  simple  heuristic  projection  method  for 
the  range  searching,  that  reduces  the  search  to  a  pair 
of  1-dimensional  searches;  this  may  be  inefficient  in  the 
worst  case,  but  works  very  well  in  practice. 

The  positions  along  a  contour  sequence  c,  whose 
length  IS  ic,  are  numbered  from  0  to  —  1.  Assume 
that  the  querying  vertex  v  is  in  position  i  of  contour 
sequence  Cj.  Then,  each  vertex  retrieved  by  the  query, 
which  is  in  position  7'  in  contour  sequence  C2,  contributes 
a  vote  for  the  match  between  contours  ci  and  C2  with  a 
shift  equal  to  (j  —  i)  (mod  £c^). 

Obviously,  matches  between  long  portions  of  contours 
are  reflected  by  a  large  number  of  votes  for  the  appro¬ 
priate  shift  between  the  matching  contours.  We  thus 
collect  the  shifts  with  a  large  nurnber  of  votes,  identify 
(roughly)  the  starting  and  ending  points  of  the  corre¬ 
sponding  match,  and  possibly  also  remove  (small)  por¬ 
tions  of  the  matches,  near  their  endpoints,  which  are 
also  shared  by  other  matches. 

Note  that  this  procedure  will  only  match  contour  por¬ 
tions  that  are  consistently  oriented.  This  is  important, 
since  it  guarantees  that  the  material  region  lies  on  the 
same  side  of  the  two  contours,  thus  implying  that  if 
we  stitch  the  gap  between  these  two  contours,  we  get 
a  proper  portion  of  the  boundary  of  a  possible  inter¬ 
polating  solid.  Also  note  that  the  e  parameter  for  the 
range-searching  queries  is  not  a  function  of  the  input. 
It  is  rather  our  a  priori  estimation  of  the  physical  size 
of  the  difference  between  similar  contour  portions  of  the 
two  slices. 

In  some  cases  small  portions  of  the  contours  are  in¬ 
cluded  in  more  than  one  candidate  match.  This  usu¬ 
ally  happens  at  the  connection  between  two  different 
matches,  involving  the  same  contour  on  one  slice  and 
different  contours  on  the  other.  We  simply  eliminate 
those  portions  common  to  more  than  one  significant 
candidate  match. 


3.3  Accepting  Match  Candidates 

Each  match  is  given  a  score.  The  score  may  also  reflect, 
in  addition  to  the  number  of  votes  for  the  appropriate 
shift,  other  quality  measures  (such  as  the  closeness  of 
the  vertices  on  the  two  matching  contour  portions).  Our 
setting  of  the  scoring  function  is  described  in  Section  5. 

A  crucial  property  of  the  matches,  which  we  have  to 
achieve  when  tuning  the  parameters  which  control  the 
detection  of  matches,  is  that  every  intersection  between 
the  s;y-projections  of  any  pair  of  contours,  one  from  each 
slice,  will  lead  to  a  match  between  the  corresponding 
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Figure  4:  Intersecting  contours  with  no  long  matching 

portions  Figure  5:  Tiling  a  match 


contours,  although  it  might  be  very  short.  This  will  en¬ 
sure  that  there  will  be  at  least  one  match  between  each 
pair  of  overlapping  contours  (namely,  contours  with  in¬ 
tersecting  2;y-projections),  even  if  the  relative  amount 
of  overlap  is  very  small.  The  reason  for  this  is  given 
shortly  below. 

We  achieve  this  by  accepting  very  short  match  can¬ 
didates  (even  of  length  2  or  3),  as  long  as  their  quality 
fits  our  bounds.  We  have  to  make  sure  that  the  dis¬ 
cretization  parameter  is  sufficiently  small  with  respect 
to  the  voting  threshold  and  with  respect  to  the  smallest 
contour  feature  size,  as  described  above,  in  order  not 
to  miss  contour  intersections.  An  extreme  exarnple  is 
shown  in  Figure  4,  where  four  short  matches  are  indeed 
found  between  the  two  contours. 

4  Reconstructing  the  Surface 


contour  1 


(a)  Original  contours  (b)  Aft«r  matching  and  tiling 

Figure  6:  A  short  match  and  the  clefts  near  an  inter¬ 
section  of  contours 


of  two  sorted  lists,  and  turns  out  to  produce  reason¬ 
ably  looking  triangulated  boundary  patches  between  the 
matched  contour  portions.  Figure  5  shows  such  a  tri¬ 
angulation.  Alternative  advancing  rules  were  described 
by  Christiansen  and  Sederberg  [5],  and  by  Ganapathy 
and  Dennehy  [10]. 


4.1  Stitching  the  Matches 


Each  match  consists  of  two  directed  polygonal  chains, 
whose  Ku-projections  are  very  close  to  each  other.  We 
arbitrarily  choose  one  end  of  the  match,  and  ‘merge’ 
the  two  chains  as  if  they  were  sorted  lists  of  numbers. 
In  each  step  of  the  merge  we  have  pointers  to  the  cur¬ 
rent  vertices,  u  and  u,  in  the  two  chains,  and  make  a 
decision  as  to  which  chain  should  be  advanced,  say  v  ad¬ 
vances  to  a  new  vertex  w.  Then  we  add  the  new  triangle 
Luvw  to  the  boundary  of  the  constructed  polyhedron, 
and  advance  the  current  vertex  (from  v  to  w)  along  the 


appropriate  chain.  When  we  reach  the  last  vertex  of 
one  chain,  we  may  further  advance  only  the  other  one. 
This  process  terminates  when  we  reach  the  last  vertices 
of  both  chains. 

The  triangles  that  we  create  are  oriented  consistently 
with  the  contours.  To  achieve  this,  we  invert  the  orien¬ 
tation  of  all  contour  polygons  of  one  slice,  say  the  lower 
one,  and  orient  each  triangle  so  that  the  edge  it  shares 
with  a  contour  is  oriented  oppositely  to  the  contour; 
see  Figure  5  for  an  illustration  of  the  directions  of  the 
tiling  triangles.  This  orientation  guarantees  that  the 
reconstructed  solid  boundary  is  oriented  in  a  consistent 
manner. 

Several  advancing  rules  were  examined,  and  the  fol¬ 
lowing  simplest  one  proved  itself  the  best.  Assume 
that  the  current  vertices  of  the  two  chains  are  vj  and 
u|,  which  are  followed  by  and  respectively. 

Then,  if  +  we  ad¬ 

vance  the  first  chain;  otherwise  we  advance  the  second 
chainF  This  bears  close  resemblance  to  the  merging 


'That  is,  we  advance  so  that  the  newly  added  triangle  has 
smaller  perimeter;  actually,  for  program  efficiency,  we  eventually 
used  the  squares  of  the  distances,  with  equally  good  results. 


4.2  Filling  the  Clefts 

After  tiling  the  matching  contour  portions.  \ye  remain 
with  the  unmatched  portions.  These,  combined  with 
the  extreme  edges  of  the  tiling  sequences,  form  a  collec¬ 
tion  of  closed  3-D  polygons,  which  we  refer  to  as  clefts. 
Finding  these  clefts  is  straightforward.  First,  recall  that 
we  have  already  inverted  the  orientation  of  all  contour 
polygons  of  the  lower  slice.  Consider  the  contour  poly¬ 
gons  of  the  two  slices  (the  inverted  lower  slice  and  the 
upper  slice),  as  well  as  the  tiling  triangles,  as  the  for¬ 
mal  sum  of  their  directed  edges,  and  add  up  all  these 
polygons,  with  the  convention  that  e  +  (— ^  =  0.  After 
these  cancellations,  the  resulting  sum  consists  qf  all  the 
desired  clefts.  Since  each  polygon  that  participates  in 
the  formal  sum  is  a  directed  cycle,  the  resulting  sum 
is  easily  seen  to  represent  a  collection  of  pairwise  edge- 
disjoint  directed  cycles. 

This  collection  of  3-D  polygonal  cleft  cycles  has  the 
property  that,  when  projected  onto  the  xy-plane,  no 
two  cycles  intersect.  This  is  because  we  have  already 
detected  all  the  projected  contour  intersections  in  the 
matching  phase,  and  because  the  extreme  tiling  edges 
in  the  triangulation  of  a  match  are  likely  to  degenerate 
or  almost  degenerate  in  the  projection.  This  situation 
is  illustrated  in  Figure  6.  The  only  vyay  this  property 
can  be  violated  is  when  an  extreme  tiling  edge  crosses 
another  edge  of  a  contour.  For  this  to  happen,  either 
the  extreme  edge  must  be  highly  slanted,  or  the  con¬ 
tour  must  have  very  sharp  turns.  This  can  be  avoided 
by  an  appropriate  fine  tuning  of  the  discretization  and 
matching  threshold  parameters,  and  in  any  case  we  did 
not  face  such  a  situation  in  all  of  our  comprehensive 
experiments  (which  involved  fairly  complex  and  rather 
‘adversary’  data).  To  recap,  while  in  rare,  worst-case 


(a)  Original  contour 
orientations 


(b)  Claft  cycle 
orientations 


Figure  7:  Cleft  orientations 


(a)  An  acceptable  (b)  A  conflicting 
bridge  bridge 

Figure  8;  Bridges  in  simple  branching  cases 


scenarios,  projected  cleft  cycles  might  intersect,  we  will 
assume  in  what  follows  that  this  does  not  occur. 

The  sy-proiections  of  these  cleft  polygons  might 
again  form  a  hierarchy  of  polygon  nesting.  We  check 
this  possibility  by  invoking  again  the  same  une-sweeping 
procedure  used  in  the  data  acquisition  step.  This  time, 
all  the  polygons  are  guaranteecl  to  be  correctly  oriented. 
Figure  7  illustrates  this  situation.  Figure  7(a)  shows  a 
branching  case,  where  the  lower  slice  contains  one  con¬ 
tour,  and  the  upper  slice  contains  two  contours.  Af¬ 
ter  tiling  the  detected  rnatch,  inverting  the  lower  slice, 
and  canceling  out  opposite  edge  occurrences,  two  nested 
cleft  cycles  remain,  already  oriented  consistently,  as 
shown  as  Figure  7(b). 

Let  C  be  a  cleft  whose  immediate  children  in  the  hier¬ 
archy  are  Cl, . . . ,  Ci.  We  define  an  undirected  weighted 
complete  graph  G,  so  that  each  vertex  of  G  is  one  of 
these  k  +1  cycles,  and  the  weight  of  an  edge  connect¬ 
ing  two  cycles  is  the  minimum  distance  between  the 
a;j/-projections  of  these  cycles.  (We  assume  that  our 
contour  discretization  is  dense  enough,  so  that  the  min¬ 
imum  vertex-to-vertex  distance,  which  is  what  we  actu¬ 
ally  computed,  serves  as  a  sufficiently  good  approxima¬ 
tion  of  the  actual  minimum  distance.)  We  now  compute 
a  minimum  spanning  tree  T  of  G,  and  form  a  bridge  be¬ 
tween  each  pair  of  cycles  connected  by  an  edge  of  T;  the 
bridge  is  a  straight  segment  connecting  the  two  nearest 
vertices  on  these  cycles.  It  is  easily  verified  that  the 
ary-projections  of  the  bridges  do  not  cross  each  other, 
and  also  do  not  cross  the  a;i/-projection  of  any  cycle. 
We  create  two  oppositely-oriented  copies  of  each  bridge 
and  add  them  to  the  given  cycles.  This  eliminates  the 
'windows’  Ci, . .  . ,  Ck  and  replaces  the  whole  configura¬ 
tion  by  a  single  composite,  self-touching  but  otherwise 
simple  polygonal  cycle. 

We  emphasize  that  bridge  assembly  was  the  most  sig¬ 
nificant  obstacle  in  previous  works,  which  used  this  tool 
for  reducing  a  branching  situation  to  the  simple  one- 
to-one  case.  That  was  because  a  bridge  in  one  slice 
could  conflict  with  the  geometry  of  the  other  slice,  by 
having  its  projection  intersect  a  contour,  as  shown  in 
Figure  8(b).  As  noted,  we  do  not  face  this  problem. 

We  note  that  this  procedure  is  required  only  in  com¬ 
plicated  cases  (few  of  them  are  presented  in  Section  5); 
in  most  practical  instances  clefts  do  not  tend  to  be 
nested.  In  any  case,  after  this  hole  elimination  step, 
we  are  left  with  a  collection  of  closed  polygonal  cleft  cy¬ 
cles,  with  the  property  that  their  a:j/-projections  enclose 
pairwise-disjoint  regions. 

Our  next  goal  is  to  triangulate  each  cleft  cycle.  Since 
many  such  triangulations  are  possible,  we  seek  a  trian¬ 
gulation  which  minimizes  the  total  area  of  the  triangles. 
More  precisely,  we  want  to  solve  the  following  problem: 
Given  a  3-dimensional  closed  polygonal  curve  P,  and 
an  objective  function  P  defined  on  all  triangles  (called 
'weight  in  the  sequel),  find  the  triangulation  of  P  (i.e.,  a 


collection  of  triandes  spanned  by  the  vertices  of  P.  so 
that  each  edge  of  P  is  incident  to  exactly  one  triangle, 
and  all  other  triangle  edges  are  incident  to  two  trian¬ 
gles  each),  which  minimizes  the  total  sum  of  T  over  its 
triangles. 

For  this  purpose,  we  closely  follow  the  dynamic  pro¬ 
gramming  technique  of  Klincsek  [21]  for  finding  a  poly¬ 
gon  triangulation  in  the  plane,  which  minimizes  the  to¬ 
tal  sum  of  edge  lengths.  Let  P  —  (uq,  iii, ....  = 

tio)  be  the  given  polygon.  Let  (0  <  i  <  j  <  n  -  1) 
denote  the  weight  of  the  best  triangulation  of  the  polyg¬ 
onal  curve  {vi, . . .  ,Vj  =  Vi).  We  then  apply  a 
straightforward  dynamic  programming  approach,  which 
initializes  each  Wi^i+2  to  P{vi,Vi+i,Vi+2)-  and  then 

computes  each  W{i,k),  for  <:  -  ?  =  3,4, _ n  -  1,  as 

iTiini<m<A:[kF’gm  +  Wm,k  +  P(t'i,  Vm  i  ^^i)]-  We  Omit  here 
the  easy  further  details.  The  actual  value  of  P(u,  v.  w) 
that  we  haye  used  is  a  weighted  average  of  several  terms, 
the  most  significant  one  Being  the  area  of  the  triangle; 
see  Sectiori  5  for  more  details. 

Finally,  if  the  interpolation  involves  the  uppermost  or 
lowermost  slice  in  the  given  sequence,  we  also  have  to 
add  to  the  reconstructed  object  boundary  the  regions 
of  material  on  that  slice,  so  as  to  ‘close’  the  volume  of 
that  object. 

5  Experimental  Results 

We  have  implemented  the  whole  algorithm  on  a  Dig¬ 
ital  DECstation  5000/240  and  on  a  Sun  SparcStation 
II  in  C.  We  have  experimented  with  the  algorithm 
on  several  data  files  obtained  by  CT  or  MRI  scan¬ 
ners,  and  obtained  very  good  results  in  practically  all 
cases.  The  input  usually  consisted  of  about  fifty  to 
one  hundred  cross-sections,  from  0.5  mm  to  2.0  mm 
apart.  The  tuning  of  the  parameters  (discretization 
length  and  size  of  neighborhood  in  the  geometric  hash¬ 
ing)  was  very  robust,  and  large  variation  of  these  pa¬ 
rameters  produced  nearly  identical  results.  We  usu¬ 
ally  used  1.0  mm  as  the  discretization  parameter,  and 
3.0  mm  for  the  voting  threshold  (for  human  organs 
whose  global  size  was  between  5  to  20  cm  in  all  di¬ 
mensions).  We  allowed  up  to  two  successive  point  mis¬ 
matches  along  a  match.  A  point-to-point  match  con¬ 
tributed  the  amount  of  l/(d+  0.1)  to  the  match  score, 
where  d  was  the  xy  distance  between  the  two  points. 
We  considered  only  match  candidates  which  received 
4  votes  or  more  and  whose  scores  were  above  15.0. 
(Our  discretization  was  sufficiently  dense  so  that  this 
choice  still  captured  all  contour  overlaps.)  The  objec¬ 
tive  function  P  for  the  cleft-triangulation  was  taken  to 
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Figure  9:  A  synthetic  example 


Figure  10:  A  synthetic  branching  example 


be  0.85A  +  O.OhP  +  O.lOi?,  where  A  is  the  area  of  the 
triangle,  P  is  its  perimeter,  and  R  is  the  ratio  between 
the  largest  and  the  smallest  of  its  three  edges.  All  these 
parameters  were  user  defined,  but  modifying  them  did 
not  achieve  any  better  results.  ^  ■ 

We  can  measure  the  complexity  of  the  algorithm  in 
terms  of  four  variables:  k,  the  total  number  of  input 
points  along  the  contour  edges  of  two  consecutive  slices, 
n,  the  total  number  of  points  after  the  arc  length  dis¬ 
cretization  step,  the  number  of  clefts,  and  h,  the 
size  of  the  biggest  cleft  (after  the  hole-elimination  step). 
Usually,  k  is  considerably  smaller  than  n.  The  number 
^  of  clefts  could  be  in  the  worst  case  as  large  as  Q{k^), 
but  in  practice  ^  is  smaller  than,  or  at  least  comparable 
with  the  number  of  contours  c  (which  is  typically  much 
smaller  than  k). 

Due  to  lack  of  space,  we  do  not  give  here  a  for¬ 
mal  analysis  of  the  complexity  of  the  algorithm.  Such 
an  analysis  produces  terms  that  may  Be  very  large; 
for  example,  the  cleft  triangulation  step  (based  on  dy¬ 
namic  programming)  takes  0{^h^)  time,  which  is,  the¬ 
oretically,  and  in  complex  situations  also  pragmatically, 
the  most  e.xpensive  portion  of  our  algorithrn.  The  en¬ 
tire  algorithm  runs  on  practical  instances  in  average 

0{(k  +  0  log(A;  -f  ^)  -h  n  -t-  time. 

Here  are  some  specific  examples  of  the  performance 
of  the  algorithm: 

Figure  9  shows  a  simple  case,  similar  to  the  first  prob¬ 
lematic  example  of  Boissonnat  [3].  Each  slice  in  this 
example  contains  exactly  one  contour.  The  tiled  match 
appears  in  white,  whereas  the  triangulated  cleft  appears 
in  black. 

Figure  10  shows  a  synthetic  branching  example, 
where  the  lower  slice  contains  two  contours,  whereas 
the  upper  slice  contains  only  one  contour.  Two  long 
matches  were  found  and  tiled  by  the  white  trian¬ 
gles.  The  remaining  cleft  appears  in  between  the  two 
matches,  and  its  triangulation  appears  in  black. 

Figure  11  shows  a  more  complicated  synthetic  exam¬ 
ple,  where  the  lower  slice  contains  one  contour,  and  the 
upper  slice  contains  three  contours.  Figure  11(a)  shows 
a  top  view  of  this  situation,  whereas  Figure  11(b)  shows 
an  isometric  view  of  it.  Figure  11(c)  shows  the  surface 
reconstruction.  The  tiling  of  the  single  match  appears 
in  white.  The  remaining  cleft  consists  of  one  cycle  which 
encloses  two  other  cycles.  Two  bridge  constructions 
compose  the  three  polygons  into  a  single  one,  and  its 


(b)  Before  reconstruction 


(c)  After  reconstruction 


Figure  11:  A  synthetic  complicated  example 


triangulation  appears  in  black. 

The  next  several  figures  show  the  typical  performance 
of  our  algorithm,  eis  observed  from  its  execution  on  a  se¬ 
ries  of  cross-sections  of  a  human  jaw  bone.  Figure  12 
shows  the  reconstructed  surface  between  two  slices.  The 
tiles  of  the  matches  appear  in  white,  and  the  triangula¬ 
tions  of  the  two  remaining  clefts  appear  in  black.  Fig¬ 
ure  13  shows  two  slices  with  a  branching  region.  The 
match  tiles  appear  in  white,  whereas  the  cleft  triangu¬ 
lations  appear  in  black.  Figure  14  presents  a  consider- 
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(a)  Before  reconstruction 


(b)  After  reconstruction 


Figure  13:  A  simple  branching  case 


(a)  Before  reconstruction 


(b)  After  reconstruction 
Figure  14:  A  complicated  branching  case 


(a)  Before  reconstruction 


(b)  After  reconstruction 


Figure  15:  A  multiple  branching  case 


(a)  Before  reconstruction 


(b)  After  reconstruction 


Figure  16:  A  composite  case 


ably  more  complicated  situation,  where  the  reconstruc¬ 
tion  turns  out  to  be  ‘intuitively  correct’.  Figure  14(a) 
shows  that  the  lower  slice  contains  a  contour  with  two 
hole  contours,  and  the  upper  slice  contains  only  one 
contour,  which  mostly  lies  above  the  “material”  region 
of  the  lower  slice.  The  surface  reconstruction  is  shown 
in  Figure  14(b).  The  three  match  tiles  appear  in  white, 
and  the  cleft  triangulations  appear  in  black.  The  reader 
may  verify  that  the  two  “non-material  pillars”  repre¬ 
sented  by  the  two  hole  contours  in  the  lower  slice  were 
connected  to  the  unbounded  “non-material”  region  in 
the  upper  slice.  Similar  complex  examples  are  given  in 
Figures  15  and  16. 

The  reconstruction  of  the  whole  jaw  bone,  whose  in- 
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Model 

Synthetic 

Jaw  Bone 

Slices 

2 

96 

Layers 

1 

95 

Contours: 

Total 

3.0 

417.0 

Per  Slice 

1.5 

4.3 

Vertices: 

Total 

143.0 

17,646.0 

Per  Contour 

47.7 

42.3 

Matches: 

Total 

2.0 

330.0 

Per  Layer 

2.0 

3.5 

Clefts: 

Total 

1.0 

275.0 

Per  Layer 

1.0 

2.9 

Time  (Seconds): 

All: 

Stitching 

0.07 

12.50 

Triangulation 

0.13 

37.12 

Total 

0.20 

49.62 

Per  Layer 

0.20 

0.52 

Table  1 :  Performance  of  the  algorithm 


Figure  17:  A  fully  reconstructed  human  jaw  bone 


(c)  (d) 

Figure  18:  A  topographic  terrain 


put  consisted  of  96  slices,  is  shown  in  Figure  17.  The 
result  was  a  valid  polyhedral  description,  which  con¬ 
tained  about  60,000  triangles.  The  reconstructed  jaw 
contained  209  3-D  cavities  (fully  enclosed  in  the  outer 
3-D  reconstructed  boundary). 

Our  experimentations  were  mostly  performed  on 
medical  imaging  data.  However,  as  we  mentioned  in 
the  introduction,  the  reconstruction  problem  has  other 
applications  as  well.  One  such  application  is  the  re¬ 
construction  of  a  terrain  from  elevation  contour  data  in 
topographic  maps.  This  application  is  much  simpler, 
because  of  the  ly-monotomcity  of  terrains.  For  exam¬ 
ple,  contours  cannot  overlap  in  this  case  (although  they 
can  be  nested).  Figure  18  shows  the  reconstruction  of 
a  terrain  in  the  Zikhron-Ya’akov  area  in  Israel.  Fig¬ 
ures  18(a,b)  show  the  elevation  contours  (a  top  view  and 
an  isometric  view).  The  data  contained  seventeen  levels 
which  are  10  meters  apart,  starting  from  20  and  ending 
at  180  meters  above  see  level.  These  levels  consisted  of 


111  contours  made  of  5,329  edges.  Figures  18(c,d)  show 
the  full  reconstruction  of  the  terrain  (again,  a  top  view 
and  an  isometric  view). 

Finally,  we  measured  the  performance  of  our  imple¬ 
mentation  on  two  of  the  examples  described  above.  All 
the  time  measurements  were  taken  on  a  Digital  DEC- 
station  5000/240.  Table  1  summarizes  the  performance 
of  the  algorithm  on  the  simple  branching  case  shown  in 
Figure  10,  and  on  the  whole  jaw  bone  shown  in  Fig¬ 
ure  17.  Note  that  the  data  of  the  jaw  bone  was  accu¬ 
mulated  from  95  layer  reconstructions.  Therefore,  some 
of  the  data  is  averaged  per  layer.  We  note  that  in  medi¬ 
cal  data,  at  least  the  data  with  which  we  experimented, 
successive  slices  tend  to  differ  a  lot  in  their  geometries. 
Therefore,  many  clefts  are  created  and  their  (relatively 
time-consuming)  triangulations  affect  the  total  running 
time  of  the  algorithm. 


6  Conclusion 

We  have  proposed  in  this  paper  an  algorithm  for  solving 
the  practical  problem  of  polyhedral  interpolation  be¬ 
tween  parallel  polygonal  slices.  This  problem  has  many 
medical  and  geographic  applications,  and  also  appears 
to  be  a  fairly  basic  and  interesting  problem  in  computer 
gr^hics  and  solid  modeling. 

C5ur  method  produces  a  relatively  smooth  boundary, 
due  to  the  contour  discretization.  In  situations  where 
the  addition  of  new  vertices  is  not  desired,  e.g.  due  to 
data  explosion,  our  system  uses  the  discretization  only 
for  the  matching  step,  but  the  tiling  itself  and  the  fol¬ 
lowing  minimum-area  triangulation  are  performed  on 
the  contours  containing  only  the  original  points. 

We  feel  that  our  technique  reconstructed  the  bound¬ 
ary  of  various  organs  in  an  intuitively  appealing  manner; 
one  might  say  that  our  algorithm  demonstrated  some 
‘understanding’  of  the  underlying  problem.  The  results 
were  more  than  adequate  even  in  extreme  cases  of  tiling 
between  two  seemingly  totally  different  slices. 

We  plan  to  continue  the  experiinentation  with  our  al¬ 
gorithm,  to  test  its  performance  limits  and  see  if  there 
are  data  instances  on  which  the  algorithrn  might  not 
perform  well,  thus  requiring  further  calibrations  and  en¬ 
hancements  (in  view  of  our  experimentation  so  far,  we 
doubt  that  anything  really  problematic  will  arise). 
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Abstract 

We  present  practical  methods  for  approximate  geomet¬ 
ric  pattern  matching  in  d-dimensions  along  with  exper¬ 
imental  data  regarding  the  quality  of  matches  and  run¬ 
ning  times  of  these  methods  verses  those  of  a  branch- 
and-bound  search.  Our  methods  are  faster  than  previ¬ 
ous  methods  but  still  produce  good  matches. 

1  Introduction 

Suppose  we  are  given  a  set  B  of  n  points  in  ,  which  we 
shall  call  the  background,  and  a  set  P  of  m  points  in  3?'^, 
which  we  shall  call  the  pattern.  The  geometric  pattern 
matching  problem  is  to  determine  a  rigid  motion,  taken 
from  some  class  of  motions,  such  that  each  point  in  P 
is  moved  to  a  point  in  B.  This  problem  is  motivated 
by  a  number  of  problems  in  computer  vision,  including 
character  recognition.  Of  course,  it  is  easy  to  solve  this 
problem  in  0(nmlog  n)  time  just  by  keeping  the  points 
of  B  in  some  dictionary. 

Unfortunately,  this  approach  is  very  sensitive  to  noise 
in  the  background.  Thus,  it  is  actually  more  natural  to 
ask  for  a  rigid  motion  of  P  such  that  each  point  of  P 
is  moved  near  to  a  point  in  B.  We  call  this  the  approx¬ 
imate  geometric  pattern  matching  problem.  Formally, 
we  desire  a  rigid  motion  T,  taken  from  some  class  of 
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motions  T,  such  that  the  directed^  Hausdorff  distance 
between  T{P)  and  B  is  minimized.  Recall  that  the  di¬ 
rected  Hausdorff  distance,  h{C,  D),  from  a  point  set  C 
to  another  point  set  D  is  defined  as 

h{C,  D)  —  max  min  p(c,  d), 
c6C*  d^D 

where  p  is  the  usual  Euclidean  distance  between  c 
and  d. 

1.1  Previous  Work 

This  line  of  research  has  a  rich  history,  which  really  be¬ 
gins  in  earnest  in  the  computational  geometry  literature 
with  work  by  Alt,  Mehlhorn,  Wagener,  and  Welzl  [3]  on 
methods  for  finding  congruences  between  two  sets  of 
points  A  and  B  under  rigid  motions.  Besides  giving  a 
number  of  exact  methods,  they  introduce  an  important 
approximate  version  of  the  problem,  where  one  is  given 
a  tolerance  e  >  0  and  asked  to  find  a  motion  T,  if  it  ex¬ 
ists,  that  allows  a  matching  between  each  point  in  T{A) 
and  a  point  in  B  at  most  distance  e  away.  Moreover, 
they  also  consider  the  optimization  version  of  this  prob¬ 
lem,  in  which  one  wishes  the  smallest  e  admitting  such 
a  motion.  This  version  of  the  problem  is  very  close  to 
the  problem  we  address  in  this  paper.  Unfortunately, 
the  running  times  for  this  version  of  the  problem  are 
quite  high. 

Imai,  Sumino,  and  Imai  [21]  show  that  these  bounds 
can  be  reduced  somewhat  if  one  is  already  given  an  as¬ 
signment  of  points  in  A  to  points  in  B.  Likewise,  Arkin, 
Kedem,  Mitchell,  Sprinzak,  and  Werman  [4]  show  that 
one  can  improve  the  running  times  in  the  approximate 
case  if  the  “noise  regions”  are  disjoint.  Even  so,  the 
methods  derived  in  these  papers  are  still  relatively  non¬ 
trivial  and  the  running  times  for  all  but  the  most  simple 
motions  are  still  quite  high. 

In  work  more  directly  related  to  this  paper,  sev¬ 
eral  researchers  [6,  7,  18,  19,  20,  25]  have  studied 

^The  undirected  Hausdorff  distance  is  defined  as  H(C,  D)  = 
max{^(C,  D),  /i(D,  C)}. 
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methods  for  finding  rigid  motions  that  minimize  ei¬ 
ther  the  directed  or  undirected  HausdorfF  distance  be¬ 
tween  the  two  point  sets.  All  of  these  methods  are 
based  upon  intersecting  higher-degree  curves  and/or 
surfaces,  which  are  then  searched  (sometimes  paramet¬ 
rically  [1,  8,  9,  10,  22])  to  find  a  global  minimum.  This 
reliance  upon  intersection  computations  makes  for  al¬ 
gorithms  that  are  potentially  numerically  unstable,  are 
conceptually  complex,  and  have  running  times  that  are 
high  for  all  but  the  most  trivial  motions.  Indeed,  Ruck- 
lidge  [26]  gives  evidence  that  such  methods  must  have 
high  running  times. 

The  high  running  times  of  these  methods  motivated 
Heffernan  [16]  and  Heffernan  and  Schirra  [17]  to  con¬ 
sider  an  approximate  decision  problem  for  approximate 
point  set  congruence  (they  did  not  study  point  set  pat¬ 
tern  matching).  The  general  framework  is  that  one  is 
given  a  parameter  e  >  0  and  asked  to  solve  the  ap¬ 
proximate  set  congruence  problem  [3],  except  that  one 
is  allowed  to  “give  up”  if  one  discovers  that  c  is  “too 
close”  to  the  optimal  value  e*  (i.e.,  the  Hausdorff  dis¬ 
tance  between  T(A)  and  B).  By  allowing  algorithms  to 
be  “lazy”  in  this  way,  they  show  that  the  running  times 
can  be  significantly  reduced  from  the  previous  bounds. 
However,  this  may  not  be  the  best  way  to  approach 
the  problem  with  an  approximation  algorithm,  since 
in  some  instances  the  program  will  consume  a  possibly 
large  amount  of  computing  time  and  terminate  without 
an  approximation.  Unfortunately,  the  running  times  of 
the  methods  in  [16,  17]  increase  substantially  if  one  tries 
to  get  a  yes-or-no  answer  for  an  e  close  to  e* .  Thus,  it 
is  difficult  to  use  their  methods  to  approximate  e*. 

1.2  Our  Results 

In  this  paper  we  present  a  simple  (almost  trivial)  ap¬ 
proach  for  approximate  point  set  pattern  matching  un¬ 
der  rigid  motions,  where  one  is  given  a  pattern  set  P 
of  m  points  in  and  a  background  B  of  n  points  in 
3?^^  and  asked  to  find  a  rigid  motion  T  that  minimizes 
h(T(P),  B).  Our  methods  are  based  upon  a  simple 
“pinning”  strategy.  They  are  fast,  easy  to  implement, 
and  do  not  suffer  from  any  numeric  instability  difficul¬ 
ties.  Moreover,  since  they  are  defined  for  the  (more 
general)  directed  Hausdorff  measure,  they  are  tolerant 
of  noise  in  the  background.  Our  methods  are  not  exact, 
however. 

Instead,  in  the  spirit  of  approximation  methods  for 
other  hard  problems  (e.g.,  NP-hard  problems  [11,  14]), 
we  derive  algorithms  that  are  guaranteed  to  come  close 
to  the  optimal  value,  6*,  which  is  also  the  approach 
taken  by  Alt  et  al.  [2]  for  approximate  polygon  match¬ 
ing.  In  particular,  each  of  our  methods  gives  a  rigid 
motion  T  such  that  h(T(P),  B)  <  a6* ,  for  some  small 
constant  a  >  1. 

We  justify  the  implementability  of  our  methods 
through  empirical  tests.  In  particular,  we  study  the 


Motion 

Best  Match 

Our  Method 

Factor 

T  in  3?" 

nni^  logn  [19] 

nmlogn 

2 

rn  in  31^ 

n^m^logn  [6] 

n^m  log  n 

4 

T  in  31'^ 

log^  n  [19] 

nm  log  n 

2  +  e 

n  in 

- 

n'^m  log  n 

4  +  e 

TTl  in  3?^ 

- 

n^m  log  n 

8  +  e 

Table  1:  Our  Results.  We  give  the  previous  best- 
match  asymptotic  running  times,  our  asymptotic  run¬ 
ning  times,  and  our  worst-case  approximation  fac¬ 
tor  guarantee.  The  transformations  considered  are 
T  ^translation  and  TZ  =rotation,  and  their  combina¬ 
tion.  The  parameter  e  can  be  any  positive  constant  less 
than  1.  Also,  the  previous  methods  for  the  3-d  transla¬ 
tion  case  is  actually  for  undirected  Hausdorff  distance, 
and  the  translation  method  in  [19]  is  for  d  <3. 


running  time  and  match  quality  performance  of  our 
methods  when  run  on  various  input  instances,  and  com¬ 
pare  this  performance  with  that  of  a  more  conventional 
procedure  based  upon  a  branch-and-bound  search  of  a 
discretized  configuration  space.  Our  experimentation 
gives  evidence  that  our  methods  are  indeed  quite  fast 
in  practice,  yet  nevertheless  are  still  able  to  find  rigid 
motions  with  good  matches.  In  addition  to  this  analy¬ 
sis,  we  also  give  some  heuristics  that  speed  up  the  run¬ 
ning  time  in  practice,  even  though  they  do  not  actually 
improve  the  worst-case  running  time. 

In  the  next  section  we  describe  our  methods  under 
various  rigid  motions,  in  2-,  3-,  and,  in  some  cases, 
higher  dimensions.  Our  results  are  summarized  in  Ta¬ 
ble  1.  We  give  some  of  the  results  of  our  experimental 
work  in  Section  3,  and  we  conclude  in  Section  4. 

2  Our  Algorithms 

In  this  section  we  give  each  of  our  methods  for  approx¬ 
imate  geometric  pattern  matching  including  bounds  on 
the  quality  of  match  that  will  be  produced  relative  to 
the  absolute  best  possible  match. 

2.1  Translation  Only  in 

Suppose  we  are  given  a  set  B  of  n  points  in  3?“^  and 
a  set  P  of  m  points  in  where  d  is  considered  con¬ 
stant.  In  this  subsection  we  give  an  efficient  method 
for  finding  a  translation  T'  such  that  h{T'{P),B)  is 
at  most  2/i(ropt(B),  B),  where  Topt(B)  is  the  optimal 
translation,  i.e.,  the  one  that  minimizes  h(T{P),  B), 
taken  over  all  translations  T. 

Pick  some  point  p  G  B  as  a  “representative”  for 
P.  For  each  b  E  B,  define  Tj  to  be  the  transla¬ 
tion  that  takes  p  to  b.  Our  method,  then,  is  to 
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find  min(,gB{/i(Tj(P),5)}  as  our  best  approximate 
match,  and  let  T'  be  the  translation  Tj  that  achieves 
this  bound.  This  can  clearly  be  done  in  0{nm  * 
Nearest  dip)),  where  Nearestdiji)  is  the  time  needed  to 
perform  a  nearest-neighbor  query  in  an  n-point  set  (in 
this  case  B)  in  3?'^. 

Of  course,  if  d  =  2,  then  we  can  achieve 

Nearest2{n)  =  O(logn)  by  answering  nearest-neighbor 
queries  using  point  location  in  a  Voronoi  diagram  for 
B  (e.g.,  [12,  23,  24]),  which  requires  O(nlogn)  prepro¬ 
cessing.  For  higher  dimensions,  this  approach  is  not 
as  efficient,  however.  So,  our  method  for  implementing 
nearest  neighbors  will  instead  be  based  on  the  (practi¬ 
cal)  method  of  Arya  et  al.  [5],  which  finds  approximate 
nearest  neighbors  in  O(logn)  time  in  for  any  con¬ 
stant  dimension  d  >  2,  after  O(nlogn)  preprocessing. 
Their  method  can  be  tuned  to  return  a  point  whose  dis¬ 
tance  is  at  most  a  (H-£)-factor  larger  than  the  nearest- 
neighbor  distance,  for  any  constant  0  <  e  <  1. 

Lemma  2.1  h{T'{P),B)  <  2/i(Topt(P),  5)  m  and 
h{T'{P),B)  <  (2  +  e)hiTopt{P),B)  in  for  d  >  3, 
with  e  >  0  being  any  constant  less  than  1. 

Proof:  (For  5ft^)  For  simplicity  of  expression,  de¬ 

fine  hopt  =  h(Topt{P),P)-  Observe  that  for  each 
P  G  ?opt(P),  there  exists  an  associated  b  E  B  that 
is  within  a  distance  /lopt  of  P-  Consider  the  process  of 
translating  the  entire  pattern  Topt(P)  so  that  a  par¬ 
ticular  point  p  now  coincides  with  its  associated  back¬ 
ground  point  6.  This  translation  will  cover  a  distance 
of  at  most  /lopt  and  will  therefore  increase  the  distance 
from  any  other  point  in  the  pattern  to  its  associated 
background  point  by  at  most  /lopt-  Therefore,  it  will 
have  a  directed  Hausdorff  distance  of  at  most  twice 
that  of  Topt(P).  This  translation  will  be  one  of  those 
generated  and  checked  by  our  algorithm.  Thus,  our  al¬ 
gorithm  will  produce  a  translation  that  has  a  directed 
Hausdorff  distance  that  is  at  most  a  factor  of  two  times 
the  minimal.  (For  with  d  >  3,  this  factor  is  multi¬ 
plied  by  (1  -t-  e'),  where  e'  is  defined  so  that  the  approx¬ 
imation  factor  for  our  method  becomes  2  -f-  e.)  ■ 

2.2  Translation  and  Rotation  in 

Suppose  we  are  given  a  set  5  of  n  points  in  and  a 
set  P  of  m  points  in  3?^.  In  this  subsection  we  give  an 
efficient  method  for  finding  a  Euclidean  motion  (trans¬ 
lation  and  rotation)  E'  such  that  h{E'{P),  B)  is  at  most 
4h{Eopt{P),  B),  where  EoptiP)  is  an  optima/ Euclidean 
motion,  i.e.,  one  that  minimizes  h{E{P),  B),  taken  over 
all  valid  motions  E. 

Select  from  the  pattern  diametrically  opposing  points 
and  call  them  r  and  k.  Point  r  is  treated  as  both  the 
distinct  representative  of  the  pattern  for  the  transla¬ 
tion  part  of  the  transformation  and  it  is  treated  as  the 
center  of  rotation  for  the  rotation  part  of  the  transfor¬ 
mation.  Specifically,  for  each  b  £  B,  define  Tj  to  be  the 


translation  that  takes  r  to  6.  Also  for  s.  h'  £  B ,b'  ^  b , 
define  Rv  to  be  the  rotation  that  makes  r,  6',  and 
k  collinear.  Let  be  the  Euclidean  motion  that 

is  the  combination  of  Tj  and  Pj'.  Our  method  is 
to  find  minj,6-6i3{/i(£'i,,i,'(P),B)}  as  our  best  approx¬ 
imate  match,  and  let  E'  be  the  Euclidean  motion  Et^b' 
that  achieves  this  bound.  This  can  clearly  be  done  in 
0{n?m* Nearest3{n))  time,  which  we  can  implement  to 
be  0(n^m  log  n). 

Lemma  2.2  h{E'{P),B)  <  4:h{Eopt{P),  B),  for  any 
constant  0  <  e  <  1. 

Proof;  For  simplicity  of  expression,  define  /lopt  = 
h{Eopt{P),B).  Observe  that  for  each  p  £  Eopt{P), 
there  exists  an  associated  b  £  B  that  is  within  a  dis¬ 
tance  /lopt  of  p.  Consider  the  process  of  translating  the 
entire  pattern  E'opt(P)  so  that  the  particular  point  r 
now  coincides  with  its  associated  background  point  b. 
This  translation  will  cover  a  distance  of  at  most  hopt 
and  wilt  therefore  increase  the  distance  from  any  other 
point  in  the  pattern  to  its  associated  background  point 
by  at  most  /lopt-  Now  consider  the  process  of  rotating 
the  entire  pattern  about  point  r  so  that  line  between  r 
and  k  now  passes  through  the  background  point  asso¬ 
ciated  with  k  in  E'opt(P)-  This  rotation  will  have  the 
effect  of  moving  point  k  by  at  most  2/iopt-  Since  k  is 
the  fartherest  point  in  the  pattern  from  the  center  of 
rotation,  all  other  pattern  points  will  be  moved  by  a 
distance  of  at  most  2/iopt.  Thus,  any  given  point  in 
the  pattern  can  be  moved  by  at  most  /lopt  during  the 
translation  and  at  most  2/iopt  during  the  rotation,  and 
could  have  been  initially  at  most  /lopt  away  from  its 
associated  background  point.  Therefore,  each  point  in 
the  pattern  will  be  at  most  a  distance  of  4/iopt  from  a 
background  point.  The  pattern  in  its  current  position 
coincides  with  one  of  the  Euclidean  transformation  gen¬ 
erated  and  checked  by  our  algorithm.  ■ 

2.3  Rotation  in 

Suppose  we  are  given  a  set  B  of  n  points  in  3?^  and 
a  set  F  of  m  points  in  3?^.  In  this  subsection  we 
give  an  efficient  method  for  finding  a  (pure)  rotation 
R',  about  the  origin,  such  that  h{R'{P),  B)  is  at  most 
'ih{Ropt{P),  B),  where  RoptiP)  is  an  optimal  rotation, 
i.e.,  one  that  minimizes  h{R{P),B),  taken  over  all  ro¬ 
tations  R. 

Find  a  point  pi  £  P  that  is  furthest  from  the  ori¬ 
gin.  Find  a  point  P2  &  P  that  maximizes  its  per¬ 
pendicular  distance  to  the  line  defined  by  the  origin 
and  point  pi.  For  each  b'  £  B,  define  Rib'  to  be 
the  rotation  that  makes  the  origin,  pi  and  b'  collinear. 
For  each  b"  £  B,b"  ^  b' ,  define  R2b"  to  be  the  ro¬ 
tation  about  the  origin-pi  axis  that  makes  the  ori¬ 
gin,  pi,  P2,  and  6"  coplanar.  Our  method,  then,  is 
to  find  minj,'_(,»gij{/i(i?2(,"(Fli,'(P)),  B)}  as  our  best 
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approximate  match,  and  let  R'  be  the  resultant  rota¬ 
tion  R2iii(Rli,,(P))  that  achieves  this  bound.  This  can 
clearly  be  done  in  0{rRm  *  Nearest^in))  time,  which 
is  0(n^m log^u)  if  one  uses  the  best  current  point 
location  method  for  a  3-dimensional  convex  subdivi¬ 
sion  [15]  to  query  nearest  neighbors  in  a  3-dimensional 
Voronoi  diagram  (e.g,,  see  [12,  23,  24]).  Our  preference, 
however,  is  to  achieve  a  faster  (and  more  practical) 
0{p?m\ogn)  time  bound  using  the  approximate  near¬ 
est  neighbors  method  of  Arya  et  al.  [5].  This  slightly 
weakens  our  approximation  factor,  as  we  show  in  the 
following  lemma,  but  we  feel  this  is  justified  by  the  time 
efficiency. 

Lemma  2.3  h{R'{P),B)  <  (4 e)h(J?opt(P),  S),  for 
any  constant  0  <  e  <  1. 

Proof:  Since  the  e  term  is  a  direct  consequence 

of  our  using  approximate  nearest  neighbor  searching 
to  achieve  Nearestz{n)  —  O(logn),  it  is  sufficient  to 
show  that  actual  nearest  neighbors  would  give  an  ap¬ 
proximation  factor  of  4.  For  simplicity  of  expression, 
define  h^pt  —  h(^Ropt{P),B).  Observe  that  for  each 
P  €  i?opt(P),  there  exists  an  associated  b  E  B  that 
is  within  a  distance  hopj  of  p.  Consider  the  process 
of  rotating  the  entire  pattern  Popt(P)  so  that  pi,  the 
furthest  pattern  point  from  the  origin,  now  becomes 
collinear  with  its  associated  background  point  b  and  the 
origin.  This  process  can  move  any  point  in  the  pattern 
by  at  most  hopt-  Now  consider  the  second  rotation  that 
aligns  p2  with  its  matching  background  point.  This  ro¬ 
tation  may  move  p2  a  distance  of  at  most  2hopt,  and 
therefore  it  may  move  any  point  in  the  pattern  by  at 
most  2/IqpI;.  This  rotation  will  be  one  of  those  generated 
and  checked  by  our  algorithm.  ■ 

2.4  Translation  and  Rotation  in  3?^ 

Suppose  we  are  given  a  set  B  of  n  points  in  5?^  and  a  set 
P  of  m  points  in  3?^  this  subsection  we  give  an  effi¬ 
cient  method  for  finding  a  Euclidean  transformation  E' 
such  that  h{E'{P),  B)  is  at  most  (8  +  e)/i(£'opt(P),  B), 
where  Eopt{P)  is  the  opfz’ma/ Euclidean  transformation, 
i.e.,  the  one  that  minimizes  h(^E(^P'),  B),  taken  over  all 
such  transformations  E. 

Select  from  the  pattern  diametrically  opposing  points 
and  call  them  v  and  k.  Choose  a  point  I  E  P  such  that 
the  perpendicular  distance  from  /  to  the  line  vk  is  max¬ 
imum.  For  each  b  E  B,  define  Tj  to  be  the  translation 
that  takes  pattern  point  r  to  6. 

For  each  b'  E  B,b'  b,  define  to  be  the  rotation 
that  causes  r,  k  and  6'  to  become  collinear.  For  each 
b  E  B,b  h  ,h"  ^  b,  define  to  be  the  rotation 
about  the  rk  axis  that  brings  h"  into  the  r,  k,  I  plane. 

Our  method,  then,  is  to  compute  the  value  of 
mini  j/ j//g5{h(_R2i,//(i?l;,/(Tj(P))),  5)}  as  our  best  ap¬ 
proximate  match,  and  let  E'  be  the  Euclidean  trans¬ 
formation  R2i„{Rlf,,{T},{P))  that  achieves  this  bound. 


This  can  be  done  in  0(n^m  *  Nearests^n))  time.  As 
above,  we  achieve  Nearest3(n)  =  O(logn)  using  ap¬ 
proximate  nearest-neighbor  searching  [5],  and  end  up 
with  the  following  result: 

Lemma  2.4  h{E'{P),B)  <  (8 -f- e)/i(E'opt(P),  5),  for 
any  constant  0  <  e  <  1. 

Proof;  For  simplicity  of  expression,  define  hopt  = 
hiEoptiP),  B).  In  addition,  as  in  previous  proofs,  let 
us  show  that  the  expansion  factor  is  8  if  one  were  to  use 
actual  nearest  neighbors  instead  of  approximate  near¬ 
est  neighbors.  Observe  that  for  each  p  E  E'opt(P),  there 
exists  an  associated  b  E  B  that  is  within  a  distance 
hopt  of  p.  Consider  the  process  of  translating  the  entire 
pattern  Eopt{P)  so  that  r  becomes  coincident  with  its 
associated  background  point.  This  process  can  move 
any  point  in  the  pattern  by  a  distance  of  at  most  hopt- 
Now  consider  the  process  of  rotating  the  entire  pat¬ 
tern  so  that  line  rk  passes  through  the  background 
point  that  was  associated  with  k.  This  rotation  can 
move  any  point  in  the  pattern  by  at  most  2hopt.  Now 
consider  a  second  rotation  that  brings  the  background 
point  associated  with  I  into  the  rlk  plane.  This  rota¬ 
tion  may  move  pattern  point  p2  a  distance  of  at  most 
4hopt,  and  therefore  it  may  move  any  point  in  the  pat¬ 
tern  by  at  most  4hopt.  This  rotation  will  be  one  of 
those  generated  and  checked  by  our  algorithm.  The 
translation  may  have  moved  any  point  a  distance  of 
at  most  hopt,  the  first  rotation  may  have  moved  any 
point  a  distance  of  at  most  2hopt  farther,  and  the  sec¬ 
ond  rotation  may  have  moved  any  point  a  distance  of 
at  most  4hopt  still  farther.  Considering  that  any  given 
pattern  point  may  have  been  a  distance  of  hopt  away 
from  its  associated  background  point  to  start  with,  no 
pattern  point  can  be  farther  than  8hopt  from  its  asso¬ 
ciated  background  point.  Our  algorithm  will  therefore 
produce  a  Euclidean  transformation  that  has  a  directed 
Hausdorff  distance  that  is  at  most  a  factor  of  eight  times 
the  minimal.  ^ 

Thus,  we  have  shown  how  one  can  easily  achieve  all 
the  results  claimed  in  Table  1. 

3  The  Experimental  Setup 

We  have  claimed  more  than  this,  however,  in  that  we 
assert  that  our  methods  are  practical.  Therefore,  so  as 
to  justify  this  stronger  claim,  we  have  produced  an  im¬ 
plementation  of  our  methods,  and  we  have  compared 
this  implementation  with  a  method  that  produces  best 
matches  to  an  arbitrary  precision  using  a  conventional 
branch-and-bound  search  of  a  discretized  configuration 
space.  This  conventional  method  seems  to  be  the  most 
practical  previous  best  match  procedure  (we  did  not 
feel  it  was  practically  feasible  to  implement  the  previ¬ 
ous  intersection-based  methods).  As  we  show  through 
our  experimental  results,  however,  this  conventional 
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method  is  still  quite  slow  compared  to  our  method,  and 
the  matches  it  finds  are  not  that  much  better  than  the 
ones  that  our  method  finds. 

3.1  Example  Generation 

We  generated  problem  instance  examples  in  dimension 
d  by  randomly  choosing  a  set  of  n  background  points  in 
the  unit  d-cube  with  lower  left  corner  at  the  origin.  The 
points  are  generated  by  selecting  each  of  the  d  coordi¬ 
nates  from  the  range  [0,1)  with  uniform  distribution. 
Once  we  generate  this  background,  we  randomly  select 
m  points  from  this  background  to  be  an  unperturbed 
pattern.  Each  pattern  point  is  then  perturbed  by  a 
small  amount  so  that  the  pattern  no  longer  identically 
resembles  a  subset  of  the  background  points.  The  per¬ 
turbation  is  done  as  follows.  A  parameter  5  defines  the 
maximum  distance  that  a  pattern  point  can  be  moved. 
The  original  position  of  a  pattern  point  and  the  dis¬ 
tance  S  define  a  hypersphere  centered  at  the  point’s 
original  position.  The  pattern  point  will  be  moved  from 
its  original  position  to  a  point  inside  of  this  d-sphere 
such  that  the  probability  that  it  will  be  placed  in  any 
subvolume  of  the  sphere  is  equal  to  the  ratio  of  that 
subvolume  to  the  volume  of  the  entire  d-sphere.  In 
other  words,  perturbation  causes  each  pattern  point, 
p  to  be  randomly  placed  with  uniform  distribution  in 
the  hypersphere  with  center  p  and  radius  6.  After  this 
perturbation  process,  the  pattern  is  referred  to  as  a  per¬ 
turbed  pattern.  We  use  this  perturbed  pattern  and  the 
background  from  which  it  was  generated  as  the  sets  P 
and  B,  respectively. 

3.2  Implementation  of  the  Approxi¬ 
mate  Match  Algorithms 

This  section  gives  the  details  of  the  implementation 
of  our  approximate-match  algorithm  for  four  different 
matching  problems  involving  translation  and  rotation 
in  2  and  3  dimensions. 

3.2.1  Translation  in  9?^ 

As  described  in  Section  2.1,  our  approximate  pattern 
matching  algorithm  translates  the  pattern  so  that  the 
distinct  representative  of  the  pattern  coincides  with 
each  of  the  n  background  points  in  succession.  For 
each  such  translation,  the  Hausdorff  distance  is  calcu¬ 
lated  and  compared  with  the  best  found  so  far.  If  the 
new  Hausdorff  distance  is  smaller  that  the  best  found  so 
far,  the  position  of  the  pattern  (i.e.,  the  position  of  the 
distinct  representative)  and  this  new  best  distance  re¬ 
place  those  recorded  so  far.  After  the  pattern  has  been 
translated  to  all  of  the  background  points,  we  output 
the  best  found.  This  position  of  the  pattern  is  known 
to  produce  a  Hausdorff  distance  that  is  within  a  factor 
of  two  of  optimal. 


There  are  several  heuristics  one  can  apply  to  this 
method,  which  do  not  improve  the  worst-case  running 
time,  but  which  do  improve  the  running  time  in  prac¬ 
tice.  First,  since  the  relative  qualities  of  the  various 
translations  are  identical  using  p  to  be  either  the  dis¬ 
tance  between  the  points  or  the  distance  squared  be¬ 
tween  the  points,  we  use  distance  squared  to  avoid  the 
costly  square  root  function  in  this  inner-most  loop.  At 
the  end  of  the  program,  when  we  have  found  the  trans¬ 
lation  that  we  want,  we  then  take  the  square  root  to 
get  the  actual  Hausdorff  distance. 

We  also  use  a  condition  that  terminates  the  while- 
loop  early  once  it  is  known  that  a  particular  placement 
need  not  be  further  considered.  Observe  that  in  the  cal¬ 
culation  of  the  Hausdorff  distance,  we  are  finding  the 
maximum  amount  by  which  a  pattern  point  deviates 
from  its  nearest  background  point.  As  we  determine 
this  quantity  for  each  of  the  pattern  points  we  have  a 
partial-maximum  at  any  given  point  in  the  loop.  If  this 
partial-maximum  ever  exceeds  the  best  Hausdorff  dis¬ 
tance  found  so  far,  the  placement  that  we  are  checking 
is  known  to  be  suboptimal  and  does  not  warrant  fruther 
consideration.  We  therefore  terminate  the  while  loop  as 
soon  the  partial  computation  of  the  Hausdorff  distance 
exceeds  the  global  best  found  so  far. 

3.2.2  Translation  in 

The  algorithm  for  translation  in  3?^,  and  for  that  mat¬ 
ter  is  identical  to  the  algorithm  for  translation  in 
,  except  that  the  distances  are  computed  in  a  higher 
dimension. 

3.2.3  Translation  and  Rotation  in  3?^ 

The  implementation  of  the  algorithm  described  in  Sec¬ 
tion  2.2,  is  also  quite  straightforward.  Diametrically 
opposing  pattern  points  are  chosen,  one  of  which  will 
serve  as  both  the  distinct  representative  of  the  pattern 
and  as  also  the  center  of  rotation.  The  algorithm  then 
translates  the  pattern  so  that  the  distinct  representa¬ 
tive  coincides  with  each  of  the  n  background  points  in 
succession.  After  each  translation,  the  pattern  is  ro¬ 
tated  about  the  current  position  of  the  distinct  repre¬ 
sentative  a  total  of  n  —  1  times  so  that  after  each  rota¬ 
tion,  the  other  antipodal  point  is  aligned  with  another 
one  of  the  background  points.  We  now  have  the  pattern 
in  one  of  the  n  *  (n  -  1)  positions  at  which  we  check 
the  Hausdorff  distance.  As  with  the  translation-only 
case,  we  maintain  the  best  Hausdorff  distance  found  so 
far  and  the  position  of  the  pattern  that  produced  it. 
If  at  any  time  one  of  the  n  *  (n  —  1)  placements  has 
a  directed  Hausdorff  distance  that  is  better  than  the 
best  found  so  far,  our  records  are  updated  to  reflect 
this  new  best  position  and  Hausdorff  distance.  Again, 
we  use  the  distance-squared  and  early  loop-termination 
heuristics  in  order  to  further  speed  up  the  algorithm. 
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3.2.4  Translation  and  Rotation  in  3?^ 

The  algorithm  for  producing  an  approximate  match  for 
translation  and  rotation  in  is  as  follows.  We  again 
select  the  distinct  representative  and  the  antipode  of 
the  pattern  as  we  have  done  in  Section  3.2.3.  In  this 
case,  we  also  select  a  third  pattern  point,  called  the  ra¬ 
dial  point,  which  has  the  property  that  it  is  the  great¬ 
est  distance  away  from  the  line  passing  through  the 
distinct  representative  and  the  antipode.  Our  approxi¬ 
mate  match  algorithm  is  comprised  of  three  nested  for- 
loops.  The  outer- most  loop  translates  the  pattern  such 
that  the  distinct  representative  of  the  pattern  coincides 
with  each  of  the  n  background  points  in  succession. 
The  next  loop  chooses  one  of  the  remaining  n  —  1  back¬ 
ground  points  and  rotates  the  pattern  about  the  current 
position  of  the  distinct  representative  so  that  the  an¬ 
tipode  becomes  aligned  with  this  selected  background 
point.  The  inner- most  for-loop  selects  a  third  back¬ 
ground  point  from  the  remaining  n  —  2  and  performs  a 
second  rotation  of  the  pattern,  this  time  about  the  line 
passing  through  the  current  position  of  the  distinct  rep¬ 
resentative  and  the  current  position  of  the  antipode,  to 
bring  the  plane  defined  by  the  distinct  representative, 
the  antipode,  and  the  radial  point  into  a  position  that 
includes  the  background  point  chosen  by  this  third  for- 
loop.  Note  that  there  are  two  rotations  summing  to 
27r  that  that  will  do  this,  one  bringing  the  radial  point 
nearer  to  the  third  chosen  background  point  and  the 
other  taking  it  farther  away.  We  are  of  course  interested 
in  the  one  that  brings  it  closer,  which  is  the  rotation 
that  has  the  smaller  absolute  value. 

For  each  of  the  n{n  —  l)(n  —  2)  placements  produced 
by  the  above  described  for  loops,  the  Hausdorff  distance 
of  the  placement  is  generated  and  the  current  best  is 
kept.  At  the  termination  of  our  algorithm,  we  output 
the  best  placement  found. 

Implementation  of  the  rotations  described  in  this  sec¬ 
tion  are  explained  in  detail  in  Foley  and  vanDam  [13]. 

3.3  Implementation  of  the  Branch  and 
Bound  Algorithms 

This  section  gives  the  implementation  details  of  the  cor¬ 
responding  branch-and-bound  algorithm  for  matching 
problems  involving  translation  and  rotation  in  2  and  3 
dimensions. 

3.3.1  The  Conventional  Method  for  Transla¬ 
tion  in 

The  conventional  method  against  which  we  compared 
our  method  is  a  recursive  algorithm.  It  receives  a 
square  defined  by  a  center  point  and  a  side  length.  It 
then  “probes”  the  center  of  the  square  by  translating 
the  pattern  so  that  the  distinct  representative  of  the 
pattern  is  in  the  center  of  the  square.  For  the  pattern 


in  this  position,  the  Hausdorff  distance  is  calculated.  If 
this  distance  is  the  best  found  so  far,  it  is  recorded  along 
with  the  probe  point  (center  of  square).  The  algorithm 
then  recurses  on  each  of  its  four  quadrants. 

Notice  that  squares  that  are  passed  to  the  recursive 
algorithm  will  in  general  exclude  both  pattern  points 
and  background  points.  The  pattern  and  background 
are  not  cropped  to  fit  these  squares,  these  squares  sim¬ 
ply  describe  areas  in  which  we  would  consider  placing 
the  distinct  representative  of  the  pattern  in  an  effort  to 
find  good  matches.  The  recursion  is  terminated  when 
it  reaches  a  predefined  maximum  depth  or  if  it  is  cer¬ 
tain  that  placement  of  the  distinct  representative  at 
any  point  in  the  square  will  not  produce  a  Hausdorff 
distance  that  is  better  than  the  best  found  so  far.  One 
observation  that  we  can  use  to  terminate  the  depth  of 
recursion  early  is  that  the  Hausdorff  distance  can  be  de¬ 
creased  by  an  amount  of  at  most  x  when  the  pattern  is 
translated  by  a  distance  of  a;.  If  the  value  of  the  Haus¬ 
dorff  distance  produced  by  probing  the  center  of  the 
square  is  so  great  relative  to  the  best  found  so  far  that 
placing  the  distinct  representative  at  any  point  in  the 
square  is  known  to  produce  a  Hausdorff  distance  that 
does  not  beat  the  best  found  so  far,  we  no  longer  need 
to  recursively  search  this  square  and  we  can  terminate 
this  branch  of  the  recursion. 

In  the  approximate  match  algorithm,  we  allow  the 
condition  an  early  exit  to  the  while  loop  if  the  Haus¬ 
dorff  distance  of  a  particular  placement  is  known  to 
be  suboptimal.  In  the  conventional  algorithm,  we  do 
not.  The  Hausdorff  distances  are  used  as  probe  values 
and  large  probe  values  have  a  tendency  to  terminate 
recursion  early.  We  do  not  use  this  break  out  of  the 
while  loop  since  it  would  return  to  the  recursive  algo¬ 
rithm  probe  values  which  were,  in  most  cases,  smaller 
than  those  that  are  accurate.  Such  smaller  probe  val¬ 
ues  inhibit  pruning  and  we  find  that  is  better  to  allow 
the  probing  algorithm  to  run  to  completion  and  return 
accurate  probe  values. 

3.3.2  The  Conventional  Algorithm  for  Trans¬ 
lation  and  Rotation  in  3?^ 

The  conventional  method  for  Translation  and  Rotation 
in  3^2  involves  searching  the  three-dimensional  configu¬ 
ration  space  in  which  the  x  and  y  position  of  the  distinct 
representative  of  the  pattern  comprise  two  of  the  di¬ 
mensions  and  the  angular  position  of  the  pattern  about 
the  distinct  representative,  d,  comprises  the  third. 

We  search  the  configuration  space  defined  by  x,  y,  G 
[0, 1]  and  the  angle  6  G  [-tt,  tt].  We  probe  the  center  of 
this  volume  by  translating  the  pattern  so  that  the  dis¬ 
tinct  representative  resides  at  point  (0.5,  0.5)  and  rotate 
the  pattern  about  the  distinct  representative  so  that  the 
antipode  of  the  pattern  is  at  0  radians  in  relation  to  the 
distinct  representative.  We  then  calculate  the  directed 
Hausdorff  distance  of  the  pattern  in  this  position  and 
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if  it  is  better  than  the  best  found  so  far,  we  record  it 
along  with  the  probe  point,  x,  y,  and  6  that  produced 
it.  We  then  recurse  on  each  of  the  8  octants  until  the 
maximum  depth  of  recursion  is  reached  or  it  is  certain 
that  no  placements  in  the  configuration  space  that  we 
are  currently  searching  will  have  a  directed  Hausdorff 
distance  that  is  better  than  the  global  best  found  so  far, 
in  which  case  this  branch  of  the  recursion  is  terminated. 

3.3.3  Translation  and  Rotation  in 

The  conventional  method  for  Translation  and  Rotation 
in  3?^  is  again  the  search  of  a  configuration  space,  this 
time  a  6-dimensional  space.  One  has  three  degrees  of 
freedom  in  placing  the  distinct  representative  of  the 
pattern  relative  to  the  background,  specifically  the  x, 
y,  and  2  position,  and  one  has  three  degrees  of  freedom 
in  orienting  the  pattern  in  three-space  once  the  distinct 
representative  has  been  pinned.  Two  of  these  degrees 
of  freedom  involve  the  location  of  the  antipode  of  the 
pattern  relative  to  the  distinct  representative,  which 
can  be  thought  of  as  the  <l>  and  6  components  of  the 
spherical  coordinate  of  the  antipode  in  relation  to  the 
distinct  representative.  The  final  degree  of  freedom  is 
the  angular  orientation  of  the  pattern  about  the  axis 
from  the  distinct  representative  and  the  antipode. 

This  conventional  algorithm  is  similar  in  nature  to 
the  two  previously  presented.  The  center  of  the  6-space 
is  probed,  by  placing  and  orienting  the  pattern  accord¬ 
ing  to  the  value  for  each  of  the  6  variables  defining  a 
placement  that  are  contained  in  the  probe  point.  The 
directed  Hausdorff  distance  of  the  pattern  in  this  po¬ 
sition  is  calculated  and  the  algorithm  recurses  on  each 
of  the  2®  =  64  subregions  of  the  configuration  space 
until  either  the  maximum  depth  of  recursion  has  been 
reached  or  until  pruning  occurs. 

3.4  Experiment  1:  Ratio  of  approxi¬ 
mate  to  actual  for  Translation  in  3?^ 

For  the  case  of  translation,  we  have  provided  an  upper 
bound  on  the  directed  Hausdorff  distance  of  the  approx¬ 
imate  match  algorithm  of  twice  that  of  the  directed 
Hausdorff  distance  of  the  optimal  match.  It  is  likely, 
in  practice,  however,  that  the  approximate  match  algo¬ 
rithms  will  produce  matches  with  better  directed  Haus¬ 
dorff  distances  than  this  worst  case.  It  is  our  conjecture 
that  for  large  sparse  B’s  and  large  sparse  P’s,  the  ap¬ 
proximate  match  algorithms  will  produce  matches  that 
are  (1  -t-  A)  *  hopt,  where  A  is  the  ratio  of  the  expected 
distance  by  which  a  point  will  be  perturbed  divided  by 
the  maximum  distance  by  which  a  point  will  be  per¬ 
turbed.  In  particular,  for  our  perturbation  strategy  in 
two  dimensions,  A  should  be  close  to  l/v/^,  since  this 
is  the  expected  distance  the  representative  point  p  is 
from  its  “match  point”  in  B,  and  it  is  likely  that  there 
is  another  point  in  P  that  is  close  to  /lopt  from  its  match 


Figure  1:  Ratios  of  approximation  method  to  con¬ 
ventional  method. 


point  in  a  way  that  it  gets  moved  the  maximum  amount 
possible  when  we  move  p  to  its  match  point. 

We  conducted  an  experiment  to  test  this.  Thirty  sets 
of  background  points  were  generated,  each  having  be¬ 
tween  50  and  500  points.  From  each  background,  a  pat¬ 
tern  of  size  10  was  selected  and  perturbed.  The  pattern 
was  then  matched  to  its  associated  background  using 
both  the  approximate  match  algorithm  and  a  conven¬ 
tional  match  algorithm.  The  ratio  of  the  directed  Haus¬ 
dorff  distance  of  the  approximate  match  algorithm  to 
the  conventional  match  algorithm  is  plotted  in  Figure  1. 
The  average  of  the  ratios  plotted  is  1.69,  which  is  close 
to  the  predicted  value  of  1.71. 

3.5  Experiment  2:  Ratio  of  approxi¬ 
mate  to  actual  for  Translation  in  3?^ 

In  a  manner  similar  to  Experiment  1,  it  could  be  conjec¬ 
tured  that  the  matches  produced  by  the  approximate 
match  algorithm  in  three  dimensions  will  be  close  to 
l/</2,  since  this  is  the  average  distance  by  which  a 
point  is  perturbed  with  our  perturbation  strategy  rela¬ 
tive  the  maximum  distance  with  which  a  point  will  be 
perturbed  with  our  perturbation  strategy. 

Experiment  3  was  conducted  to  test  this.  Forty-five 
sets  of  background  points  were  generated,  each  having 
between  5  and  100  points.  From  each  background,  a 
pattern  of  size  5  was  selected  and  perturbed.  The  pat¬ 
tern  was  then  matched  to  its  associated  background  us¬ 
ing  both  the  approximate  match  algorithm  and  a  con¬ 
ventional  match  algorithm.  The  ratio  of  the  directed 
Hausdorff  distance  of  the  approximate  match  algorithm 
to  the  conventional  match  algorithm  is  plotted  in  Fig¬ 
ure  2.  The  average  of  the  ratios  plotted  is  1.679,  which 
is  close  to  the  predicted  value  of  1.79.  The  fact  that  the 
experimental  average  is  less  than  the  predicted  value 
can  be  attributed  to  the  relatively  small  pattern  size  of 
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Figure  2:  Ratios  of  approximation  method  to  con¬ 
ventional  method  Translation  only  in  3D. 


5  points,  which  decreases  the  probability  that  one  of  the 
pattern  points  will  be  translated  by  a  nearly  maximal 
amount  when  the  distinct  representative  of  the  pattern 
is  translated  to  its  associated  background  point. 

3.6  Experiment  3:  Running  Times 

The  approximate  match  algorithms  have  worst-case 
time  complexities  that  are  much  lower  than  those  of 
the  actual  match  algorithms.  We  conjectured,  however, 
that  they  should  run  faster  in  practice.  This  experi¬ 
ment  examines  the  extent  to  which  they  do,  comparing 
them  to  what  should  be  a  good  practical  algorithm — 
branch-and-bound.  This  experiment  involves  two  cases, 
the  translation-only  case  in  3?^  and  the  translation-and- 
rotation  case  in  3?^.  So  as  to  eliminate  any  system  ef¬ 
fects  on  the  running  time  data,  we  keep  a  counter  of 
floating  point  operations  used  by  each  algorithm  as  op¬ 
posed  to  using  the  Unix  time  command.  The  numbers 
presented  in  the  graphs  are  numbers  obtained  from  this 
counter. 

In  the  translation-only  case  in  thirty  sets  of  back¬ 
ground  points  were  generated,  each  having  between 
50  and  500  points.  From  each  background,  a  pattern 
of  size  10  was  selected  and  perturbed.  The  pattern 
was  then  matched  to  its  associated  background  using 
both  the  approximate  match  algorithm  and  a  conven¬ 
tional  match  algorithm  both  with  and  without  heuris¬ 
tic  speedups.  The  results  of  this  experiment  are  shown 
in  Figure  3.  In  every  case  the  approximate  match  al¬ 
gorithm,  even  without  the  heuristic  speedup,  had  a 
smaller  running  time  than  the  conventional  algorithms. 

In  the  translation-and-rotation  case  in  3?^,  24  sets 
of  background  points  were  generated,  each  having  be¬ 
tween  5  and  25  points.  From  each  background,  a  pat¬ 
tern  of  size  5  was  selected  and  perturbed.  The  pat¬ 
tern  was  then  matched  to  its  associated  background 
using  the  approximate  match  algorithm  with  heuris¬ 
tic  speedups,  a  depth-first  branch-and-bound  algo¬ 
rithm  and  a  breadth-first  branch-and-bound  algorithm. 


Figure  3:  Running  times  of  the  four  methods. 


Figure  4:  Running  times  of  the  two  3-D  methods. 
Line  1  graphs  the  running  times  for  the  conven¬ 
tional  method  and  line  2  graphs  the  running  times 
for  our  method 
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Figure  5:  Running  times  of  the  four  methods. 
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The  running  times  of  the  depth-first  and  breadth-first 
branch-and-bound  algorithms  were,  in  all  instances 
within  a  factor  of  0.01  of  each  other  and  are  therefore 
plotted  as  a  single  line  in  Figure  4  which  depicts  the 
results  of  this  experiment. 

The  branch-and-bound  algorithms  search  a  six  di¬ 
mensional  space  comprised  of  three  degrees  of  freedom 
in  the  translation  of  the  pattern  and  three  degrees  of 
freedom  in  the  rotation  of  the  pattern.  This  produces 
a  rather  large  branching  factor  2®  =  64  in  the  recursive 
algorithms,  and  necessitated  the  depth  of  these  algo¬ 
rithms  to  be  limited  to  3.  With  this  (necessary)  depth 
limitation,  the  approximate-match  algorithm  actually 
found  better  matches  than  the  branch-and-bound  algo¬ 
rithm  did  in  all  of  the  24  cases,  in  spite  of  the  fact  that 
the  branch-and-bound  algorithms  required  on  average 
4479  times  as  many  floating-point  operations.  The  av¬ 
erage  directed  Hausdorff  distance  of  the  match  pro- 
ducted  by  the  approximate-match  algorithm  and  the 
average  directed  Hausdorff  distance  of  the  match  pro¬ 
duced  by  the  branch-and-bound  algorithms  are  given 
in  Figure  5  for  comparison. 

3.7  Experiment  3:  Running  Times  vs. 
Depth  of  Recursion 

The  depth  of  search  of  the  conventional  match  algo¬ 
rithms  that  we  have  implemented  must  be  limited.  This 
experiment  depicts  the  extent  to  which  the  running 
time  of  the  algorithm  increases  as  the  depth  of  recursion 
is  increased.  Further,  it  shows  the  substantial  speedup 
obtained  by  pruning  the  search.  The  results  of  this 
experiment  are  depicted  in  Figure  6. 

Four  backgrounds  were  generated,  each  having  50 
points.  From  each  background,  a  pattern  of  size  10 
was  selected  and  perturbed.  The  conventional  method 
without  pruning  was  run  ten  times  on  the  first  back¬ 
ground,  with  the  depth  of  recursion  being  varied  from  1 
to  10.  The  conventional  method  with  pruning  was  run 
ten  times  on  the  next  three  backgrounds,  and  again 
ten  times  on  each,  with  the  recursion  depth  being  var¬ 
ied  from  1  to  10.  An  average  of  the  running  times  for 
the  three  cases  with  pruning  were  taken  and  the  results 
were  plotted. 

4  Conclusion 

We  have  given  approximate  pattern  matching  algo¬ 
rithms  for  translation,  rotation,  and  Euclidean  trans¬ 
formations  for  two,  three,  and  sometimes  even  higher 
dimensions.  These  matching  algorithms  are  guaran¬ 
teed  to  give  a  match  with  a  directed  Hausdorff  dis¬ 
tance  that  is  no  greater  than  a  small  constant  times 
the  best  achievable  directed  Hausdorff  distance.  In  ad¬ 
dition,  they  have  a  time  complexity  that  is  substan¬ 
tially  smaller  that  those  of  existing  pattern  matching 


Figure  6:  Running  times  with  and  without  prun¬ 
ing. 


algorithms,  they  are  easy  to  implement,  and  they  run 
fast  in  practice,  as  predicted. 
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Abstract 

We  devise  techniques  to  manipulate  a  collection  of 
loosely  interpenetrating  spheres  in  three-dimensional 
space.  Our  study  is  motivated  by  the  representation 
and  manipulation  of  molecular  configurations,  mod¬ 
eled  by  a  collection  of  spheres.  We  analyze  the  sphere 
model  and  point  to  its  favorable  properties  that  make 
it  more  easy  to  manipulate  than  an  arbitrary  collec¬ 
tion  of  spheres.  For  this  special  sphere  model  we 
present  efficient  algorithms  for  computing  its  union 
boundary  and  for  hidden  surface  removal.  The  effi¬ 
ciency  and  practicality  of  our  approach  are  demon¬ 
strated  by  experiments  on  actual  protein  data. 


1  Introduction 

In  this  paper  we  devise  techniques  to  represent  and 
manipulate  a  collection  of  overlapping  spheres  in 
three-dimensional  space.  Often,  manipulating  three- 
dimensional  geometric  objects  that  intersect  is  time 
consuming.  However,  by  imposing  several  constraints 
on  the  spheres  and  their  interaction,  we  obtain  a  set¬ 
ting  where  efficient  and  practical  techniques  are  quite 
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easy  to  obtain.  Our  study  is  motivated  by  the  rep¬ 
resentation  and  manipulation  of  molecular  configura¬ 
tions,  modeled  by  a  collection  of  spheres. 

A  common  approach  to  representing  the  three- 
dimensional  geometric  structure  of  a  molecule,  is  to 
represent  each  of  its  atoms  by  a  “hard”  sphere.  It 
is  also  common  to  assume  that  the  nuclear  arrange¬ 
ment,  i.e.,  the  relative  displacement  of  the  spheres, 
is  fixed  (it  is  often  the  so-called  equilibrium  nuclear 
configuration).  There  are  recommended  values  for 
the  radius  of  each  atom  sphere  and  for  the  distance 
between  the  centers  of  every  pair  of  spheres.  In  this 
model,  the  spheres  are  allowed  to  interpenetrate  one 
another,  therefore  it  is  sometimes  referred  to  as  the 
“fused  spheres”  model  (see,  e.g.,  Figure  1).  The  enve¬ 
lope  surface  of  the  fused  spheres  may  be  regarded  as 
a  formal  molecular  surface.  It  is  evident,  that  various 
properties  of  molecules  are  disregarded  in  this  simple 
model.  However,  in  spite  of  its  approximate  nature,  it 
has  proven  useful  in  many  practical  applications.  For 
more  background  material  and  references,  see,  e.g., 
the  survey  paper  by  Mezey  [16]. 


Figure  1:  Molecule  1:  A  small  molecule  surrounded 
by  water  with  a  total  of  42  atoms. 
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We  study  the  hard  sphere  model  from  a  compu¬ 
tational  geometry  point  of  view,  that  is,  we  study 
the  combinatorial  and  algorithmic  behavior  of  a  col¬ 
lection  of  n  (possible  intersecting)  spheres  in  3-space, 
having  some  special  properties.  We  make  several  sim¬ 
ple  observation,  showing  that,  because  of  these  spe¬ 
cial  properties,  the  spheres  in  this  model  can  be  effi¬ 
ciently  manipulated.  For  example,  we  show  that  the 
maximum  combinatorial  complexity  of  the  boundary 
of  the  union  of  the  fused  spheres  is  0{n),  whereas  for 
an  arbitrary  collection  of  spheres  (even  unit  spheres) 
this  may  be  0(n^)  (see,  e.g.,  [13]).  These  results  are 
described  in  Section  2. 


Figure  2:  Molecule  2:  A  small  protein  with  100 
atoms. 

In  Section  3  we  take  advantage  of  the  favorable  be¬ 
havior  of  the  model,  to  devise  a  data  structure  for 
answering  intersection  queries  of  the  form:  Given  a 
hard  sphere  model  M  of  a  molecule  and  a  query  atom 
sphere  Q  (both  given  in  a  fixed  placement  in  3-space), 
report  which  spheres  of  M  are  intersected  by  Q.  In 
Section  4  we  use  this  data  structure  to  efficiently  com¬ 
pute  the  boundary  of  the  union  of  the  spheres.  We 
also  show  that  this  algorithm  can  be  used  to  compute 
a  related  type  of  molecular  surfaces,  so-called  solvent 
accessible  surfaces.  Finally,  in  Section  5,  we  present 
efficient  algorithms  for  hidden  surface  removal,  either 
by  computing  a  depth  order  or  by  computing  the  vis¬ 
ibility  map  of  a  molecule. 

Our  work  is  closely  related  to  the  study  of  fatness. 
It  has  been  shown  that  certain  problems  in  computa¬ 
tional  geometry  can  be  solved  much  more  efficiently 
when  the  objects  involved  have  no  long  and  thin  parts 
(see,  e.g.,  [15,  17,  20]).  Clearly,  spheres  are  fat.  In 
3-space  though  fatness  is  not  enough  to  guarantee  ef¬ 
ficient  algorithms  when  the  objects  are  allowed  to  in¬ 
tersect.  Fortunately,  the  extra  properties  in  the  hard 
sphere  model  provide  enough  additional  constraints 


Figure  3:  Molecule  3:  A  piece  of  DNA  with  757 
atoms. 

on  the  objects  to  allow  for  efficient  algorithms.  It 
exemplifies,  yet  another  time,  that  in  practical  situa¬ 
tions  a  collection  of  geometric  objects  having  certain 
additional  properties  (obtained  from  observing  real- 
life  situations),  behaves  favorably. 


Figure  4:  Molecule  4:  A  larger  protein  with  971 
atoms. 

We  support  our  claim  for  efficiency  and  practical¬ 
ity  of  the  data  structure  and  the  algorithms  that  we 
present  by  reporting  results  of  experiments  that  were 
carried  out  on  actual  protein  data  retrieved  from  the 
“Protein  Data  Bank”  at  Brookhaven  National  Lab¬ 
oratory  [1,  2].  In  Figures  1  till  4  you  find  pictures 
of  some  of  the  molecules  that  we  used.  All  of  these 
pictures  were  created  using  the  algorithms  described 
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in  Section  5. 


2  The  Hard  Sphere  Model 

Overlapping  spheres  in  three-dimensional  space  may 
in  general  be  rather  unwieldy  objects.  The  arrange¬ 
ment  (i.e.,  the  subdivision  of  space)  defined  by  n 
spheres  in  3-space  may  have  complexity  0(n®)  in  the 
worst  case  and  their  union  boundary  may  have  com¬ 
plexity  0(n^).  Moreover,  efficient  algorithms  for  ma¬ 
nipulating  a  collection  of  overlapping  spheres  are  of¬ 
ten  complex  and  rather  time  consuming.  However, 
the  atom  spheres  in  the  model  of  a  molecule  have 
properties  that  we  can  exploit  to  obtain  efficient  and 
simple  algorithms  for  manipulating  them.  One  such 
property  is  that,  although  two  spheres  may  interpen¬ 
etrate,  their  centers  cannot  get  too  close  to  one  an¬ 
other.  The  other  useful  property  is  that  their  radii 
range  in  a  fairly  restricted  range;  see,  e.g.,  [11,  p.  J-3] 
for  a  list  of  van  der  Waals  radii,  which  are  one  type 
of  acceptable  radii.  Actually,  each  molecular  mod¬ 
elling  package  seems  to  use  its  own  slightly  different 
set  of  radii.  The  following  table  shows  the  list  the 
radii  (in  Angstrpm)  of  the  spheres  we  use  to  repre¬ 
sent  the  main  types  of  atoms.  (Taken  from  an  actual 
molecular  modelling  package.) 


c 

Cal 

H 

N 

0 

P 

S 

1.70 

2.07 

0.90 

1.55 

1.52 

1.80 

1.80 

In  Theorem  2.1  below,  we  state  the  conditions 
that  make  the  sphere  model  of  a  molecule  favor¬ 
able.  (Similar  observations  were  recently,  indepen¬ 
dently  made  by  Varshney  and  Brooks  [21].)  From 
this  point  on,  we  will  distinguish  between  a  ball 
and  a  sphere.  Each  atom  with  radius  r,  and  cen¬ 
ter  at  Cj  induces  a  ball  Bi  =  {p\d{p,Ci)  <  and  a 
sphere  Si  =  {p|d(p, q)  =  n},  where  d{pi,p2)  is  the 
Euclidean  distance  between  the  points  pi  and  p2  in 
3-space. 

Theorem  2.1  Let  M  =  {Hi, . . . ,  jB„}  be  a  collec¬ 
tion  of  n  balls  in  3-space  with  radii  ri,...,r„  and 
centers  at  ci,...,c„.  Let  Tmin  =  mini ri  o,nd  let 
r^nax  =  maxi  ri.  Also  let  S  =  {Fi, . . . ,  5„}  be  the  col¬ 
lection  of  spheres  such  that  Si  is  the  boundary  surface 
of  Bi-  If  there  are  positive  constants  k,p  such  that 
^niax  <  and  for  each  Bi  the  ball  with  radius  p  ■  ri 
anT concentric  with  Bi  does  not  contain  the  center  of 
any  other  ball  in  M  (besides  Ci),  then: 

(i)  For  each  Bi  €  M,  the  maximum  number  of  balls 
in  M  that  intersect  it  is  bounded  by  a  constant. 

(ii)  The  maximum  combinatorial  complexity  of  the 
boundary  of  the  union  of  the  balls  in  M  is  0{n). 


Proof.  Part  (i);  Consider  a  ball  Bi.  Let  B  be  the 
ball  of  radius  n  +  2rmax,  centered  at  Ci.  Clearly,  any 
ball  in  M  that  intersects  Bi  must  lie  completely  in¬ 
side  B.  For  each  Bj  that  lies  completely  inside  B,  let 
Pj  denote  the  ball  of  radius  |  •  rmin  centered  at  Cj . 
We  claim  that  the  Pj's  are  pairwise  interior-disjoint. 
Suppose  the  opposite,  namely,  that  there  are  two  such 
balls  /3jj  and  whose  interiors  intersect.  This  im¬ 
plies  that  Cjj  and  cj^  are  less  than  p  ■  rmin  apart. 
This,  in  turn,  means  that  the  ball  with  radius  p  ■ 
and  center  at  contains  the  center  of  the  ball  B^^  i 
contradicting  the  assumptions  of  the  theorem. 

Hence,  by  volume  consideration,  the  total  number 
of  balls  that  are  completely  contained  in  B  cannot 
exceed 


(^i  T  2fmax)^  ^  (drmax) 
(e.  .r  .  13  -  (e.  .  -iS 

1, 2  '  niin  J  ^2  niiny 


<  216- (-)A 

p 


Part  (ii)  follows  from  Part  (i)  because  Part  (i)  im¬ 
plies  that  the  number  of  features  involving  B^  on  the 
union  boundary  is  bounded  by  a  constant.  Indeed, 
if  the  balls  are  in  general  position  then  a  feature  in¬ 
volving  a  ball  Bi  is  either  a  face  on  the  boundary  of 
Bi,  or  an  intersection  arc  of  Bi  with  another  ball,  or 
an  intersection  point  of  Bi  with  two  other  balls.  All 
these  features  belong  to  the  two-dimensional  subdivi¬ 
sion  (or  arrangement)  formed  on  the  boundary  of  Bi 
by  a  constant  number  of  circles,  each  being  the  inter¬ 
section  of  Bi  with  another  ball.  The  complexity  of 
this  arrangement  is  evidently  bounded  by  a  constant. 
Thus,  the  overall  complexity  of  the  union  boundary 
is  0(n).  n 


The  following  table  gives  the  values  oik,  p  and  the 
maximal  and  average  number  of  balls  intersecting  a 
single  ball  for  our  four  example  molecules  (Figures  1 
to  4). 


mol. 

k 

P 

max 

aver. 

1 

2.30 

0.64 

14 

5.0 

2 

1.18 

0.71 

12 

6.4 

3 

2.00 

0.63 

15 

7.4 

4 

1.18 

0.71 

12 

6.3 

As  can  be  seen  k  is  small  and  p  is  large  as  required, 
resulting  in  a  small  number  of  intersections  per  ball 
(being  much  smaller  than  the  worst-case  bound  of 
216{k/p)^). 

We  now  turn  to  discuss  the  subdivision  of  three- 
dimensional  space  induced  by  a  collection  of  spheres; 
the  arrangement  of  the  spheres.  The  study  of  ar¬ 
rangements  plays  a  fundamental  role  in  geomet¬ 
ric  computing.  We  denote  the  arrangement  of  the 
spheres  in  M  by  A{M),  that  is,  A{M)  is  the  subdivi¬ 
sion  of  3-space  into  cells  of  dimensions  0, 1,2  and  3, 
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induced  by  the  spheres  in  M  (see,  e.g.,  [4,  pp.  144- 
150]).  The  combinatorial  complexity  of  the  arrange¬ 
ment  A{M)  is  defined  to  be  the  number  of  cells  of 
various  dimensions  in  this  subdivision.  For  arbitrary 
sets  of  spheres  the  complexity  can  be  0(n^).  The  fol¬ 
lowing  is  an  immediate  corollary  of  Theorem  2.1  (i): 

Corollary  2.2  The  complexity  of  the  arrangement 
A{M)  of  the  spheres  in  M  as  defined  in  Theorem  2.1 
is  0(n). 

For  most  algorithmic  uses,  however,  a  raw  arrange¬ 
ment  is  an  unwieldy  structure.  The  difficulty  is  that 
many  cells  in  an  arrangement  can  have  very  complex 
topologies,  and  thus  navigating  around  them  is  diffi¬ 
cult.  What  we  often  want  is  a  further  refinement  of 
the  cells  of  an  arrangement  into  pieces  that  are  each 
homeomorphic  to  a  ball  and  have  constant  descrip¬ 
tion  complexity. 

A  prevailing  and  general  technique  for  decompos¬ 
ing  arrangements  is  the  vertical  decomposition  (see, 
e.g.,  [3]),  defined  as  follows.  For  every  sphere  Si, 
we  call  the  curve  of  intersection  of  Si  and  the  hor¬ 
izontal  plane  through  the  center  of  Si  the  equator 
of  the  sphere.  Let  E  be  the  collection  of  curves  on 
the  spheres  in  M  consisting  of  intersection  curves  be¬ 
tween  any  pair  of  spheres  and  the  equators  of  the 
spheres.  Let  7  be  a  curve  in  E.  We  extend  a  vertical 
segment  upward  and  downwards  from  every  point  of 
7  until  it  hits  a  sphere  in  M  or  extends  to  infinity.  We 
repeat  this  process  for  every  curve  in  E.  As  a  result 
we  get  a  collection  of  vertical  walls  that  together  with 
the  spheres  in  M  subdivide  3-space  into  a;2/-monotone 
3D  cells.  We  then  project  each  cell  onto  the  a;r/-plane, 
and  extend  vertical  segments  (with  respect  to  the  y- 
direction)  from  every  vertex  of  the  projection  and 
from  every  a;-extreme  point  in  the  projection  of  a  cell, 
such  that  the  segments  are  maximal  and  contained  in¬ 
side  the  projection  of  the  cell.  We  then  extend  each 
of  these  segments  into  a  vertical  wall  contained  inside 
the  original  3D  cell.  For  details  and  illustrations  of 
vertical  decompositions  for  arrangements  of  spheres 
in  3-space,  see  [4]. 

Clarkson  et  al.  [4]  show  that  the  complexity  of  the 
vertical  decomposition  of  an  arrangement  of  spheres 
is  dominated  by  the  complexity  of  the  vertical  walls 
erected  from  curves  in  E.  They  show  a  slightly  super- 
cubic  upper  bound  on  the  complexity  of  the  vertical 
decomposition  of  an  arbitrary  collection  of  n  spheres 
in  3-space.  In  the  next  theorem  we  show  another 
favorable  property  of  the  collection  of  spheres  that 
we  study: 

Theorem  2.3  The  complexity  of  the  vertical  decom¬ 
position  of  the  arrangement  A{M)  for  a  collection  of 
spheres  M  as  defined  in  Theorem  2.1  is  0{n'^). 


Proof.  We  distinguish  between  two  portions  of  3- 
space:  inside  the  union  of  the  balls  (i.e.,  the  balls 
corresponding  to  the  spheres  in  M),  and  outside  the 
union  of  the  balls.  The  complexity  of  the  vertical 
decomposition  of  A{M)  inside  the  union  of  the  balls 
is  evidently  0(n),  by  Theorem  2.1  (i).  Hence,  from 
this  point  on  we  bound  the  complexity  of  the  verti¬ 
cal  walls  that  lie  completely  outside  the  union  of  the 
balls.  These  walls  are  erected  from  curves  that  lie  on 
the  union  boundary.  Recall  that  the  equators  of  the 
spheres  are  also  considered  curves  in  the  arrangement 
A{M). 

Fix  an  edge  7  of  A{M)  that  lies  on  the  union 
boundary,  and  let  be  the  vertical  surface  which 
is  the  union  of  vertical  lines  through  points  on  7.  Let 
F  be  the  collection  of  faces  of  A{M)  that  lie  on  the 
union  boundary,  where  a  face  is  a  maximal  portion  of 
a  sphere  in  M  that  does  not  meet  any  other  sphere 
or  the  equator  of  that  sphere,  and  is  not  contained  in 
any  other  sphere.  By  Theorem  2.1  (parts  (i)  and  (ii)) 
F  consists  of  0{n)  faces  (or  surface  patches),  each 
bounded  by  a  small  number  (bounded  by  a  constant) 
of  low-degree  algebraic  curves. 

For  every  face  /  in  F,  let  /'  be  f  D  f  is  a 
collection  of  a  small  number  of  low  degree  algebraic 
curves  on  By  standard  arguments,  the  complex¬ 
ity  of  the  vertical  wall  extended  from  7  and  that  lies 
outside  the  union,  is  determined  by  the  complexity 
of  the  lower  envelope  (or  upper  envelope,  or  both) 
defined  by  the  collection  F'  =  {/']/  G  F}  with  re¬ 
gard  to  the  curve  7.  Since  the  curves  F'  are  either 
pairwise  disjoint,  or  a  pair  of  curves  in  F'  meet  at  an 
endpoint  of  at  least  one  of  the  curves,  and  they  do  not 
intersect  otherwise,  the  complexity  of  the  envelope  is 
0{n).  We  repeat  this  argument  for  every  edge  7  on 
the  union  boundary.  There  are  0{n)  such  edges,  and 
the  upper  bound  follows.  □ 

The  bound  is  sharp.  One  can  easily  construct  a  set 
of  (even  non-intersecting)  unit  spheres  that  achieve 
0(n^)  complexity.  It  is  though  possible  to  construct 
an  0{n)  decomposition  of  the  arrangement.  Consider 
a  three-dimensional  grid  whose  unit  size  (along  each 
of  the  coordinate  axes)  is  2  x  Tmax-  The  grid  induces 
a  partitioning  of  3-space  into  axis  parallel  unit  cubes 
(those  cubes  whose  vertices  are  grid  points  and  that 
do  not  contain  any  grid  point  in  their  interior).  By 
applying  the  same  type  of  arguments  as  in  the  proof 
of  Theorem  2.1,  one  can  show  that  every  cube  in  this 
partitioning  intersects  at  most  some  fixed  number  of 
spheres.  Moreover,  each  sphere  intersects  at  most  8 
cubes.  Now  take  the  collection  of  0(n)  non-empty 
cubes.  It  is  easy  to  see  that  the  vertical  decomposi¬ 
tion  of  this  set  has  linear  complexity.  For  each  of  the 
cubes  consider  the  arrangement  of  spheres  that  inter- 
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sect  it,  restricted  to  the  cube  and  construct  its  verti¬ 
cal  decomposition  (again  restricted  to  the  cube).  This 
will  give  a  decomposition  of  the  cube  in  a  bounded 
number  of  simple  pieces.  As  there  are  0{n)  cubes  to 
decompose  the  total  complexity  will  be  linear.  It  is 
easy  to  see  that  the  number  of  adjacencies  between 
pieces  is  linear  as  well,  which  is  an  important  prop¬ 
erty  in  certain  applications  (like  motion  planning). 

Theorem  2.4  For  a  collection  of  spheres  M  as  de¬ 
fined  in  Theorem  2.1  there  exists  a  decomposition  of 
the  arrangement  A{M)  into  simple  pieces  of  total 
complexity  0{n). 

3  Intersection  Queries 

Given  a  molecule  it  is  required  that  we  are  able  to 
answer  certain  queries  efficiently.  In  particular  inter¬ 
section  queries  play  an  important  role.  For  example, 
in  computer  aided  drug  design,  one  wishes  to  manip¬ 
ulate  a  molecule  in  the  presence  of  another  molecule 
to  see  whether  they  fit  together.  During  such  ma¬ 
nipulation  the  molecules  should  not  intersect.  Hence, 
with  every  change  in  the  manipulated  molecule  an 
intersection  query  with  the  other  molecule  must  be 
performed.  Also,  it  is  often  required  to  know  whether 
particular  positions  in  3-space  lie  inside  or  outside  the 
molecule,  i.e.,  whether  a  point  intersects  the  molecule 
or  not. 

In  this  section  we  will  devise  a  data  structure  that 
can  be  used  to  answer  intersection  queries  with  ei¬ 
ther  a  point  or  with  a  ball  whose  radius  is  bounded 
by  T-max-  The  query  should  report  all  atom  balls  that 
contain  the  point  or  intersect  the  query  ball.  We  will 
use  this  structure  in  the  next  section  to  compute  the 
molecule  boundary.  Clearly  a  point  is  a  degenerated 
ball.  Hence,  a  data  structure  that  can  answer  queries 
for  balls  can  also  answer  point  queries.  Below  we  only 
describe  a  structure  for  query  balls.  When  imple¬ 
menting  the  structure  though  a  distinction  is  made 
because  point  queries  can  be  answered  faster  (i.e., 
with  smaller  constants  in  the  time  bounds). 

Our  first  approach  to  solving  this  problem  trans¬ 
forms  it  into  an  orthogonal  range  query  problem.  Let 
M  be  the  set  of  n  balls  (as  defined  in  Theorem  2.1)  to 
store  and  let  C  be  the  set  of  centers  of  the  balls.  We 
store  the  set  C  in  a  three-dimensional  range  tree  that 
can  answer  the  following  type  of  query:  Given  a  query 
axis-parallel  box,  report  the  points  of  C  contained  in 
the  box.  To  report  all  the  balls  of  M  intersecting  a 
query  ball  Q  with  radius  rq  we  construct  a  ball  Q' 
with  radius  rq  -t-rmax  and  concentric  with  Q.  Clearly, 
the  center  of  a  ball  of  M  intersecting  Q  must  lie  inside 
Q' .  Next,  let  Q'  denote  the  axis-parallel  bounding 
box  of  Q' .  We  query  the  range  search  structure  on 


C  with  the  box  Q' .  Using  the  same  technique  as  in 
the  proof  of  Theorem  2.1(i),  it  can  be  shown  that  the 
number  of  answers  is  bounded  by  a  constant.  For  each 
of  these  answers  we  check  whether  the  corresponding 
ball  actually  intersects  Q.  This  takes  0(1)  time.  Us¬ 
ing  a  standard  range  tree  (see,  e.g.,  [18,  Section  2.3.4]) 
we  obtain  a  solution  that  requires  0{n  log^  n)  prepro¬ 
cessing  time  and  space  and  answers  queries  in  time 
O(log^n). 

However,  due  to  the  special  properties  of  the  col¬ 
lection  of  spheres  that  we  study,  they  admit  a  simpler 
and  more  efficient  data  structure.  Like  in  the  previous 
section,  we  subdivide  space  into  cubes  of  size  2  x  fmax- 
For  each  ball  in  M  we  compute  the  grid  cubes  that  it 
intersects.  Let  C  be  the  set  of  non-empty  grid  cubes. 
The  size  of  C  is  bounded  by  0{n).  We  arrange  the 
cubes  of  C  in  a  balanced  binary  search  tree,  ordered 
by  the  lexicographic  order  of  the  {x,y,z)  coordinates 
of  the  say,  bottom-left-front  vertex  of  the  cube.  With 
each  non-empty  cube  we  store  the  list  of  (at  most  a 
constant  number  of)  balls  of  M  that  intersect  it. 

Given  a  query  ball  Q,  we  compute  all  the  (at  most 
8)  grid  cubes  it  intersects,  and  search  for  each  of  these 
cubes  in  the  binary  tree,  to  see  whether  it  is  non¬ 
empty.  If  it  exists  we  check  the  balls  stored  for  inter¬ 
section  with  Q.  We  might  find  some  balls  more  than 
once  but  duplicates  can  easily  be  removed.  The  total 
number  of  balls  tested  will  be  0(1).  This  leads  to  the 
following  result: 

Theorem  3.1  Given  a  collection  of  n  balls  M  as  in 
Theorem  2.1,  one  can  construct  a  data  structure  us¬ 
ing  0{n)  space  and  O(nlogn)  preprocessing  time,  to 
answer  intersection  queries  for  balls  whose  radii  are 
not  greater  than  fmax;  ^  O(logn). 

The  binary  search  tree  in  the  above  approach  is 
only  used  to  locate  the  different  cubes.  Hence,  we 
can  easily  replace  it  by  a  hashing  structure.  Using 
the  perfect  hashing  results  in  [7]  this  leads  to: 

Theorem  3.2  Given  a  collection  M  of  n  balls  as  de¬ 
fined  in  Theorem  2.1,  one  can  construct  a  data  struc¬ 
ture  using  0{n)  space,  to  answer  intersection  queries 
for  balls  whose  radii  are  not  greater  than  rmax.  in 
0(1)  time.  The  randomized  preprocessing  time  of  the 
structure  is  0{n). 

Recently,  Fjallstrom  and  Petersson  [8]  have  car¬ 
ried  out  a  comparative  study  of  the  behavior  in  prac¬ 
tice  of  various  three-dimensional  range  search  struc¬ 
tures,  motivated  by  simulation  of  deformation  pro¬ 
cesses.  An  interesting  result  of  their  study  is  that 
range  trees  do  not  perform  well  in  practice  (for  the 
special  test  data  that  they  experiment  with),  as  com¬ 
pared  to  other  methods  that  they  examined.  In  par- 
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ticular  the  best  performing  method  in  their  study  is  a 
grid-like  method,  similar  to  the  one  mentioned  above. 

We  implemented  this  data  structure  although  we 
used  double  hashing  rather  than  perfect  hashing. 
The  following  table  shows,  for  each  of  the  four  test 
molecules:  their  size,  the  building  time,  the  time  to 
answer  1000  point  intersection  tests  and  the  time  to 
answer  1000  intersection  tests  for  a  ball  with  radius 
^ max  •  All  running  times  are  in  seconds,  on  an  Indigo 
R3000  workstation  (25.4  specmarks). 


mol. 

n 

build 

point  query 

ball  query 

1 

42 

0.01 

0.14 

0.20 

2 

100 

0.01 

0.13 

0.22 

3 

757 

0.08 

0.13 

0.19 

4 

971 

0.12 

0.14 

0.22 

As  can  be  seen  the  query  time  seems  independent 
of  the  size  of  the  molecule.  Also,  the  query  time 
bounds  are  very  low  allowing,  e.g.,  for  testing  whether 
a  molecule  with  100  atoms  intersects  a  molecule  with 
1000  atoms  in  0.02  seconds,  i.e.,  fast  enough  for  in¬ 
teractive  use  while  manipulating  the  molecule. 

4  Computing  the  Boundary 

We  now  use  the  data  structure  of  the  previous  section 
to  efficiently  compute  the  outer  cell  of  the  boundary 
of  the  union  of  the  spheres.  We  assume  that  the  union 
consists  of  one  connected  component,  which  is  often 
the  case  in  our  application  setting.  If  the  union  is 
composed  of  several  pairwise-disjoint  connected  com¬ 
ponents,  then  additional  measures  must  be  taken;  see 
below  for  more  details. 

The  boundary  of  the  outer  cell  can  be  viewed  as 
a  formal  molecular  surface,  which  is  of  interest  in 
various  applications  in  molecular  biology.  Moreover, 
we  will  show  that  the  same  algorithm,  operating  on 
a  closely  related  set  of  balls,  solves  another  prob¬ 
lem  in  computational  biology,  sometimes  referred  to 
as  computing  the  approximate  solvent  accessible  sur¬ 
face  (see,  e.g.,  [14,  5,  16]).  This  involves  the  inter¬ 
action  of  solute  and  solvent  molecules  in  a  solution. 
In  a  simplified  model,  the  question  may  be  formu¬ 
lated  as  follows:  Which  parts  of  a  molecular  sur¬ 
face  of  a  solute  molecule  are  accessible  to  the  solvent 
molecules,  where  the  latter  are  modeled  by  spheres  (a 
single  sphere  each).  We  will  assume  that  the  solute 
molecules  are  modeled  by  the  hard  sphere  model. 

An  approach  to  solving  this  problem  is  proposed  in 
[14]:  Roll  a  sphere  representing  the  solvent  molecule 
on  a  reference  surface,  to  obtain  a  new  surface  de¬ 
scribed  by  the  center  of  the  rolling  sphere.  The  refer¬ 
ence  surface,  in  our  case,  is  the  boundary  of  the  union 
of  the  balls  in  the  hard  sphere  model.  In  most  cases, 


the  solvent  “sphere”  is  assumed  to  be  fairly  small. 
We  denote  the  solvent  molecule  sphere  by  R. 

We  can  rephrase  the  above  problem  in  terms  of 
motion  planning  in  robotics:  Let  i?  be  a  spherical 
robot  moving  in  3-space  among  spherical  obstacles 
(the  balls  in  M).  Describe  the  obstacles  in  a  config¬ 
uration  space  where  every  point  represents  a  possi¬ 
ble  placement  of  R  by  the  position  of  the  center  of 
R.  In  this  formulation,  the  solvent  accessible  surface 
becomes  the  boundary  of  the  outer  cell  of  the  free 
portion  of  the  configuration  space.  Unlike  prevailing 
formulations  of  motion-planning  problems,  the  “ob¬ 
stacles”  in  our  case  may  intersect.  However,  as  ex¬ 
pressed  in  Theorem  2.1,  these  obstacles  have  other, 
favorable,  properties. 

To  compute  the  boundary  of  the  outer  cell  of  the 
free  configuration  space,  we  follow  a  common  practice 
in  motion  planning,  and  compute  the  Minkowski  (vec¬ 
tor)  difference^  of  each  obstacle  and  the  robot,  to  ob¬ 
tain  the  configuration  obstacles.  Now,  the  Minkowski 
difference  of  two  spheres  is  again  a  (larger)  sphere. 
Hence,  our  goal  is  actually  to  compute  the  outer  com¬ 
ponent  of  the  union  boundary  for  a  collection  of  balls 
in  3-space. 

The  solute  molecule  is  modeled  by  balls  as  in  The¬ 
orem  2.1.  The  radius  of  each  ball  in  M  is  increased 
by  r' — the  radius  of  the  solvent  sphere.  We  assume 
r'  to  be  of  the  same  order  of  magnitude  as  Cmax- 
Note  that,  the  expanded  balls  obey  the  conditions 
of  Theorem  2.1  for  different  constants  r' r'^^,  k’ 
and  p' .  Therefore,  an  algorithm  for  computing  the 
union  boundary  for  a  collection  M  of  balls  as  in  The¬ 
orem  2.1,  is  also  applicable  to  computing  solvent  ac¬ 
cessible  surfaces.  Thus,  we  present  an  algorithm  for 
computing  the  union  boundary  of  a  collection  of  balls 
M  as  defined  in  Theorem  2.1. 

The  scheme  of  the  algorithm  is  to  compute,  for  each 
Bi  e  M,  the  contribution  of  its  boundary  to  the  union 
boundary  and  then  to  combine  all  this  information  to 
give  the  final  output  of  the  algorithm.  The  algorithm 
consists  of  four  steps: 

1.  For  each  ball  identify  the  other  balls  intersecting 
it. 

2.  For  each  ball  compute  its  (potentially  null)  con¬ 
tribution  to  the  union  boundary. 

3.  Transform  the  local  information  into  global 
structures  describing  the  connected  components 
of  the  union  boundary. 

4.  Filter  out  portions  of  the  union  boundary  that  do 
not  belong  to  the  desired  connected  component 
of  the  “outer  cell” . 

'■The  Minko-wski  difference  of  two  spatial  sets  A  and  B, 

A  —  B,  is  the  set  {p  —  q\p  d  A,g  e  B). 
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If  all  we  need  is  a  list  of  the  faces  of  the  union 
boundary  (as  is  the  case  in  the  preparatory  stage  of 
the  hidden  surface  removal  algorithm  of  Section  5) 
then  we  only  need  Steps  1  and  2. 

For  Step  1,  we  use  the  data  structure  described  in 
Theorem  3.2.  The  cost  of  the  query  for  each  ball  is 
0(1),  hence  the  total  cost  of  this  step  is  0(n)  ran¬ 
domized  time  (or  O(nlogn)  deterministic). 

As  for  Step  2,  consider  a  ball  Bi  and  the  family 
of  (a  constant  number  of)  balls  intersecting  it.  Let 
Bj  be  a  ball  intersecting  Bi.  If  Bj  fully  contains  Bi, 
then  we  stop  the  process  for  Bi,  as  it  cannot  con¬ 
tribute  in  any  way  to  the  union  boundary.  If  Bj  is 
fully  contained  in  Bi,  then  we  ignore  Bj,  for  obvi¬ 
ous  reasons.  Otherwise,  we  compute  the  intersection 
between  the  spheres  Si  and  Sj,  which  is  a  circle  Cij 
on  Si.  The  circle  partitions  Si  into  two  parts: 
One  part  may  appear  on  the  union  boundary,  and  we 
will  refer  to  it  as  the  free  part  of  S^  with  respect  to 
Sj,  and  the  other  is  completely  contained  in  Sj  and 
therefore  cannot  appear  on  the  union  boundary.  We 
repeat  the  process  for  each  ball  intersecting  Bi,  to  get 
a  collection  of  circles  on  Si.  These  circles  form  a  2D 
arrangement  Ai  on  Si.  A  face  of  Ai  belongs  to  the 
union  boundary  if  and  only  if  it  belongs  to  the  free 
portion  defined  by  each  circle  Cij.  Since  the  number 
of  circles  on  Si  defined  in  that  way  is  constant,  the  ar¬ 
rangement  can  be  computed  by  a  brute  force  method 
in  constant  time,  together  with  the  attribute  whether 
a  face  is  free  or  not  (a  free  face  is  guaranteed  to  ap¬ 
pear  on  the  union  boundary).  The  complexity  of  the 
arrangement  Ai  is  evidently  constant.  For  each  ball 
in  M,  the  above  procedure  will  take  constant  time. 

In  Step  3  we  will  represent  each  connected  compo¬ 
nent  of  the  union  boundary  by  a  quad-edge  structure 
[10].  To  this  end  we  have  to  augment  the  arrange¬ 
ments  Ai  slightly.  If  Ai  is  the  whole  sphere  Si  we 
split  it  into  two  parts  with  some  circle.  Next,  if  a 
boundary  component  of  Ai  is  a  simple  circle  C  we 
split  C  into  two  arcs  adding  two  vertices.  (If  C  lies 
in  both  Ai  and  Aj  the  same  vertices  should  be  added 
in  both  arrangements.)  Finally,  if  a  free  face  of  Ai 
contains  holes  we  split  it  by  adding  extra  arcs.  (To 
make  these  additions  canonical,  we  can  fix  a  direction 
d,  and  add  all  the  extra  arcs  along  great  circles  that 
are  intersection  of  the  sphere  with  planes  parallel  to 
the  direction  d.)  After  this  step,  which  can  easily 
be  performed  in  time  0{n),  the  union  boundary  will 
consist  of  simple  faces  where  each  face  is  bounded  by 
at  least  two  edges  and  each  edge  bounds  exactly  two 
faces  (assuming  general  position). 

Now  we  proceed  as  follows:  We  take  some  free  face 
/  of  some  Ai,  that  was  not  treated  before,  and  de¬ 
termine  its  boundary.  In  this  way  we  obtain  for  each 
edge  e  bounding  /  pointers  to  the  previous  and  next 


edge  along  /,  as  required  for  the  quad-edge  structure. 
Also  for  each  edge  e  bounding  /  we  determine  the  ar¬ 
rangement  Aj  containing  the  face  /'  on  the  opposite 
side  of  /  {j  can  be  i  because  of  the  extra  arcs  we 
added).  /'  can  be  found  in  time  0(1)  because  only 
a  constant  number  of  spheres  intersect  Si.  We  locate 
e  in  Aj ,  add  pointers  between  the  copies  of  e  and  re¬ 
cursively  treat  /'  in  Aj  if  it  was  not  visited  before.  In 
this  way  we  continue  until  we  have  located  the  whole 
connected  component  of  the  boundary  containing  /. 
If  not  all  free  faces  have  been  visited  we  take  one  of 
the  non-visited  faces  and  continue  from  there.  It  is 
easy  to  see  that  this  requires  time  0(n)  in  total. 

As  a  result  of  Steps  2  and  3  we  get  a  quad-edge 
representation  of  each  connected  component  of  the 
union  boundary  of  the  balls  in  M.  Recall  that  we 
are  only  interested  in  the  boundary  of  the  outer  cell, 
which  we  assumed  to  be  connected.  At  this  stage  we 
need  to  identify  the  outer  cell  (Step  4).  We  search 
for  the  ball  B;  G  M  having  the  point  with  largest 
2:-coordinate  on  its  boundary.  We  then  determine  an 
edge  on  the  boundary  of  the  face  fi  of  Bi  contain¬ 
ing  the  highest  point.  This  face  clearly  belongs  to 
the  outer  cell.  From  this  face  we  scan  the  entire  con¬ 
nected  component  containing  fi,  using  the  quad-edge 
structure.  We  mark  every  edge  found  throughout  the 
scan,  as  belonging  to  the  outer  cell.  Finally  we  delete 
any  edge  that  has  not  been  marked  in  the  previous 
step.  The  remaining  structure  describes  the  outer 
cell.  This  takes  time  proportional  to  the  size  of  the 
union  boundary  which  is  0{n). 

As  for  space  requirements,  the  data  structure  of 
Theorem  3.2  requires  0{n)  space.  The  additional 
structures  described  above  are  easily  verified  to  re¬ 
quire  linear  space. 


Theorem  4.1  The  outer  portion  of  the  boundary  of 
the  union  of  a  connected  collection  of  balls  as  defined 
in  Theorem  2.1  can  be  computed  in  0{n)  randomized 
time  (or  0{n  log  n)  deterministic),  using  0{n)  space. 

If  the  union  of  the  balls  is  not  connected,  additional 
machinery  is  required  to  identify  and  discard  compo¬ 
nents  lying  in  internal  cells.  This  can  be  done  in  time 
O(nlogn)  by  computing  the  decomposition  of  A{M) 
as  described  in  Theorem  2.4  and  traversing  the  outer 
cell  of  this  arrangement.  Details  will  be  given  in  the 
full  paper. 

We  only  implemented  Steps  1  and  2  of  the  above 
algorithm  because  that  is  all  that  is  required  for  hid¬ 
den  surface  removal.  The  following  table  shows  the 
running  time  for  our  four  molecules. 
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mol. 

Step  1 

Step  2 

1 

0.02 

0.2 

2 

0.05 

0.7 

3 

0.42 

6.5 

4 

0.54 

7.5 

As  can  be  seen  Step  2  dominates  the  amount  of 
time  required,  while  theoretically  both  steps  take 
time  0{n).  It  is  though  easy  to  verify  that  the  oper¬ 
ations  involved  in  Step  2  are  much  more  complicated 
and,  hence,  the  constants  in  the  bound  are  much 
higher. 


5  Hidden  Surface  Removal 

One  of  the  tasks  of  molecular  modelling  packages  is 
to  display  molecules.  This  should  preferably  be  done 
so  fast  that  the  user  can  interact  with  the  model, 
e.g.,  by  turning  it  around  to  look  at  it  from  different 
directions  or  by  moving  different  molecules  with  re¬ 
spect  to  each  other,  for  example  to  see  whether  they 
fit  together  in  some  nice  way.  Hence,  one  needs  fast 
algorithms  for  hidden  surface  removal  among  sets  of 
intersecting  spheres. 

In  practice  one  normally  uses  the  Z-buffer  algo¬ 
rithm  for  this.  Almost  all  3-D  graphics  workstation 
available  nowadays  have  such  an  algorithm  imple¬ 
mented  in  their  graphics  hardware.  Unfortunately, 
such  hardware  can  only  handle  polyhedral  objects. 
Hence,  as  a  first  step,  one  has  to  approximate  the 
spheres  by  triangular  meshes.  Different  methods  ex¬ 
ist  but  all  lead  to  a  large  number  of  faces.  To  get 
a  reasonable  sphere  one  needs  at  least  100  triangles. 
So  a  molecule  of  1000  atoms  requires  the  drawing  of 
100,000  triangles  in  3-space.  Even  special  graphics 
hardware  (except  maybe  the  most  expensive  types) 
cannot  display  such  molecules  fast.  It  would  typically 
take  between  one  and  a  few  seconds. 

A  second  approach  used  in  computer  graphics  is 
the  painter  algorithm.  Here  one  tries  to  define  a 
depth  order  on  the  objects,  sorting  them  from  back 
to  front.  Next  one  draws  the  objects  in  this  order  on 
top  of  each  other  (like  a  painter)  where  each  new  ob¬ 
ject  hides  the  parts  of  other  objects  that  lie  below  it. 
Such  an  approach  does  not  require  special  graphics 
hardware  and,  hence,  can  also  be  used  on,  e.g.,  laser 
printers.  The  problem  though  is  that  one  needs  a 
valid  depth  order  on  the  objects.  Such  an  order  does 
not  always  exist  (there  can  be  cyclic  overlap  among 
objects).  Also,  for  intersecting  object  such  an  order 
obviously  does  not  exist. 

For  sets  of  non-intersecting  spheres  an  easy  depth 
order  exists:  simply  sort  the  spheres  by  z-coordinate 
of  their  center.  But  for  intersecting  spheres  this  does 
not  apply.  There  is  though  no  need  to  draw  the  entire 


spheres.  It  is  good  enough  to  draw  the  pieces  of  the 
spheres  that  constitute  the  boundary  of  the  union, 
as  computed  in  the  previous  section.  Clearly,  pieces 
of  the  spheres  that  do  not  contribute  to  the  bound¬ 
ary  cannot  be  visible.  Also  the  parts  of  the  sphere 
that  point  away  from  the  viewing  direction  cannot  be 
seen.  We  will  show  that  for  the  remaining  part  of  the 
boundary  a  depth  order  does  exist. 


Theorem  5.1  Let  S  =  Si,...,Sn  be  a  collection  of 
spheres  sorted  by  decreasing  z-coordinate  of  their  cen¬ 
ter  (i.e.,  from  back  to  front).  For  each  Si  let 
be  the  hemisphere  facing  the  viewing  direction.  Let 
be  the  collection  of  maximal  pieces  of  ILi 
that  are  part  of  the  boundary  of  the  union  of  the 
spheres.  Then 


is  a  valid  depth  order  for  the  pieces  on  the  boundary. 


Proof.  We  will  prove  this  by  contradiction.  So  as¬ 
sume  there  are  two  pieces  and  Hj  that  are  in  the 
wrong  order.  Clearly  i  j  so  let  us  assume  i  <  j. 
Hi  and  ifj  are  in  the  wrong  order  so  there  must  be 
a  ray  in  the  viewing  direction  that  first  hits  and 
than  Hi  (so  H^  partially  hides  Hj ) .  Because  H^  and 
Hj  belong  to  the  union  boundary,  the  ray  must  first 
hit  Hi,  next  the  back  of  Ht,  next  Hi,  and  finally  the 
back  of  Hj.  But  this  implies  that  the  back  of  Hj  lies 
behind  the  back  of  H^  and,  hence,  j  <  i,  which  con¬ 
tradicts  our  assumption.  □ 


This  leads  to  the  following  algorithm.  We  first  com¬ 
pute  the  boundary  of  the  union.  In  this  way,  for  each 
sphere  Si  we  collect  a  (constant)  number  of  pieces 
that  it  contributes  to  the  boundary.  For  each  of  these 
pieces  we  cut  off  the  part  that  does  not  lie  in  Hi.  Next 
we  sort  the  spheres  by  depth  and  draw  the  pieces  in 
this  order.  As  shown  in  the  previous  section  comput¬ 
ing  the  boundary  takes  deterministic  time  0(n  log  n) 
and  results  in  0{n)  pieces.  The  sorting  of  the  spheres 
by  depth  takes  time  O(nlogn).  Hence,  we  obtain  the 
following  result: 

Theorem  5.2  Given  a  set  of  n  spheres  as  defined  in 
Theorem  2.1,  a  valid  depth  order  consisting  of  0{n) 
pieces  can  be  computed  in  time  0(n  log  n). 

We  implemented  this  algorithm.  Actually,  we  split 
it  into  two  phases:  In  the  preprocessing  phase  w^e 
compute  the  boundary  of  the  union  and  for  each 
sphere  we  collect  the  pieces  that  are  part  of  the 
boundary.  (See  the  previous  section  for  time  bounds.) 
In  the  query  phase  we  are  given  a  particular  viewing 
direction.  Now  we  compute  the  hemi-spheres  (which 
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are  dependent  on  the  viewing  direction)  and  cut  off 
the  parts  that  lie  outside  it.  Next  we  compute  the 
depth  order  for  the  spheres  and  draw  the  pieces.  All 
pictures  in  this  paper  were  generated  using  this  ap¬ 
proach.  The  implementation  (even  though  not  yet 
fully  optimized)  is  rather  fast.  We  did  run  it  on  two 
different  machines:  An  Indigo  R3000  machine  with 
entry  level  graphics  and  an  Indigo  R3000  XS24  with 
hardware  Z-buffer.  The  first  machine  performs  the 
Z-buffer  algorithm  in  software  while  the  second  one 
has  special  purpose  hardware  for  this.  The  following 
table  shows  the  amount  of  time  (in  seconds)  required 
per  frame  for  both  the  Z-buffer  algorithm  and  our 
method. 


entry  graphics 

X 

CO 

1 

24 

mol. 

Z-buf 

new 

z-buf 

new 

1 

0.5 

0.2 

0.1 

0.1 

2 

1.1 

0.5 

0.25 

0.3 

3 

5.5 

3.6 

1.85 

2.0 

4 

7.4 

4.6 

2.35 

2.9 

From  these  figures  one  might  conclude  that  the 
new  method  runs  about  twice  as  fast  on  the  entry 
level  system  while  it  is  slightly  slower  on  the  XS-24. 
This  should  though  be  treated  with  care.  First  of  all, 
the  pictures  produced  by  the  two  methods  are  quite 
different.  The  way  we  implemented  it,  the  Z-buffer 
algorithm  produces  a  Gouraud  shaded  image  while 
our  new  method  produces  a  flat  shaded  image  (be¬ 
cause  the  method  draws  in  2-d  one  cannot  produce 
anything  else).  Our  method  though  draws  boundary 
lines  around  the  spheres  and  along  the  intersections 
(like  in  Figures  1  till  4)  while  the  Z-buffer  method 
does  not  (it  cannot  because  it  does  not  know  where 
the  intersections  are).  Also  the  picture  quality  differs 
largely.  On  8-bit  systems  the  Gouraud  shaded  image 
looks  rather  displeasing  while  the  images  produced  by 
the  new  method  look  much  nicer.  Also  there  is  the 
issue  of  precision.  For  the  Z-buffer  we  used  an  ap¬ 
proximation  of  the  spheres  with  about  100  triangles. 
In  the  new  method  we  used  an  approximation  of  the 
circles  and  ellipses  (projected  sphere  boundaries  and 
intersections)  with  24  edges,  which  is  better.  Finally, 
the  difference  in  time  for  our  new  method  on  the  two 
different  platforms  is  caused  because,  on  the  entry 
level  systems,  more  than  half  of  the  time  is  spent  in 
the  actual  drawing,  not  in  the  computation  of  the 
depth  order.  On  the  XS-24  this  is  much  less.  For 
more  details  on  the  implementation  see  [9]. 

So  we  can  conclude  that  our  new  approach  pro¬ 
duces  nice  images  and  can  be  used  on  simple  graphics 
workstations  without  3-D  capabilities.  Even  on  ma¬ 
chines  with  3-D  graphics  hardware  the  method  be¬ 
haves  comparably.  Finally,  and  probably  most  im¬ 
portant,  the  new  method  can  be  used  to  produce  high 


quality  Postscript  pictures  which  is  impossible  with 
the  Z-buffer  algorithm. 

In  some  applications  a  depth  order  is  not  enough 
but  one  wants  a  combinatorial  representation  of  the 
visible  pieces,  the  visibility  map.  A  lot  of  work  has 
been  done  in  computational  geometry  in  producing 
visibility  maps.  See,  e.g.,  the  book  of  de  Berg  [6]. 
For  collections  of  arbitrary  intersecting  spheres  the 
best  known  hidden  surface  removal  algorithm  runs 
in  randomized  expected  time  0(n^+')  for  any  e  >  0 
(using  the  general  algorithm  for  computing  the  lower 
envelope  of  low-degree  algebraic  surfaces  in  3-space 
by  Sharir[19]).  No  output-sensitive  method,  where 
the  time  bound  depends  on  the  complexity  of  the  re¬ 
sulting  visibility  map,  is  known.  In  the  case  of  a  set 
of  non-intersecting  spheres  the  best  known  method 
is  presented  in  [12].  It  runs  in  time  0((n  -f  k)  log^  n) 
where  k  is  the  complexity  of  the  visibility  map  (which 
can  be  0(n^),  even  for  non-intersecting  unit  spheres). 
We  will  now  briefly  indicate  how  the  method  of  [12] 
can  be  adapted  to  work  for  intersecting  spheres  as 
well.  For  more  details  see  the  full  paper.  We  assume 
some  familiarity  with  the  method  of  [12]. 

For  the  method  of  [12]  to  work  one  needs  to  sub¬ 
divide  the  objects  in  a  (small)  collection  of  pieces 
that  first  of  all  has  a  depth  order  and,  secondly,  has 
the  property  that  the  union  of  the  projection  on  the 
viewing  plane  of  the  pieces  lying  in  any  depth  range 
has  small  complexity.  One  could  use  the  collection  of 
pieces  as  defined  in  Theorem  5.1.  It  is  though  unclear 
whether  this  collection  satisfies  the  second  condition. 
Hence,  we  define  a  different  set  of  pieces. 

For  each  sphere  Si  we  again  consider  the  hemi¬ 
sphere  Hi.  We  take  the  pieces  of  Hi  that  are  con¬ 
tained  in  the  boundary  of  the  union  of  {Hi,  ...,Hi}, 
i.e.,  we  ignore  the  spheres  that  lie  nearer.  In  a  way 
similar  to  the  proof  of  Theorem  5.1  one  can  prove 
that  these  pieces  constitute  a  valid  depth  order.  In 
the  full  paper  we  will  also  prove  that  for  any  2:-range 
the  union  of  the  projection  of  the  pieces  whose  sphere 
center  lies  in  the  range,  is  linear.  Applying  the  results 
in  [12]  we  obtain: 

Theorem  5.3  Given  a  set  of  n  spheres  as  defined  in 
Theorem  2.1,  the  visibility  map  can  be  computed  in 
time  0{{n  +  A:)log^n),  where  k  is  the  complexity  of 
the  resulting  map. 

6  Conclusion 

We  have  considered  the  hard  sphere  model  of  a 
molecule  in  a  fixed  nuclear  configuration  from  a  com¬ 
putational  geometry  point  of  view.  We  have  shown 
that  the  collection  of  spheres  representing  the  atoms 
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of  a  molecule  in  that  model  behaves  favorably  in  com¬ 
parison  with  an  arbitrary  collection  of  spheres  in  3- 
space.  Using  this  observation  we  were  able  to  de¬ 
vise  a  data  structure  that  efficiently  detects  which 
atom  spheres  of  a  molecule  are  intersected  by  a  query 
sphere  (of  bounded  radius).  Then,  we  presented  an 
efficient  algorithm  for  computing  the  boundary  (or 
envelope)  of  the  union  of  atom  spheres.  Furthermore, 
we  presented  efficient  algorithms  for  hidden  surface 
removal  of  a  molecule.  Our  results  have  been  sup¬ 
ported  by  experiments  carried  out  on  actual  protein 
data,  displaying  the  efficiency  and  practicality  of  our 
approach. 

We  believe  that  other  (algorithmic)  problems  in 
molecular  modelling  can  be  solved  efficiently  using 
the  data  structures  and  properties  presented  in  this 
paper.  The  area  also  provides  many  other  chal¬ 
lenging  problems,  like:  Is  it  possible  to  preprocess 
two  molecules  such  that  one  can  efficiently  determine 
whether,  in  a  particular  pose,  the  molecules  intersect 
or  not?  Also  other  models  of  molecules  give  rise  to 
interesting  question.  For  example,  there  is  the  prob¬ 
lem  of  hidden  surface  removal  for  the  ball-and-stick 
model  where  atoms  are  represented  by  small  balls  and 
the  links  by  thin  sticks.  We  plan  to  work  further  on 
these  problems. 
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Abstract 

A  polyhedron  P  is  castable  if  its  boundary  can  be  par¬ 
titioned  by  a  plane  into  two  polyhedral  terrains.  Such 
polyhedra  can  be  manufactured  easily  using  two  cast 
parts.  Assuming  that  the  cast  parts  are  removed  by 
a  single  translation  each,  it  is  shown  that  for  a  simple 
polyhedron  with  n  vertices,  castability  can  be  decided 
in  O(n^logn)  time  and  linear  space  using  a  simple  al¬ 
gorithm.  Furthermore,  a  more  complicated  algorithm 
solves  the  problem  in  time  and  space,  for  any 

fixed  e  >  0.  In  the  case  where  the  cast  parts  are  to 
be  removed  in  opposite  directions,  a  simple  0{v?)  time 
algorithm  is  presented.  Finally,  if  the  object  is  a  con¬ 
vex  polyhedron  and  the  cast  parts  are  to  be  removed 
in  opposite  directions,  a  simple  0(nlog^  n)  algorithm  is 
presented. 


1  Introduction 

A  new  application  area  of  computational  geometry  is  in 
the  area  of  automated  manufacturing,  where  an  engi¬ 
neer  can  design  an  object  with  the  aid  of  a  computer, 
and  determine  by  which  manufacturing  process  the  ob¬ 
ject  can  be  constructed.  There  are  several  types  of  man¬ 
ufacturing  processes  studied  in  computational  geometry, 
such  as  injection  molding  [5,  6,  13],  NC  machining  [14], 
automated  welding  [17]  and  layer  deposition  methods 
(such  as  stereolithography)  [3]. 

In  this  paper,  we  study  the  geometric  and  computa¬ 
tional  aspects  of  casting.  Casting  consists  of  filling  the 
open  region  bounded  by  two  or  more  cast  parts  with 
a  material  such  as  a  liquid  metal,  after  which  the  cast 
parts  are  removed.  The  removal  of  the  cast  parts  with¬ 
out  breaking  them  imposes  certain  restrictions  on  the 
shape  of  the  object  to  be  constructed.  For  sand  casting 
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(see  e.g.  [12,  23]),  only  two  cast  parts  are  used.  To  con¬ 
struct  the  cast  parts,  a  prototype  of  the  object  is  first 
obtained  (see  Figure  1).  The  prototype  is  then  divided 
into  two  parts  along  a  plane.  The  face  along  which  the 
part  is  cut  is  referred  to  as  the  base.  The  first  cast 
part  is  made  by  placing  the  base  of  the  part  on  a  flat 
surface,  and  then  adding  sand  around  it.  The  part  is 
then  rotated  such  that  the  base  is  facing  up.  The  other 
part  is  placed  such  that  the  bases  coincide  and  the  sec¬ 
ond  cast  part  is  built  by  adding  sand  around  this  part. 
An  opening  into  the  cavity  is  maintained  during  the 
construction  of  the  second  cast  part.  This  completes 
the  construction  of  the  cast  of  the  prototype  object.  To 
built  a  metal  rendition  of  the  prototype  object  with  this 
cast,  liquid  metal  is  poured  into  the  opening  until  it  fills 
the  cavity.  After  the  metal  solidifies,  the  cast  parts  are 
removed  from  the  object.  To  be  able  to  construct  the 
cast  with  this  process,  it  is  necessary  that  the  proto¬ 
type  can  be  removed  without  breaking  the  cast.  Also, 
for  manufacturing  methods  related  to  sand  casting  such 
as  other  metal  casting  methods  [12,  23]  and  injection 
molding  and  blow  molding  methods  for  plastics  [20,  24], 
it  must  be  possible  to  remove  the  constructed  object 
from  the  cast  without  breaking  it  in  order  to  re-use  the 
cast.  Thus  for  several  different  manufacturing  methods 
involving  casting,  the  geometry  of  the  object  determines 
its  feasibility  of  construction. 

We  note  that  more  complicated  objects  can  be  made 
by  using  cores  and  inserts  [12,  20,  23,  24].  However,  their 
use  slows  down  the  manufacturing  process  and  makes  it 
more  costly.  Thus  to  be  cost  efficient,  cores  and  inserts 
should  be  avoided.  We  do  not  study  the  extra  possibil¬ 
ities  of  cores  and  inserts  in  this  paper. 

An  object  is  castable  if  it  can  be  manufactured  by 
casting.  Geometric  and  algorithmic  issues  of  casting 
of  planar  objects  has  been  studied  by  Rosenbloom  and 
Rappaport  [21].  This  paper  addresses  casting  of  poly¬ 
hedral  objects.  In  geometric  terms,  castability  can  be 
defined  as  follows  (for  a  polyhedron  P,  dP  denotes  the 
boundary  of  P,  and  for  a  plane  h,  /i+  and  h~  refer  to 
the  open  half-spaces  above  and  below  h): 

Definition  1  A  simple  polyhedron  P  is  castable  if  there 
exists  a  plane  h  such  that  h'^  n  dP  is  a  weak  terrain  in 
some  orientation,  and  h^DdP  is  a  weak  terrain  in  some 
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sand  cast  of  lower  half 


remove  prototype  halves 
from  the  cast  parts 


forming  the 
metal  object 


Figure  1:  Construction  of  an  object  by  sand  casting,  using  two  halves  of  the  object  as  prototypes. 


orientation.  The  plane  h  is  called  the  casting  plane.  (A 
weak  terrain  may  contain  edges  and  facets  parallel  to  the 
orientation  in  which  it  is  a  terrain.) 

To  manufacture  a  polyhedron  P  that  is  castable,  first 
determine  a  casting  plane  h  for  P.  Then  the  cast  parts 
Cl  and  C2  are  made  from  the  prototype  halves  n  dP 
and  h~  n  dP.  Since  P  is  castable,  the  prototype  halves 
can  be  removed  from  the  cast  parts,  and  later  the  man¬ 
ufactured  object  can  be  removed  from  the  cast  parts. 
Although  our  algorithms  deal  with  the  sand  casting  pro¬ 
cess,  they  can  be  applied  in  related  processes  as  well. 
We  consider  three  versions  of  the  castability  problem. 
They  differ  in  the  way  the  cast  may  be  removed  from 
the  polyhedron  P.  Figure  2  shows  the  three  versions  for 
planar  polygons. 


f _ Z  _ f 


Figure  2:  Three  versions  of  the  castability  problem. 

1.  The  two  cast  parts  must  be  removed  from  P  by 
one  translation  each,  in  opposite  directions,  and 
normal  to  the  casting  plane  (orthogonal  cast  re¬ 
moval). 

2.  The  two  cast  parts  must  be  removed  from  P  by 


one  translation  each,  and  in  opposite  directions 
(opposite  cast  removal). 

3.  The  two  cast  parts  must  be  removed  from  P  by 
one  translation  each,  in  arbitrary  directions  (arbi¬ 
trary  cast  removal). 

In  manufacturing,  developing  machines  that  perform 
orthogonal  and  opposite  cast  removal  is  much  simpler 
than  machines  that  perform  arbitrary  cast  removal.  In 
fact,  opposite  cast  removal  seems  to  be  the  most  popu¬ 
lar  technique  used  [8,  20].  Furthermore,  if  orthogonal  or 
opposite  cast  removal  is  possible,  it  can  be  determined 
more  efficiently.  We  summarize  the  complexity  of  the 
different  algorithms  we  developed  for  the  casting  prob¬ 
lem.  In  the  top  half  of  the  table,  the  time  bounds  of 
simple,  linear  space  algorithms  are  shown.  The  bottom 
half  of  the  table  shows  improvements  made  (in  theory) 
by  using  0(n^/^+')  storage  (for  any  positive  constant 

e). 


lin.  space 

orthogonal 

opposite 

arbitrary 

convex 

0(n  log^  n) 

0(nlog^  n) 

0{n^  logn) 

simple 

0{n^) 

0{n^) 

0{n^  logn) 

best  res. 

orthogonal 

opposite 

arbitrary 

convex 

Q(^3/2  +  e^ 

simple 

0(^n^/2+e) 

0[jfil2+e) 

Q(^3/2+e) 

Our  algorithms  depend  on  geometric  properties  of 
casting,  discussed  in  Section  2.  The  maximum  number 
of  distinct  cast  planes  is  analysed  in  Section  3,  and  al¬ 
gorithms  are  given  in  Sections  4  and  5.  For  detailed 
proofs  and  descriptions  of  the  algorithms,  we  refer  the 
reader  to  the  technical  report  [4]. 
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2  Preliminaries 

For  a  polyhedron  P,  denote  by  V,  E  and  F  the  set 
of  vertices,  edges  and  facets.  Edges  that  bound  two 
parallel  facets  are  not  allowed;  they  can  be  removed 
without  changing  the  shape  of  the  polyhedron.  Edges 
and  facets  are  open,  the  closure  of  an  edge  e  or  facet  / 
is  denoted  by  d{e)  or  c/(/).  Although  technically  the 
object  to  be  constructed  is  the  interior  of  P,  and  the 
boundary  of  P  is  part  of  the  cast,  with  a  slight  abuse 
of  notation,  we  nevertheless  state  that  P  is  castable  or 
not  castable. 

For  a  non-vertical  plane  h,  we  denote  by  /i+  and 
h~  the  open  half-spaces  above  and  below  h.  If  h  is 
vertical  but  does  not  contain  a  line  parallel  to  the  y-axis, 
then  and  h~  denote  the  open  half-spaces  bounded 
by  h  that  contain  the  points  (0,oo,0)  and  (0, -oo,0), 
respectively.  If  h  is  vertical  and  contains  a  line  parallel 
to  the  2/-axis  then  and  h~  denote  the  open  half¬ 
spaces  bounded  by  h  that  contain  the  points  (oo,0,0) 
and  (—00, 0,0),  respectively. 

Let  P  be  a  polyhedron  and  let  h  be  a  plane  that 
intersects  P.  If  /i  is  a  casting  plane  and  the  upper  cast 
part  can  be  removed  in  direction  u,  then  the  outward 
normal  of  any  facet  f  of  P  that  lies  in  /i+  makes  an  angle 
at  most  ■k/2  with  u.  This  also  holds  when  /  lies  partially 
in  /i+.  Also,  if  all  facets  of  P  that  lie  completely  or 
partially  in  /i+  make  an  angle  at  most  7r/2  with  u,  then 
/i+  n  dP  is  a  weak  terrain.  Therefore,  castability  with 
respect  to  a  plane  h  is  only  determined  by  the  facets  of 
P  that  intersect  and  the  ones  that  intersect  h~ .  Ifh 
is  a  casting  plane  for  P,  then  h  can  be  perturbed  if  this 
does  not  involve  new  facets  intersecting  h.  In  case  of 
orthogonal  cast  removal,  the  only  perturbation  allowed 
is  translation. 

Observation  1  For  castability  with  orthogonal  cast  re¬ 
moval,  we  may  assume  that  the  casting  plane  contains 
at  least  one  vertex  of  P.  For  opposite  and  arbitrary  cast 
removal,  we  may  assume  that  the  casting  plane  contains 
at  least  three  vertices  of  P. 

2.1  Relation  to  linear  programming 

Let  P  be  a  polyhedron  and  let  /i  be  a  plane.  The  plane 
h  partitions  the  set  V  of  vertices  of  P  into  three  subsets 
Vh,  and  Vff  of  vertices  in,  above  and  below  h,  re¬ 
spectively.  Similarly,  h  partitions  the  set  E  of  edges  of  P 
in  four  subsets  E^,  E^ ,  and  Ejf  of  edges  contained 
in  h,  intersecting  h,  above  h  and  below  h,  respectively. 
The  set  P  of  facets  is  partitioned  in  the  same  way.  For 
any  facet  f  G  F,  denote  by  T(/)  the  closed  half-space 
supporting  /  but  not  containing  the  interior  of  P.  De¬ 
note  by  ^o(/)  the  same  half-space,  but  translated  such 


that  the  bounding  plane  contains  the  origin.  Let 

t{h)=  fl  ^o(/)  and  r{h)=  f|  To(/). 
f€F+uFf  /ep:uF,f 

The  intersection  of  a  set  of  half-spaces  is  non-trivial  if  it 
contains  more  than  a  single  point.  Denote  by  refl{b)  the 
reflection  of  an  object  b  through  the  origin.  We  make 
the  following  observations. 

Lemma  1  h  is  a  casting  plane  for  polyhedron  P  for 
arbitrary  cast  removal  if  and  only  if  and  f~{h) 

are  both  non-trivial. 

Lemma  2  h  is  a  casting  plane  for  polyhedron  P  for 
opposite  cast  removal  if  and  only  if  n  refi{f^{h)) 
is  non-trivial. 

Lemma  3  Let  h  be  a  plane  and  let  i  be  a  line  perpen¬ 
dicular  to  h  and  through  the  origin,  h  is  a  casting  plane 
for  polyhedron  P  for  orthogonal  cast  removal  if  and  only 
ifen^+{h)  n  refl{£,-{h))  is  non-trivial. 

With  the  above  lemmas,  we  can  test  for  a  given  plane 
h  efficiently  whether  it  is  a  casting  plane  for  P.  Since 
the  casting  problem  for  a  plane  h  and  a  polyhedron  P 
can  be  transformed  in  linear  time  to  a  linear  program¬ 
ming  problem  in  3  dimensions,  the  test  requires  only 
linear  time  [18].  In  fact,  since  all  planes  bounding  the 
half-spaces  contain  the  origin,  the  linear  programming 
problem  is  basically  2-dimensional. 

Lemma  4  Given  a  polyhedron  P  and  a  plane  h,  one 
can  test  in  linear  time  whether  h  is  a  casting  plane  for 
P  in  any  of  the  three  versions  for  removing  the  cast. 

2.2  Geometric  properties  of  casting 

In  this  section,  we  uncover  some  important  geometric 
properties  of  casting  planes  that  will  allow  us  to  bound 
the  number  of  distinct  casting  planes  and  hence  develop 
efficient  algorithms. 

Lemma  5  If  a  simple  polyhedron  P  is  castable  in  non¬ 
opposite  directions  with  casting  plane  h,  then  h  contains 
an  edge  of  P. 

Lemma  6  If  a  simple  polyhedron  P  is  castable  with 
casting  plane  h  and  in  non- opposite  directions,  then  h 
contains  an  edge  of  the  convex  hull  of  P. 

Notice  that  the  above  two  lemmas  imply  that  if  a 
polyhedron  is  castable,  but  not  with  opposite  cast  re¬ 
moval,  then  the  casting  plane  contains  both  an  edge  of 
P  and  an  edge  of  the  convex  hull  of  P  (this  might  be 
the  same  edge).  This  will  aid  considerably  to  determine 
castability  with  arbitrary  cast  removal. 
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The  following  lemma  forms  the  crucial  link  between 
simple  polyhedra  and  convex  polyhedra  in  terms  of 
castability  with  removal  directions  restricted  to  orthog¬ 
onal  and  opposite. 

Lemma  7  If  a  simple  polyhedron  P  is  castable,  then  the 
convex  hull  of  P  is  also  castable  using  the  same  casting 
plane  and  cast  removal  directions. 

Lemma  8  If  the  casting  plane  h  properly  intersects  a 
facet  f  of  convex  polyheron  P,  and  also  two  vertices 
u.v  in  the  closure  of  f ,  then  for  opposite  cast  removal, 
vertices  u,v  must  be  antipodal  in  cl{f). 

Corollary  1  Let  h  be  a  casting  plane  for  a  convex  poly¬ 
hedron  P  which  intersects  a  facet  f  properly,  and  as¬ 
sume  opposite  cast  removal.  If  h  intersects  a  vertex  v 
and  properly  intersects  an  edge  e  in  the  closure  of  f, 
then  V  is  antipodal  to  both  endpoints  of  e.  If  h  properly 
intersects  two  edges  in  the  closure  of  f,  then  they  are 
parallel. 

Lemma  8  and  Corollary  1  are  the  two  geometric  facts 
that  allow  us  to  solve  opposite  cast  removal  more  effi¬ 
ciently  than  arbitrary  cast  removal. 

In  [4],  we  also  show  that  for  a  convex  polyhedron  P, 
the  linear  programming  problems  defined  by  P  and  a 
candidate  casting  plane  h  need  not  consider  all  facets  of 
F,  but  only  those  intersecting  h  and  those  adjacent  to 
h,  leading  to  the  algorithm  described  in  Section  4. 

3  The  number  of  distinct  casting 
planes 

Given  a  polyhedron  P,  two  planes  hi  and  /12  are  (combi- 
natorially)  distinct  if  the  partitioning  of  the  facets  into 
F+,  F~,  F'-  and  F^  they  define  is  different.  By  Ob¬ 
servation  1,  a  trivial  upper  bound  on  the  number  of 
distinct  casting  planes  for  a  polyhedron  with  n  vertices 
is  0{n^). 

This  section  gives  a  linear  upper  bound  on  the  maxi¬ 
mum  number  of  distinct  casting  planes  for  convex  poly¬ 
hedra  in  case  of  orthogonal  and  opposite  cast  removal 
as  well  as  a  quadratic  upper  bound  for  arbitrary  cast 
removal. 

3.1  Orthogonal  and  opposite  cast  re¬ 
moval 

We  restrict  our  attention  to  opposite  cast  removal,  since 
orthogonal  cast  removal  is  (close  to)  a  special  case  of  op¬ 
posite  cast  removal,  and  there  are  only  minor  differences 
between  the  two  cases.  For  an  explicit  rendition  of  the 
differences,  we  refer  the  reader  to  [4].  For  opposite  cast 


removal,  we  have  the  following  properties  of  intersec¬ 
tions  of  a  casting  plane  h  and  a  polyhedron  P:  (i)  The 
facets  of  that  intersect  h  properly  have  their  out¬ 
ward  normals  such  that  when  translated  to  the  origin, 
they  span  a  plane  or  part  of  it  (since  f|{l'o(/)  |  /  e  F^} 
contains  a  line  through  o).  (ii)  All  edges  that  intersect  h 
properly  are  parallel  (otherwise  the  incident  facets  span 
more  than  a  plane),  (iii)  Any  two  vertices  in  the  closure 
of  a  facet  /  and  in  h  are  antipodal  in  cl{f).  Any  ver¬ 
tex  and  edge  in  the  closure  of  /  and  intersecting  h  are 
antipodal  in  c/(/)  (see  Lemma  8). 

Lemma  9  Given  a  convex  polyhedron  P,  the  number 
of  distinct  casting  planes  that  intersect  some  edge  of  P 
properly  is  at  most  linear  in  the  number  of  vertices  of 
P,  assuming  opposite  cast  removal. 

Let  be  a  candidate  casting  plane  of  F,  and  let 
Q  =  h  Cl  P.  If  Q  contains  three  consecutive  vertices 
u,v,w  that  are  also  vertices  of  F,  then  each  of  u  and 
w  is  either  an  endpoint  of  an  edge  incident  to  v,  or  a 
vertex  antipodal  to  v  on  the  closure  of  a  facet  /  incident 
to  V.  We  say  that  the  plane  through  u,  v,  w  is  generated 
by  V.  It  follows  that  the  set  of  candidate  casting  planes 
generated  by  v  has  size  >  where  d  is  the  degree  of  v 
and  a  is  the  number  of  vertices  antipodal  to  v  in  the  clo¬ 
sures  of  the  facets  incident  to  v.  Every  casting  plane  h 
that  does  not  intersect  any  edge  properly  contains  three 
vertices  that  are  consecutive  in  fin  P,  and  therefore,  ev¬ 
ery  such  casting  plane  is  generated  by  some  vertex  of  P. 
It  is  straightforward  to  prove  that  every  convex  polyhe¬ 
dron  has  a  vertex  of  constant  degree  which  participates 
in  a  constant  number  of  antipodal  pairs  (on  the  incident 
facets).  Using  this  fact  inductively,  we  obtain: 

Theorem  1  Given  a  convex  polyhedron  P  with  n  ver¬ 
tices,  the  maximum  number  of  distinct  casting  planes 
for  P  is  0{n),  assuming  opposite  removal  of  the  cast 
parts. 

There  is  another  interesting  combinatorial  bound  on 
the  complexity  of  the  intersection  of  all  distinct  casting 
planes  with  a  convex  polyhedron.  Two  distinct  casting 
planes  hi  and  h2  that  properly  intersect  the  same  edge 
are  similar,  because  they  define  the  same  cast  removal 
directions,  and  they  intersect  the  same  closure  of  edges 
and  facets.  In  other  words,  if  hi  and  ^2  each  properly 
intersect  edges  that  are  parallel,  there  cannot  be  two 
vertices  u,v  strictly  to  the  one  side  of  hi  and  strictly 
to  different  sides  of  h2.  We  use  the  term  weakly  equiv¬ 
alent  for  two  such  planes.  Two  planes  are  weakly  dis¬ 
tinct  if  they  are  not  weakly  equivalent.  There  are  0{n) 
weakly  distinct  casting  planes  for  any  convex  polyhe¬ 
dron  P  with  n  vertices.  We  analyse  the  combinatorial 
complexity  of  h  f)  P,  summed  over  all  weakly  distinct 


126 


casting  planes  h.  This  quantity  is  well-defined  for  op¬ 
posite  cast  removal,  since  two  weakly  equivalent  casting 
planes  have  an  equal-size  intersection  with  P  (although 
they  may  intersect  different  facets,  edges  and  vertices). 
We  prove  a  bound  of  0(n  log  n)  on  the  summed  com¬ 
plexity.  Note  that  when  the  sum  is  over  all  distinct  cast¬ 
ing  planes  (not  weakly  distinct),  the  summed  complex¬ 
ity  can  be  0(n^)  if  P  has  a  set  of  Q{n)  parallel  edges. 
The  bound  makes  use  of  a  hierarchical  decomposition 
of  P  that  closely  resembles  the  hierarchy  of  Dobkin  and 
Kirkpatrick  [10].  It  is  the  basis  of  the  O(nlog^n)  time 
algorithm  for  casting  of  convex  polyhedra  with  opposite 
cast  removal. 

Lemma  10  Given  a  convex  polyhedron  P  with  n  ver¬ 
tices,  there  exists  a  subset  V  of  the  vertices  V  of  size 
fi(n),  such  that  each  v  e  V  has  degree  <  8  and  is  an¬ 
tipodal  to  <  12  vertices  in  facets  incident  to  v. 

The  following  hierarchical  decomposition  of  P  gen¬ 
erates  a  set  of  planes  that  contains  all  the  candidate 
casting  planes  that  do  not  intersect  an  edge  properly; 

1.  Mark  all  edges  of  P  as  “original”.  Compute  all 
antipodal  pairs  of  all  facets  of  P  and  mark  them 
as  “original”.  Set  i  =  1. 

2.  Select  a  subset  Vi  of  V  as  in  Lemma  10.  For  every 

vertex  v  e  Vi,  generate  all  planes  through  u,v,w 
for  which  uv  and  vw  are  original  edges,  or  {u,v) 
and  {v,w)  are  original  antipodal  pairs,  or  one  is  an 
original  edge  and  the  other  an  original  antipodal 
pair.  For  every  vertex  v  G  Vi,  the  number  of  gen¬ 
erated  planes  is  at  most  =  190,  thus  0{n) 

for  the  whole  subset. 

3.  Recompute  the  convex  hull  of  the  vertices  of  P 
minus  the  vertices  of  Vi. 

4.  Recompute  the  antipodal  pairs  of  the  facets. 
(Note:  Edges  or  antipodal  pairs  of  CH{P  \  Vi) 
are  not  original  unless  they  were  deemed  original 
in  step  1.) 

5.  Repeat  at  step  2  with  i  =  i -\- 1  unless  P  has  no 
vertices  left. 

As  in  [10],  the  number  of  subsets  Vi  is  O(logn).  It 
follows — as  we  show  in  [4] — that  every  n(log  n)  consecu¬ 
tive  vertices  of  P  contained  in  a  casting  plane  h  contains 
a  vertex  that  generates  h.  This  leads  to: 

Theorem  2  Given  a  convex  polyhedron  P  with  n  ver¬ 
tices,  the  maximum  total  complexity  of  hn  P,  summed 
over  all  weakly  distinct  casting  planes  h  for  P,  is 
O(nlogn),  assuming  opposite  cast  removal. 


Corollary  2  Given  a  convex  polyhedron  P  with  n  ver¬ 
tices,  the  maximum  number  of  planes  that  intersect  the 
interior  of  P  but  do  not  intersect  any  facets  of  P  is 
0{n),  and  the  maximum  number  of  edges  of  P  contained 
in  these  planes,  summed  over  all  planes,  is  0(n  log n). 

3.2  Arbitrary  cast  removal 

We  have  shown  that  the  number  of  casting  planes  that 
also  allow  opposite  cast  removal  is  linear.  For  the  other 
casting  planes,  we  know  from  Lemma  5  that  they  con¬ 
tain  an  edge  of  P.  Since  we  may  also  assume  that  they 
contain  a  third  vertex,  we  conclude: 

Theorem  3  Given  a  convex  polyhedron  P  with  n  ver¬ 
tices,  the  maximum  number  of  distinct  casting  planes 
for  P  is  O(n^),  assuming  arbitrary  removal  of  the  cast 
parts. 

4  Algorithms  for  orthogonal  and 
opposite  cast  removal 

In  this  section  and  the  next,  algorithms  are  presented  for 
the  computation  of  casting  planes,  and  hence,  determine 
whether  a  given  polyhedron  is  castable.  This  section 
focuses  on  orthogonal  and  opposite  cast  removal. 

4.1  A  simple  algorithm  for  simple  poly¬ 
hedra 

We  compute  0(n)  candidate  casting  planes  as  follows. 
By  Lemma  7,  we  need  only  consider  the  casting  planes 
of  the  convex  hull  of  P.  We  first  compute  the  candidate 
casting  planes  that  intersect  some  edge  properly,  and 
then  the  ones  that  are  generated.  We  only  consider  op¬ 
posite  cast  removal;  the  case  of  orthogonal  cast  removal 
only  requires  some  straightforward  changes. 

Let  El,. . .  ,Ek  be  a  partitioning  of  E  into  subsets  of 
parallel  edges.  Let  VG  be  the  set  of  vertices  that  must 
be  contained  in  the  casting  plane  for  the  cast  removal 
direction  parallel  to  the  edges  of  Ei.  For  each  Ei,  we 
compute  the  convex  hull  Gf  of  the  lower  endpoints  of 
Ei  and  the  convex  hull  Gf  of  the  upper  endpoints  of  Ei. 
The  convex  hull  of  \Ei  \  points  in  3-dimensional  space  can 
be  computed  in  (9(|£li|  log  |Ei|)  time,  see  e.g.  [11,  19]. 
Then  we  compute  all  planes  that  contain  the  vertices 
of  VV  and  separate  the  interiors  of  Gf  and  G~ ,  and 
contain  at  least  three  vertices  of  VG ,  Gf  and  GJ .  This 
gives  0(\Ei\)  candidate  casting  planes.  Summed  over  all 
subsets  Ei,...,Ek,  we  obtain  0{n)  candidate  casting 
planes  in  0(n  log  n)  time. 

Second,  we  compute  the  other  candidate  casting 
planes,  which  are  the  generated  planes  of  the  hierar¬ 
chical  decomposition  of  Subsection  3.1.  Antipodal  pairs 
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computations  take  0{n)  time  and  convex  hull  compu¬ 
tations  take  0(nlog7i)  time,  see  e.g.  [11,  19].  The  total 
time  taken  by  the  process  is  given  by  the  recurrence 
T{n)  <  T{{\  —  a)n)  +  O(nlogn),  where  a  is  the  con¬ 
stant  in  the  fl{n)  of  Lemma  10.  This  recurrence  solves 
to  T{n)  =  O(nlogn).  We  conclude: 

Lemma  11  Given  a  polyhedron  P  with  n  vertices,  one 
can  compute  in  O{n\ogn)  time  a  set  Q  of  0{n)  planes 
such  that  any  casting  plane  h  that  contains  at  least  three 
vertices  of  P  is  contained  in  Q,  assuming  opposite  cast 
removal. 

Theorem  4  Given  a  polyhedron  P  with  n  vertices,  one 
can  decide  in  0{n‘^)  time  and  linear  space  whether  P  is 
castable  when  the  cast  parts  must  be  removed  in  orthog¬ 
onal  or  opposite  directions. 

4.2  Walking  around  convex  polyhedra 

For  convex  polyhedra,  the  result  can  be  improved.  Re¬ 
call  that  determining  whether  a  plane  is  a  casting 
plane  for  P  can  be  done  by  only  considering  the  facets 
intersected  by  h  and  the  facets  incident  to  the  edges 
that  are  contained  in  h  (this  only  holds  for  convex  poly¬ 
hedra),  see  Section  2.2.  A  linear  program  on  this  set 
of  facets  tells  us  whether  /i  is  a  casting  plane.  We  also 
know,  by  Theorem  2,  that  the  total  number  of  facets 
that  we  check,  for  all  0{n)  candidate  casting  planes,  is 
only  O(nlogn).  To  test  each  candidate  casting  plane, 
either  generated  or  a  member  of  an  equivalence  class  of 
weakly  equivalent  planes,  our  algorithm  walks  around 
the  intersection  of  the  plane  with  the  polyhedron  at  a 
cost  of  O(logn)  per  step.  This  leads  to  an  O(nlog^n) 
time  algorithm  for  a  convex  polyhedron  P  with  n  ver¬ 
tices. 

Theorem  5  Given  a  convex  polyhedron  P  with  n  ver¬ 
tices,  one  can  decide  in  O(nlog^n)  time  and  linear 
space  whether  P  is  castable  when  the  cast  parts  must 
be  removed  in  orthogonal  or  opposite  directions. 

4.3  A  data  structuring  approach 

The  second,  0(n  log^n)  time  solution  described  above 
only  applies  to  convex  polyhedra.  By  using  data  struc¬ 
tures,  we  will  improve  upon  the  quadratic  time  results 
of  Theorem  4  for  simple  polyhedra  as  well.  Unfortu¬ 
nately,  the  storage  requirements  increase  with  the  data 
structuring  method.  The  idea  is  to  test  every  candidate 
casting  plane  by  querying  with  it  in  a  data  structure 
(instead  of  applying  linear  programming).  The  query 
determines  whether  h  really  is  a  valid  casting  plane  for 
the  polyhedron  P.  The  preprocessing  of  the  data  struc¬ 
ture  should  be  less  than  quadratic  and  the  query  time 
should  be  less  than  linear  in  order  to  beat  the  quadratic 


time  bound.  It  turns  out  that  the  data  structure  is  ex¬ 
actly  the  same  for  the  three  versions  of  removing  the 
cast;  only  the  query  algorithms  are  different. 

Let  P  be  a  polyhedron.  For  any  vertex  v  e  V,  de¬ 
fine  F{v)  as  the  set  of  facets  incident  to  v,  and  for  any 
subset  V  C  V,  define  P(U')  as  the  set  of  facets  inci¬ 
dent  to  at  least  one  vertex  of  V’.  We  make  the  following 
observation: 

Observation  2  For  any  plane  h,  we  have  U  Fff  = 

We  store  P  in  a  2-level  data  structure  T.  The  pri¬ 
mary  tree  is  a  3-dimensional  partition  tree  that  stores 
the  set  V  of  vertices  of  P,  see  Matousek  [15,  16]  and 
Agarwal  and  Sharir  [2]  for  example.  They  show  that 
for  any  constant  e  >  0,  a  structure  of  size  and  prepro¬ 
cessing  time  0(n^/^+")  exists,  such  that  for  any  query 
plane  h,  the  subset  C  U  of  vertices  above  h  can 
be  retrieved  in  0(n^/2+")  canonical  nodes  in  0(n^/^+") 
time.  For  any  node  5  of  T,  corresponding  to  a  canoni¬ 
cal  subset  Vs,  the  secondary  structure  at  S  stores  Vs  as 
follows.  Recall  from  Subsection  2.1  that  for  a  facet  /, 
$(/)  is  one  of  the  closed  half-spaces  of  which  the  bound¬ 
ing  plane  supports  /,  and  4'o(/)  is  T(/)  translated  such 
that  the  bounding  plane  contains  the  origin.  Let  F{Vs) 
be  the  set  of  facets  incident  to  at  least  one  vertex  of  Vs . 
Let  Bs  be  the  cone  n{^o(/)  j  /  G  F{Vs)}  with  apex  at 
the  origin.  The  secondary  structure  is  simply  an  array 
or  balanced  binary  tree  that  stores  the  facets  of  Bs  in 
cyclic  order  around  the  apex.  The  secondary  structure 
allows  for  queries  with  a  half-line  starting  at  the  origin, 
to  determine  whether  the  half-line  is  contained  in  Bs. 
This  query  is  in  fact  a  2-dimensional  query  to  determine 
whether  a  point  lies  in  a  convex  polygon. 

Suppose  that  we  wish  to  determine  whether  /i  is  a 
valid  casting  plane  for  orthogonal  cast  removal.  Then 
we  search  with  h  in  T  and  determine  the  canonical 
nodes  of  T  with  respect  to  h,  in  particular,  the  set 
A+  =  {^1, . . . ,  dt}  of  nodes  of  which  the  associated  sets 
Wj , . . . ,  Vs,  are  a  partition  of  V^^ .  Let  £'^  be  the  upward 
half-line  normal  to  h  starting  at  the  origin.  For  each  of 
the  nodes  Si,  we  query  with  !')][  in  the  secondary  struc¬ 
ture  to  determine  by  binary  search  whether  e  Bs, . 
If  the  answer  is  positive  for  all  nodes  Si,...,  St,  then 
Pnh+  is  a  terrain  with  respect  to  the  direction  normal 
to  h  (and  parallel  to  £^).  The  query  is  repeated  for  h~, 
to  determine  whether  Pnh~  is  a  terrain  with  respect  to 
the  direction  normal  to  h.  If  this  is  also  the  case,  then 
h  is  a  casting  plane  of  P  for  orthogonal  cast  removal. 
Since  t  =  0{n^^'^^‘^),  the  query  time  is  logn). 

Next  we  consider  cast  removal  in  opposite  direc¬ 
tions.  The  query  is  a  variation  to  the  previous  solu¬ 
tion.  We  determine  both  sets  A+  and  A“  of  canonical 
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nodes  for  the  queries  with  /i+  and  h  ,  respectively.  Let 
A+  =  and  A"  =  . . . ,  We  wish 

to  determine  whether  the  common  intersection  of  the 
t  cones  intersected  with  the  reflection  in 

the  origin  of  the  common  intersection  of  the  s  cones 
Bs'  ,■  ■■,  Bg'^ ,  is  non-trivial.  One  can  decide  whether  the 
common  intersection  is  non-trivial  using  the  algorithm 
of  Reichling  [22].  He  shows  how  to  And  an  extremal 
point  in  the  common  intersection  of  k  convex  n-gons  in 
0{k\og^n)  time.  In  our  case,  we  ‘reflect’  all  operations 
on  the  second  set  of  cones.  (Alternatively,  we  could 
store  the  reflected  cones  explicitly  at  every  node  and 
work  with  these,  but  this  is  not  necessary.)  The  query 
time  is  0((s  -H  t)  log^  n)  =  log^  n)  time. 

Thirdly,  we  consider  the  query  in  the  same  structure 
to  solve  arbitrary  cast  removal.  We  remark  that  for  ar¬ 
bitrary  cast  removal,  we  can  determine  using  0(n  log  n) 
queries  whether  a  casting  plane  exists,  even  though  we 
do  not  have  a  subquadratic  bound  on  the  number  of 
casting  planes  in  this  case.  This  will  be  shown  in  the 
next  section. 

Let  h  be  the  casting  plane.  We  determine  the  set 
A+  as  before;  let  ,  •  ■  • ,  Bg,  be  the  cones  that  are 
stored  at  the  set  A+  canonical  nodes.  Now  we  have 
to  determine  whether  the  common  intersection  of  these 
cones  is  non-trivial.  Any  half-line  starting  at  the  origin 
and  in  the  common  intersection  of  the  cones  represents 
a  direction  with  respect  to  which  Pn/i+  is  a  terrain.  A 
half-line  in  the  common  intersection  of  the  t  cones  can  be 
determined  in  0(t  log^  n)  =  log^  n)  time  using 

Reichling’s  algorithm.  If  no  such  half-line  exists,  then 
P  0  h'^  is  not  a  terrain  with  respect  to  any  direction. 
If  P  n  is  a  terrain  for  some  direction,  we  repeat  the 
query  to  test  whether  P  D  h~  is  a  terrain. 

Theorem  6  For  any  constant  e  >  0  and  any  simple 
polyhedron  P  with  n  vertices,  a  data  structure  of  size 
and  prepreocessing  time  0(71^/^+*^)  exists,  such  that  for 
any  query  plane  h,  one  can  determine  in  time 

whether  h  is  a  casting  plane  for  P  in  any  of  the  versions 
for  removing  the  cast. 

Remark:  In  fact,  we  have  also  shown  that  for  any 

query  half-space  /i+ ,  we  can  determine  within  the  same 
bounds  whether  P  n  is  a  terrain  in  some  direc¬ 
tion.  Furthermore,  by  choosing  a  different  partition 
tree  for  the  primary  tree,  we  can  trade  query  time  for 
storage  space,  see  e.g.  Chazelle,  Sharir  and  Welzl  [7], 
Matousek  [16]  and  Agarwal  and  Sharir  [2].  For  any 
n  <  A  <  a  data  structure  of  size  and  preprocess¬ 
ing  time  0(A’-+'')  exists  with  query  time 
The  theorem  above  states  the  version  we  need  for  the 
casting  problem. 


Corollary  3  For  any  constant  e  >  0  and  any  sim¬ 
ple  polyhedron  P  with  n  vertices,  one  can  determine  in 
Q(^3/2+e)  and  space  whether  P  is  castable  when 
the  cast  parts  must  be  removed  orthogonal  to  the  casting 
plane,  or  in  opposite  directions. 

5  Algorithms  for  arbitrary  cast 
removal 

In  this  section  we  study  the  most  general  casting  prob¬ 
lem  of  this  paper:  determine  whether  a  simple  polyhe¬ 
dron  P  is  castable  when  the  cast  parts  may  be  removed 
in  arbitrary  directions.  Using  results  of  the  previous  sec¬ 
tions,  we  can  obtain  an  O(n®/^+0  time  algorithm:  we 
devise  a  data  structure  as  in  the  previous  section  to  test 
all  candidate  O(n^)  casting  planes.  We  will  do  better 
in  this  section.  Using  Lemmas  5  and  6  and  one  more 
observation  on  arbitrary  cast  removal,  we  first  obtain 
a  simple  0(n^  log  n)  time  and  linear  space  algorithm, 
and  then  a  more  complicated  time  and  space 

algorithm. 

Let  P  be  a  polyhedron.  We  first  test  whether  P  ad¬ 
mits  opposite  cast  removal  using  the  simple  O(u^)  time 
algorithm  of  Theorem  4.  If  so,  we  are  done.  Otherwise, 
if  P  is  convex,  then  by  Lemma  5  we  only  have  to  con¬ 
sider  casting  planes  that  contain  some  edge  of  P.  If  P 
is  non-convex,  then,  by  Lemma  6,  we  only  have  to  con¬ 
sider  casting  planes  that  contain  an  edge  of  the  convex 
hull  of  P. 

Observation  3  Let  P  be  a  polyhedron  and  h  a  plane 
that  contains  an  edge  e  of  the  convex  hull  of  P.  Assume 
w.l.o.g.  that  e  is  horizontal  and  that  a  vertical  plane 
exists  which  supports  e  and  has  P  —  cl{e)  completely  to 
the  one  side. 

•  If  Pnh+  is  a  terrain  and  Pr\h~  is  not  a  terrain, 
then  no  plane  p  containing  e  for  which  P  n  C 

is  a  casting  plane. 

•  //  P  n  /i+  is  not  a  terrain  and  Pnh^  is  a  terrain, 
then  no  plane  p  containing  e  for  which  P  Dh'^  C 
p'^  is  a  casting  plane. 

•  1/  P  n  /i+  and  Pr\h~  are  both  not  a  terrain,  then 
no  plane  containing  e  is  a  casting  plane. 

The  above  observation  sets  up  a  binary  search  for  a 
casting  plane  that  contains  some  edge  e  of  the  convex 
hull  of  P.  A  plane  h  rotating  about  e  encounters  the 
n  —  2  other  vertices  of  P  in  some  order.  One  binary 
search  step  on  this  order  tests  whether  Pn/i+  and  Pn/i“ 
are  terrains  using  linear  programming.  Therefore,  we 
have  the  following. 
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Theorem  7  Given  a  simple  polyhedron  P  with  n  ver¬ 
tices.  one  can  determine  in  O(n^logn)  time  and  linear 
space  whether  a  casting  plane  for  P  exists,  when  the  cast 
parts  can  he  removed  in  arbitrary  directions. 

As  shown  in  Subsection  4.3,  we  can  preprocess  P 
into  a  data  structure  such  that  any  plane  can  be  tested 
in  time.  Since  0{nlogn)  planes  are  tested 

by  the  above  procedure,  the  test  part  can  be  improved 
to  time.  However,  how  can  we  obtain  the  or¬ 

der  of  , ... ,  Vn-2  without  sorting?  Again,  the  solution 
lies  in  data  structuring.  Notice  first  that  the  order  of 
ui, . . .  ,Vn-2  is  not  needed  explicitly.  In  the  first  step, 
the  vertex  that  is  the  median  u„/2-i  must  be  deter¬ 
mined,  and  in  the  following  steps  a  median  in  one  of 
the  two  halves. 

The  data  structure  that  is  needed  solves  the  follow¬ 
ing  query  problem;  Given  a  query  edge  e  such  that  there 
exists  a  plane  h  containing  e  that  has  the  interior  of  P 
completely  to  the  one  side,  and  an  integer  k,  find  the 
A:-th  vertex  of  P  that  is  encountered  by  h  when  it  ro¬ 
tates  about  e.  Dualization  yields  a  more  familiar  query 
problem:  preprocess  a  set  of  n  planes  (dual  to  the  ver¬ 
tices  of  P),  such  that  for  any  given  query  ray,  the  k-th 
intersection  point  with  the  planes  can  be  determined. 
The  query  ray  is  contained  in  the  line  dual  to  the  line 
supporting  e,  and  the  starting  point  of  the  ray  is  any 
point  dual  to  a  plane  containing  e  that  has  the  interior 
of  P  completely  to  the  one  side. 

Let  \  vi  G  P}  be  the  set  of  planes 

dual  to  the  vertices  of  P,  preprocess  them  into  a  data 
structure  for  line  segment  intersection  counting,  as  given 
by  Agarwal  and  Matousek  [1].  They  show  that  for 
any  e  >  0,  a  structure  of  size  and  preprocessing  time 
Q(jr3/2+e)  exists,  such  that  segment  intersection  count¬ 
ing  queries  can  be  answered  in  time.  Further¬ 

more,  they  show  how  the  same  structure  can  be  used  to 
find  the  k-th  intersection  point  of  a  query  ray  with  !){¥) 
in  0(71^/^+*^)  time.  Let  the  query  ray  be  parameterized 
by  g  -I-  A  •  -ii,  A  >  0,  where  g  is  a  point  and  F  is  a  vector  in 
3-dimensional  space.  In  our  application,  if  the  fc-th  in¬ 
tersection  point  does  not  exist,  and  the  last  intersection 
point  is  the  g-th,  then  the  query  should  be  continued  in 
‘wrap-around’  mode:  find  the  {k  -  j)-th  plane  for  the 
query  ray  along  the  same  line  and  in  the  same  direction, 
but  with  g  translated  in  direction  -u  to  infinity.  When 
this  happens,  the  plane  rotating  about  e  in  primal  space 
rotates  past  a  vertical  orientation.  These  adaptations  to 
the  query  algorithm  can  easily  be  made  within  the  same 
asymptotic  time  bounds.  Hence,  we  conclude: 

Theorem  8  For  any  simple  polyhedron  P  with  n  ver¬ 
tices  and  any  constant  e  >  0,  one  can  determine  in 
Q(.^3/2+e)  whether  P  is  castable,  when 

the  cast  parts  can  be  removed  in  arbitrary  directions. 


6  Conclusions  and  open  prob¬ 
lems 

This  paper  studied  the  geometric  version  of  the  prob¬ 
lem  of  determining  whether  a  simple  polyhedron  can  be 
manufactured  using  casting.  It  was  assumed  that  there 
are  two  cast  parts,  and  each  has  be  removed  with  a 
single  translation.  We  presented  simple  algorithms  that 
use  O(n^)  or  0(n^  logn)  time  and  linear  space  which  are 
based  on  linear  programming.  Furthermore,  we  showed 
that  theoretically,  better  results  can  be  obtained  us¬ 
ing  partition  trees  and  their  variants.  This  leads  to  an 
(5(^3/2+€)  algorithm.  The  n^-factors  can 

be  replaced  by  polylogarithmic  factors  if  we  use  the  par¬ 
tition  trees  of  Matousek  [16].  Finally,  as  a  by-product, 
we  obtained  a  combinatorial  bound  on  the  number  of 
planes  intersecting  a  polyhedron  in  edges  only,  and  on 
the  summed  number  of  edges  in  these  planes. 

Manufacturing  applications  have  not  been  studied 
much  in  computational  geometry.  There  are  quite  a 
large  number  of  open  problems  to  be  solved  in  this  area. 
We  first  list  some  open  problems  related  to  this  paper, 
and  then  list  a  few  others  in  cast  design  that  deserve 
attention. 

1.  What  is  the  maximum  number  of  distinct  casting 
planes  in  case  of  arbitrary  cast  removal?  This  pa¬ 
per  shows  O(n^),  whereas  the  only  lower  bound 
we  have  is  linear. 

2.  For  a  convex  polyhedron  P,  what  is  the  maxi¬ 
mum  summed  complexity  of  the  intersection  of  all 
weakly  distinct  casting  planes  with  P?  This  paper 
shows  0(n  log  n)  in  case  of  opposite  cast  removal, 
but  the  trivial  lower  bound  is  linear. 

3.  Give  simple  algorithms  for  casting  that  improve 

our  simple  O(nlog^n),  O(n^)  and  0(n^  logn) 
time  algorithms.  Find  algorithms  that  improve 
upon  our  time  algorithms.  (De  Berg  [9] 

observed  that  orthogonal  cast  removal  for  simple 
polyhedra  can  be  done  in  time  by  using 

a  different  data  structure  to  test  candidate  casting 
planes.) 

4.  Suppose  that  we  wish  to  determine  castability  of 
an  object  with  non-linear  boundaries.  Give  (sim¬ 
ple)  algorithms  that  deal  with  this  problem. 

5.  For  some  casting  processes,  is  not  necessary  that 
the  cast  parts  must  be  separated  by  a  plane.  In 
these  cases,  every  convex  polyhedron  is  castable. 
However,  no  algorithms  are  known  for  cast  re¬ 
moval  of  simple  polyhedra. 
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Abstract 

Let  G  =  (y,  E)  be  a  n^vertex  connected  graph  with 
positive  edge  weights.  A  subgraph  G'  is  a  i-spanner 
if  for  all  u,  G  V,  the  distance  between  u  and  v 
in  the  subgraph  is  at  most  t  times  the  correspond¬ 
ing  distance  in  G.  We  design  an  O(nlog^n)  time 
algorithm  which,  given  a  set  F  of  n  points  in 
dimensional  space,  and  any  constant  t  >  1,  pro¬ 
duces  a  t-spanner  of  the  complete  Euclidean  graph 
of  V.  This  algorithm  retains  the  spirit  of  a  recent 
0{n^  log  Ti)-time  greedy  algorithm  which  produces  t- 
spanners  with  a  small  number  of  edges  and  a  small 
total  edge  weight;  we  use  graph  clustering  tech¬ 
niques  to  achieve  a  more  efficient  implementation. 
Our  spanners  have  similar  size  and  weight  sparse¬ 
ness  as  those  constructed  by  the  greedy  algorithm. 


1  Introduction 


Let  G  =  {y,  E)  be  a  n- vertex  connected  graph  with 
positive  edge  weights.  A  subgraph  G'  is  a  t-spanner 
if  for  all  u,v  G  V,  the  distance  between  u  and  v 
in  the  subgraph  is  at  most  t  times  the  correspond¬ 
ing  distance  in  G.  The  value  of  t  is  known  as  the 
stretch  factor  of  G' .  Let  F  be  a  set  of  n  points  in 
^-dimensional  space.  An  Euclidean  graph  has  F  as 
its  vertices,  and  its  edges  are  straight  line  segments 
joining  pairs  of  points,  with  edge  weights  being  their 
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Euclidean  distances.  The  complete  Euclidean  graph 
contains  all  n(n  —  l)/2  edges. 

Although  complete  graphs  represent  ideal  com¬ 
munication  networks,  they  are  expensive  to  build 
in  practice,  and  sparse  spanners  represent  low  cost 
alternatives.  Sparseness  of  spanners  is  usually  mea¬ 
sured  by  various  criteria  such  as  few  edges,  small 
weight,  and  small  degree.  Spanners  for  complete 
Euclidean  graphs  as  well  as  for  arbitrary  weighted 
graphs  find  applications  in  robotics,  network  topol¬ 
ogy  design,  distributed  systems,  design  of  parallel 
machines,  and  have  also  been  investigated  by  graph 
theorists.  Most  of  the  recent  literature  on  spanners 
are  referenced  in  [4,  9]. 

Let  MST  refer  to  a  minimum  spanning  tree  of 
the  complete  graph.  For  complete  Euclidean  graphs 
in  2-dimensions,  an  0(n  log  Ti)-time  algorithm  is  de¬ 
scribed  in  [7]  which  produces  t-spanners  with  0(n) 
edges  and  weight  0{wt{MST)),  which  is  optimal. 
The  problem  gets  more  difficult  in  higher  dimen¬ 
sions.  In  I:-dimensional  space,  there  are  several  al¬ 
gorithms  that  run  in  0(n  log n)  time  [8,  10].  How¬ 
ever  they  only  guarantee  that  the  number  of  span¬ 
ner  edges  is  0(n);  the  total  weight  could  be  arbi¬ 
trarily  large.  In  [4]  an  0(n  log  n)  time  algorithm  is 
described  which  generates  spanners  in  ife-dimensions 
with  weight  O {log  n)  ■  wt{M ST).  The  nature  of  this 
algorithm  is  such  that  the  O(logn)  factor  cannot 
be  eliminated,  so  the  algorithm  always  produces  a 
suboptimally  weighted  spanner. 

In  contrast,  in  [1,  4,  5]  a  simple  greedy  algorithm  is 
presented  which  generates  spanners  with  0(n)  edges 
and  a  weight  which  is  likely  to  be  asymptotically  op¬ 
timal  for  all  dimensions.  In  [4]  it  is  shown  that  for 
any  dimension  k  the  weight  is  O(logn)  •  wt{MST), 
and  in  [5]  it  is  shown  that  for  ib  <  3  the  weight 
is  0{wt{MST)).  In  [5]  it  is  conjectured  that  the 
weight  is  actually  asymptotically  optimal  in  any  di¬ 
mension  and  that  the  O(logn)  factor  can  be  elim- 
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inated  by  a  more  careful  analysis.  The  constants 
implicit  in  the  O-notation  depend  only  on  t  and  k. 
The  greedy  algorithm  has  also  been  used  to  gener¬ 
ate  spanners  of  arbitrary  weighted  graphs,  and  there 
again,  it  is  superior  to  other  algorithms  in  the  sense 
that  it  produces  spanners  with  very  small  weight. 

However,  the  greedy  algorithm  is  handicapped  by 
a  slow  running  time,  mainly  because  it  has  to  make 
a  large  number  of  shortest  path  queries.  In  the  Eu¬ 
clidean  case,  the  best-known  implementation  has  a 
running  time  of  0(n®  log  n).  In  this  paper,  we  de¬ 
sign  an  0(n  log^  n)-time  algorithm  to  construct  t- 
spanners,  which  retains  the  spirit  of  the  greedy  algo¬ 
rithm.  This  new  algorithm  is  faster  because  we  use 
graph  clustering  techniques  to  answer  shortest  path 
queries  only  approximately.  The  spanners  produced 
by  this  algorithm  have  the  same  asymptotic  size 
and  weight  bounds  as  the  spanners  produced  by  the 
greedy  algorithm.  Furthermore,  any  improvements 
in  these  bounds  for  the  greedy  algorithm  spanners  is 
likely  to  lead  to  similar  improvements  in  the  bounds 
for  the  spanners  produced  by  this  algorithm.  Graph 
clustering  is  not  new,  and  in  fact  there  are  several 
algorithms  which  use  a  variety  of  clustering  tech¬ 
niques  to  generate  spanners  of  Euclidean  as  well  as 
arbitrary  weighted  graphs  [2,  3,  6,  8,  10].  What  is 
interesting  is  the  way  in  which  we  exploit  a  simple 
clustering  technique  in  our  implementation. 

In  the  next  section  we  introduce  notation,  summa¬ 
rize  previous  relevant  research,  and  give  an  overview 
of  our  algorithm.  Section  3  describes  the  clustering 
technique  used.  Section  4  describes  the  algorithm. 
We  conclude  with  some  open  problems. 

2  Preliminaries 

Let  G  =  (V]  E)  be  a  n-vertex  connected  graph  with 
positive  edge  weights.  The  weight  of  a  single  edge  e 
is  wt{e),  the  weight  of  a  set  of  edges  E'  is  wt{E'), 
and  the  weight  of  the  entire  graph  is  wt{G).  Let 
spa{u,v)  be  the  weight  of  the  shortest  path  in  G 
between  u  and  v.  Let  d(u,  v)  refer  to  the  Euclidean 
distance  between  points  u  and  v  in  Euclidean  space. 

In  [1,  4],  a  greedy  algorithm  is  developed  for  con¬ 
structing  i-spanners  of  arbitrary  weighted  graphs, 
for  any  t  >  1.  We  reproduce  the  algorithm  in  Fig¬ 
ure  1.  Let  us  apply  this  algorithm  to  a  complete 
Euclidean  graph  in  ib-dimensional  space  and  let  the 
output  graph  be  G'  =  (V,  E'). 

It  is  easy  to  see  that  the  output  is  a  t-spanner, 
since  for  any  (u,  v)  that  is  not  an  edge  of  G', 
spG'(u,v)  <  t  •  d(u,v).  In  addition,  for  any  (u,  v) 


Algorithm  GREEDY(G  =  (Y,  E),  t) 
begin 

order  E  by  non-decreasing  weights 

E'  ^<b,G'  ^{V,E') 

for  each  edge  e  =  (u,  u)  €  E'  do 

if  spG'  (u,  v)  >  t  ■  wt{e)  then 

E'*-E'U  {c},  G'  ^  (V,  E') 

output  G' 
end. 


Figure  1:  The  greedy  algorithm 

that  is  an  edge  of  G',  the  weight  of  the  second 
shortest  path  between  u  and  v  in  G'  is  greater  than 
t  •  d(u,  v).  The  second  property  is  crucial  in  showing 
that  the  total  weight  of  the  spanner  is  small.  In  [4] 
it  is  shown  that  G'  has  0(n)  edges  and  has  a  total 
edge  weight  of  O(logn)  ■  wt(^MST),  where  MST  is 
a  minimum  sprinning  tree  of  V.  In  [5]  it  is  shown 
that  for  ife  <  3,  the  greedy  algorithm  outputs  a  t- 
spanner  of  weight  0{wt[M ST)).  In  that  paper  it  is 
conjectured  that  this  result  is  true  for  ^-dimensions. 

In  this  paper,  our  algorithm  mimics  the  greedy  al¬ 
gorithm  and  constructs  a  t-spanner  G'  with  asymp¬ 
totically  same  size  and  weight  sparseness.  Apart 
from  the  very  short  edges  of  the  spanner,  all  other 
edges  (u,  v)  satisfy  the  property  that  the  weight  of 
the  second  shortest  path  between  u  and  v  in  G'  is 
greater  than  c  •  d(u,  v),  where  c  is  a  constant  depen¬ 
dent  on  t.  Thus,  techniques  similar  to  those  used 
in  [4,  5]  can  be  used  to  prove  the  same  sparseness 
bounds.  Details  appear  later. 

The  following  concept  will  be  useful  in  later  sec¬ 
tions.  We  define  a  certain  kind  of  “partial”  spanners 
of  complete  Euclidean  graphs.  Let  V  be  a  set  of  n 
points  in  jb-dimensional  space,  t  >  1  and  W  >  0. 
A  graph  G'  is  a  (t,  W)-spanner  if  for  all  u,  i»  €  V 
such  that  d(Ti,  v)  <  W,  spa'{u,v)  <  t  •  d{u,v).  In 
other  words,  G'  provides  short  paths  between  pairs 
of  points  whose  inter-distance  is  at  most  W.  For 
instance,  at  any  stage  of  the  greedy  algorithm,  if 
the  edge  most  recently  examined  has  weight  L,  the 
partially  constructed  spanner  is  a  [t,  L)-spanner. 

2.1  Overview  of  the  algorithm 

We  first  run  the  0(n  log  n)  time  algorithm  in  [8]  to 
get  a  graph  G  =  (Y,  E),  which  is  a  v^-spanner  of  the 
complete  Euclidean  graph.  Although  it  may  have  a 
large  weight,  it  heis  0{n)  edges.  We  shall  then  find  a 
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V^-spanner  G'  of  G.  Clearly,  G'  will  be  a  f-spanner 
of  the  complete  graph. 

If  we  were  to  run  the  original  greedy  algorithm 
on  G  to  get  G',  it  will  have  to  answer  0(n)  short¬ 
est  path  queries  on  G'.  Our  algorithm  does  better 
by  speeding  up  the  queries.  We  group  the  edges  in 
E  into  O(logn)  groups,  such  that  each  group  has 
edges  that  are  more  or  less  of  equal  weight.  Then 
we  examine  the  edges,  going  from  one  group  to  the 
next  in  order  of  increasing  weight,  and  at  the  same 
time  building  a  partially  constructed  spanner,  G'. 

Immediately  before  processing  a  group  of  edges 
(of  weight  approximately  W),  the  algorithm  cre¬ 
ates  a  cluster- graph  H  as  follows.  Roughly  speak¬ 
ing,  the  vertices  of  G'  are  covered  by  a  set  of  clus¬ 
ters  such  that  each  cluster  consists  of  vertices  that 
are  within  0{W)  distance  from  each  other  in  G' . 
Each  cluster  is  collapsed  into  a  representative  ver¬ 
tex,  namely  its  center,  to  form  a  single  macro  vertex 
of  H.  These  macro  vertices  are  connected  by  edges 
of  weight  ©(W)  only  if  the  distance  between  their 
centers  in  G'  is  0(W).  Notice  that  all  edge  weights 
of  H  are  more  or  less  the  same.  Thus  distances  in  G' 
are  approximated  by  distances  in  H.  A  crucial  ob¬ 
servation,  as  proved  later,  is  that  in  the  Euclidean 
case  the  maximum  degree  of  H  turns  out  to  be  a 
constant.  Thus  the  shortest  path  query  can  be  an¬ 
swered  in  constant  time  as  follows.  Start  with  a 
macro  vertex  whose  corresponding  cluster  contains 
u,  and  see  whether  v  belongs  to  any  of  the  clusters 
that  are  within  0(\/i)  links  from  the  start  cluster. 
Since  the  degree  is  constant,  and  t  is  a  constant, 
we  need  only  explore  a  constant-sized  portion  of  H. 
Once  we  have  examined  all  the  edges  in  a  group,  we 
create  a  cluster-graph  again  (but  with  a  larger  W), 
and  process  the  next  group  of  edges. 

The  running  time  of  of  our  algorithm  is  dominated 
by  the  periodic  clustering  necessary. 

3  Clustering  a  graph 

In  this  section  we  first  consider  arbitrary  weighted 
graphs.  Let  G'  =  {V,  E')  be  an  any  positive 
weighted  graph,  and  W  >  0.  A  cluster  C  is  a  set 
of  vertices,  with  one  vertex  v  its  center,  such  that 
for  all  u  6  V,  3pG'{u,v)  <  W  implies  that  u  £  C. 
W  is  the  radius  of  the  cluster.  A  cluster- cover  is  a 
set  of  clusters  Gi,  Gj, . . . ,  Cm,  (with  corresponding 
centers  vi,V2, . . .  ,Vm)  such  that  their  union  is  V. 
Clusters  in  a  cluster-cover  may  overlap,  however  a 
cluster  center  does  not  belong  to  any  other  cluster. 

We  describe  an  obvious  algorithm  to  compute 


a  cluster-cover.  First  consider  a  procedure  called 
SINGLE-SOURCE(G',i;,  W),  which  takes  as  in¬ 
put  any  weighted  graph  G',  a  start  vertex  v,  and 
W  >  0.  It  outputs  all  vertices  it  such  that 
spGi{v,u)  <  W.  It  can  be  implemented  as  Dijk- 
stra’s  single-source  shortest  path  algorithm  by  us¬ 
ing  a  heap  and  ensuring  that  the  algorithm  never 
visits  vertices  further  than  W  from  v.  We  now  de¬ 
sign  a  procedure  CLUSTER-COVER(G',  W)  as  fol¬ 
lows.  Select  any  vertex  Vi  of  G',  and  run  SINGLE- 
SOURCE(G',  vi,  W).  The  output  is  a  cluster  C\ 
with  center  vi.  Then  select  any  vertex  vj  not  yet 
visited,  and  run  SINGLE-SOURCE(G',  v^,  W).  The 
output  is  Gj  with  center  vj.  Continue  this  process 
until  all  vertices  are  visited.  For  each  vertex,  the 
procedure  also  computes  the  clusters  to  which  it  be¬ 
longs  as  well  as  its  distance  from  their  respective 
centers. 

Let  G'  =  {V,  E')  be  any  weighted  graph,  W  >  0 
and  0  <  6  <  1/2.  Assume  a  cluster-cover  has 
been  constructed  with  cluster  radius  6W.  A  cluster- 
graph  H  is  defined  as  follows.  The  vertex  set  is 
V.  The  are  two  kinds  of  edges,  intra-cluster  edges 
and  inter-cluster  edges.  For  all  Ci,  and  for  all 
u  £Ci,  [u,  Vi]  is  an  intra-cluster  edge  (we  use  square 
brackets  to  distinguish  cluster-graph  edges  from  the 
edges  of  G').  Inter-cluster  edges  are  between  clus¬ 
ter  centers.  [vi,Vj]  is  an  inter-cluster  edge  if,  either 
spG'{vi,Vj)  <  W,  or  there  exists  c  =  (u,  v)  €  E' 
such  that  u  £  Ci,v  £  Cj.  Cluster-graph  edges  have 
weights,  and  ix)t([u,  v])  is  defined  as  spG'(u,v). 

We  now  describe  a  procedure  called  CLUSTER- 
GRAPH(G',  (5,  W).  It  first  calls  CLUSTER- 
COVER(G',  8W).  In  fact,  during  this  process  all 
intra-cluster  edges  (and  their  weights)  will  also  be 
computed.  Recall  that  inter-cluster  edges  have  to 
satisfy  one  of  two  conditions.  Inter-cluster  edges 
satisfying  the  first  condition  may  be  computed  by 
running  SINGLEl-SOURCE  on  G'  from  each  clus¬ 
ter  center  v,-  and  checking  which  other  cluster  cen¬ 
ters  are  no  further  than  W  from  Vj.  The  remain¬ 
ing  inter-cluster  edges  may  be  computed  as  fol¬ 
lows.  For  all  e  =  (u,  v)  £  E'  do  the  following. 
For  all  cluster  centers  Vi,Vj  such  that  u  belongs 
to  Vj’s  cluster  and  v  belongs  to  vj's  cluster,  add 
the  inter-cluster  edge  with  weight  defined  to 

be  ivt([vi,  uj)  -1-  wt[e)  -f-  iut([v,  Vjj)  (unless  the  inter¬ 
cluster  edge  already  existed  with  a  smaller  weight). 

The  motivation  for  the  cluster-graph  is  that  it  is 
a  simpler  structure  than  G'.  Shortest  path  queries 
in  G'  can  be  translated  into  shortest  path  queries  in 
H,  and  the  latter  can  be  performed  more  efficiently. 
The  following  two  lemmas  are  obvious. 
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Lemma  3.1  Intra-cluster  edges  are  no  larger  that 
SW .  Inter-cluster  edges  are  larger  than  6W ,  but 
no  larger  than  D  +  26W'^,  where  D  is  the 

weight  of  the  largest  edge  in  G' . 

Lemma  3.2  Let  a  path  between  u  and  v  in  H  have 
weight  L,  Then  there  is  a  path  between  u  and  v  in 
G'  with  weight  at  most  L. 

The  next  lemma  is  the  converse,  which  is  harder. 
We  first  introduce  some  definitions.  A  vertex  u  is 
defined  to  be  sufficiently  far  from  vertex  v  if,  (1) 
no  single  cluster  contains  both,  and  (2)  spc  {u,  v)  > 
W  -  26W.  Define  a  cluster-path  in  JEf  to  be  a  path 
where  the  first  and  Icist  edges  are  intrei-cluster  edges, 
but  all  intermediate  edges  are  inter-cluster  edges. 

Lemma  3.3  Let  u  be  sufficiently  far  from  v.  Let 
L\  be  the  weight  of  a  path  between  u  and  v  in  G' . 
Then  there  exists  a  cluster-path  between  u  and  v  in 
H  with  weight  L^  such  that 


Proof  :  This  lemma  says  that  H  approximates 
paths  in  G'  only  for  pairs  of  vertices  which  are  suf¬ 
ficiently  far  apart.  Secondly,  notice  that  the  quality 
of  the  approximation  depends  on  6\  smaller  values 
of  6  make  paths  in  H  closer  in  weight  to  paths  in 
G',  although  S  becomes  denser. 

Let  the  path  from  u  to  v  having  weight  Li  in  G'  be 
P.  We  shall  use  the  notation  P(io,  x)  to  denote  the 
vertices  of  P  between  vertices  w  and  x,  not  including 
w.  We  shall  construct  a  cluster-path  Q  from  u  to 
■u  in  with  weight  L3  as  follows,  follows.  Let  Go 
be  any  cluster  (with  center  vq)  containing  u.  The 
first  edge  of  Q  is  the  intra-cluster  edge  [u,  vq]-  Next, 
among  all  clusters  with  centers  adjacent  to  vo  in  H, 
let  Cl  (with  center  wi)  intersect  the  furthest  vertex 
along  P{u,v),  say  wi.  Add  the  inter-cluster  edge 
[■uo.vi]  to  Q.  Next,  among  all  clusters  with  centers 
adjacent  to  vi  in  H,  let  Cj  (with  center  dj)  intersect 
the  furthest  vertex  along  P{wi,v),  say  luj.  Add  the 
inter-cluster  edge  ['Ui,'U2]  to  Q.  This  process  may 
be  continued  until  we  reach  a  cluster  center,  Vm, 
whose  cluster  contains  v.  At  this  stage,  complete 
Q  by  adding  the  intra-cluster  edge  Figure  2 

illustrates  some  of  this  notions. 

Case  1:  m  =  1.  In  this  case  there  is  only  one  inter¬ 
cluster  edge  along  Q.  Since  t*  is  sufficiently  far  from 
V,  we  know  that 

Li>W-  2SW 


cluster  with  radius  SW 


Figure  2;  Paths  in  G'  and  H 

Now  Lj  =  iu<([u,  Vo])  -f  iut([vo,vi])  -t-  tvt([vi,v]). 
But  iut([u,vo])  and  u)t([vi,  v])  are  each  at  most  SW , 
while  tvt([vo,vi])  =  spcivoi^i)  <  spG'(^Oiu)  -f 
spaifu,  v)  spG'(v,  vi)  <  2SW  4-  Li.  Thus 

2,2  <  +  45W 

We  now  have  two  inequalities  relating  Li,  L2  and 
W.  After  eliminating  W,  we  get 


Case  2;  m  >  2  and  is  even.  In  this  case,  sup¬ 
pose  [vi,t)i+i]  and  [vi+i,'Ui+2]  are  any  two  consecu¬ 
tive  inter-cluster  edges  on  Q.  We  observe  that  the 
sum  of  their  weights  is  greater  than  W .  If  this  were 
not  so,  then  the  edge  [w<,Vi+2]  (which  exists  by  the 
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definition  of  the  cluster-graph)  would  have  instead 
been  added  to  Q  by  the  above  process. 

We  divide  Q  into  portions  Qo,  <?2i  •  •  •,  where  Qji 
is  the  portion  between  V2i  and  V2i+2-  Similarly,  we 
divide  P  into  portions  Pq,  P21  •  •  •,  where  P2i  is  the 
portion  between  the  last  vertex  intersecting  C2i  and 
the  first  vertex  intersecting  €21+2  (see  Figure  2).  We 
shall  first  prove  that  for  any  even  i,  the  weight  of  Q2i 
is  no  more  than  a  constant  (which  depends  upon 
times  the  weight  of  Pj^. 

Let  the  weight  of  P2i  be  p2t  and  that  of  Q2i  be  g2t- 
Since  there  cannot  be  an  inter-cluster  edge  between 
V2i  and  V2i+2,  we  have 

P2i>W-  26W 

Select  r  to  be  any  vertex  of  P2,-  within  the  in¬ 
termediate  cluster  C2i+i.  The  vertex  r  splits  P2i 
into  two  portions.  Let  p'^i  (pjj)  be  the  weight  of 
the  initial  (final)  portions;  thus  p2i  =  P2i  +  p'ii- 
So  wt{[v2i,  V2<+i])  <  P2i  +  26W,  and  similarly 
1^21+2])  <  P2i  +  26W.  Adding  the  two, 

we  get 

92t  <  P2t  +  45W 

We  now  have  two  inequalities  relating  p2i,  521  and 
W.  After  eliminating  W,  we  get 


Summing  over  all  even  values  of  i,  and  taking  into 
account  the  two  intra-cluster  edges  at  either  ends  of 
Q,  we  get 

Since  u  is  sufficiently  far  from  v,  we  know  that 
Li>  W  -  26W.  That  is.  Lx  ■  25/(1  -  26)  >  26W. 
Substituting  for  26W  in  the  above  inequality  and 
simplifying,  we  get 


Case  3;  m  >  3  and  is  odd.  In  this  case,  the  analysis 
will  be  exactly  the  same  as  in  Case  2,  except  that  we 
have  to  account  for  the  last  inter-cluster  edge  along 
Q  and  correspondingly  the  portion  of  P  between 
the  last  two  clusters.  Let  qm-i  be  the  weight  of 
[vm-i,  Vm],  and  let  Pm-i  be  the  weight  of  the  portion 
of  P  between  the  last  vertex  intersecting  C^-i  and 
first  vertex  intersecting  C^.  Clearly  qm-i  <  Pm-i  + 
26W.  The  inequality  does  not  change  if  we  rewrite 


it  as  qm-x  <  'Pm-i  +  26W.  We  then  sum 

up  the  p’s  and  9 ’s  as  in  Case  2,  and  get 

Since  Lx  >  W  —  25 W,  we  get  Li  •  45/(1  —  25)  > 
45W.  Substituting  for  45W  in  the  above  inequality 
and  simplifying,  we  get 


The  constant  in  Case  3  dominates,  which  proves 
the  lemma.  □ 


3.1  Clustering  of  partial  Euclidean 
spanners 

Although  described  for  arbitrary  weighted  graphs, 
in  our  algorithm  we  will  always  be  computing 
cluster-graphs  for  partial  spanners  of  complete  Eu¬ 
clidean  graphs  under  the  following  conditions.  Let 
t>l,  a>0,  0<5<  1/2,  0  >  0,  and  W  >  0.  For  a 
set  V  of  n  points  in  ^-dimensional  space,  let  G'  be 
a  (t,  aW)-8panner  such  that  all  its  edges  are  smaller 
than  ew.  Run  CLUSTER-GRAPH(G',5,  W),  and 
let  H  be  the  computed  cluster-graph.  E  has  several 
additional  properties,  which  are  used  in  analyzing 
the  running  time  of  our  algorithm. 

Lemma  3.4  Let  /3  >  0  Je  any  constant.  Inside  any 
sphere  of  radius  there  can  be  at  most  a  constant 
(which  depends  upon  a,  /?,  5,  t,  and  k)  number  of 
cluster  centers. 

Proof  :  We  first  provide  a  lower  bound  on  the  spa¬ 
tial  separation  between  any  pair  of  cluster  centers. 
Let  Vi,Vj  be  two  cluster  centers,  and  let  d{vi,Vj)  = 
L.  There  are  two  cases,  either  L  >  aW  01  L  <  aW. 
If  it  is  the  latter  case,  then  spci{vi,Vj)  <  tL.  But 
spG'{'0i,Vj)  >  5W  (Lemma  3.1),  hence  L  >  6W/t. 
So  in  either  case,  L  >  min{aW,  6W/t}. 

Given  a  sphere  of  radius  /3W,  we  can  pack  at  most 
a  constant  (which  depends  upon  a,  /3,  5,  t,  and  k) 
number  of  points  inside  it,  such  that  the  spatial  sep¬ 
aration  between  any  pair  of  points  is  greater  than 
min{aW,  6W/t}.  □ 

Lemma  3.5  A  vertex  belongs  to  at  most  a  constant 
(which  depends  upon  a,  6,  t,  and  k)  number  of  clus¬ 
ters. 
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Proof  :  Consider  any  vertex  u.  Let  C  be  a  clus¬ 
ter  containing  u,  with  cluster  center  v.  d{u,v)  < 
spQi  (u,  v)  <  6W.  Consider  a  sphere  of  radius  6W 
centered  at  u.  The  cluster  centers  of  all  clusters 
containing  u  have  to  lie  within  the  sphere.  Invoking 
Lemma  3.4  completes  the  proof. 

Lemma  3.6  The  subgraph  of  H  induced  by  the 
inter- cluster  edges  has  at  most  a  constant  degree 
(which  depends  a,  6,  6,  t,  and  k). 

Proof  :  Let  [uijUj]  be  an  inter-cluster  edge.  By 
Lemma 3.1,  d{vi,Vj)  <  spG'[vi,Vj)  <  max{kr,eW"-h 
26W}.  Consider  a  sphere  of  radius  max{W,  OW  -t- 
25 W}  centered  at  Vi.  The  cluster  centers  adjacent  to 
Vi  have  to  lie  within  the  sphere.  Invoking  Lemma  3.4 
completes  the  proof.  ^ 

We  introduce  one  more  procedure  to  be  used 
by  the  algorithm.  Let  G'  be  a  (t,  o:W)-spanner 
satisfying  the  conditions  laid  out  earlier,  and  let 
H  be  its  cluster-graph.  The  procedure  CHECK- 
PATH(fl',  u,  V,  L)  returns  “true”  if  there  is  a  cluster- 
path  from  u  to  u  in  Lf  with  weight  at  most  L,  and 
“false”  otherwise. 

This  procedure  is  always  called  by  our  spanner 
algorithm  under  fortunate  circumstances.  There  is 
a  constant  7  >  0  such  that,  the  input  values  of  L 
are  always  at  most  'fW ,  Because  of  this  restriction, 
the  procedure  can  be  made  to  run  in  constant  time. 
To  see  this,  consider  initiating  from  u  a  brute-force 
search  for  such  a  cluster-path.  There  are  at  most  a 
constant  number  of  intra-cluster  edges  leeiding  from 
u  to  different  cluster  centers.  The  inter-cluster  edges 
have  each  weight  at  lecist  6W,  thus  if  such  a  cluster- 
path  exists,  it  can  have  at  most  7/5  inter-cluster 
edges.  We  also  know  that  the  subgraph  of  H  in¬ 
duced  by  inter-cluster  edges  has  a  constant  degree 
(Lemma  3.6).  We  can  conclude  that  the  brute-force 
search  initiated  from  u  will  only  have  to  examine  a 
constant  sized  subgraph  of  H,  and  verify  whether  v 
belongs  to  this  subgraph,  and  if  so,  whether  there 
exists  a  cluster-path  from  u  to  1;  of  weight  at  most 
L. 

This  procedure  is  used  in  lieu  of  the  time  con¬ 
suming  shortest  path  queries  of  the  original  greedy 
algorithm. 

4  The  spanner  algorithm 

In  this  section  we  illustrate  an  0{n\o^  n)  time  al¬ 
gorithm  for  constructing  i-spanners  of  complete  Eu- 


Algorithm  FAST-GREEDY(Vit) 


use  algorithm  in  [8],  create  a  \/i-spanner  G  =  (P,  E) 
order  E  by  non-decreasing  weight 
let  the  largest  edge  in  E  have  weight  D 
create  the  intervals 

10  =  {0,D/n], 

11  =  2'D/n]  for  t  =  1, 2, . . . ,  logn 

let  Ei  be  the  (sorted)  edges  of  E  with  weights  in  U 
E'  ^Eo,G'  ^{V,E') 

for  i  <—  1  to  log  n  do 

Wi  2(*-^)l»/n 

H  r-  CLUSTER-GRAPH(G',5,  Wi) 
for  each  edge  e  £  Ei  do 

EXAMINE-EDGE(c) 

output  G' 

end. 


Figure  3:  An  0(n log^  n)-time  algorithm 

clidean  graphs  in  ifc-dimensional  space.  The  algo¬ 
rithm  is  shown  in  Figure  3,  and  one  of  its  subrou¬ 
tines  is  shown  in  Figure  4.  As  one  can  see,  it  retains 
the  spirit  of  the  greedy  algorithm,  except  for  the 
graph  clusterings. 

4.1  Analysis  of  the  algorithm 

We  analyze  three  aspects  of  the  FAST-GREEDY  al¬ 
gorithm.  First,  we  prove  that  G'  is  indeed  t-spanner. 
We  then  show  that  the  spanner  has  a  small  weight. 
Finally  we  show  that  a  proper  implementation  runs 
in  0(n  log^  n)  time. 

Lemma  4.1  The  graph  G'  produced  by  the  FAST- 
GREEDY  algorithm  is  a  t-spanner  of  the  complete 
Euclidean  graph. 

Proof  :  To  see  that  G'  is  a  t-spanner,  consider 
any  edge  c  =  (u,  u)  of  E  which  is  not  added  to 
E'  in  procedure  EXAMINE-EDGE.  A  cluster-path 
(in  fact,  any  path)  from  u  to  u  in  the  cluster-graph 
corresponds  to  an  equivalent  path  in  G'  with  the 
same  weight  (Lemma  3.2).  Thus  e  is  discarded  if 
spa'{u,v)  <  \/t  •  d(u,  u),  which  implies  that  G'  is 
a  v^'Spaimer  of  G,  and  is  thus  a  t-spanner  of  the 
complete  graph.  ^ 

Let  us  now  estimate  the  weight  of  the  spanner. 
The  Eq  edges  do  not  contribute  much  because  their 
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Algorithm  EXAMINE-EDGE(e  =  (u,  v)) 
begin 

if  not  CHECK- PATH(ir,  u,v,\/i-  d{u,  i;))  then 
E'  ^-E'U  {e},  G'  ^  {V,  E') 
for  all  cluster  centers  w,  x  such  that 
u  is  in  w’s  cluster  and 
V  is  in  x’s  cluster 
do 

add  inter-cluster  edge  [tu,  x]  to  H 
wt[[w,  e])  u;t([io,  u])  -f  d{u,  u)-!- 
■wtilv,  e]) 

end. 


Figure  4:  Deciding  whether  an  edge  should  be  added 

total  weight  is  at  most  D,  and  D  <  wt[M ST).  We 
shall  estimate  wt[E'  \  Eq). 

Lemma  4.2  Let  e  =  {u,v)  €  E'  \  Eq.  Then  the 
weight  of  the  second  shortest  path  between  u  and  v 
in  G'  is  greater  than  •  d(u,  v) . 

Proof  :  Let  C  be  the  shortest  simple  cycle  in  G' 
containing  e.  We  have  to  estimate  wt(C)  —  d(u,v). 
Let  Cl  =  (uijUi)  be  the  largest  edge  on  the  cycle. 
Then  ei  £  E\Eo,  and  among  the  cycle  edges  it  is 
examined  last  by  the  algorithm.  Let  us  consider  the 
scenario  while  the  algorithm  is  examining  e\. 

There  is  an  alternate  path  in  G'  from  u\  to  vi  of 
weight  wt[C)  —  <i(ui,ui).  But  since  the  algorithm 
eventually  decides  to  add  ei  to  the  spanner,  at  that 
moment  the  weight  of  each  cluster-path  from  uj  to 
Vi  is  larger  than  vi).  Notice  that  wt(ui,  vi) 

is  larger  than  Wj,  where  6Wi  is  the  current  cluster 
radius.  This  implies  that  ui  and  vi  are  not  con¬ 
tained  in  any  one  cluster.  Thus  ui  is  sufficiently 
far  from  vi.  By  Lemma  3.3  this  implies  that  the 
weight  of  each  path  in  G'  from  Ui  to  vi,  in  partic¬ 
ular  wt{C)  —  d(ui,  vi),  is  larger  than  Vt  •  • 

d(ui,vi).  Substituting  the  value  of  6  selected  by 
the  algorithm,  this  simplifies  to  •  d[u\,  vi).  Since 
wt{C)  —  d{u,v)  >  wt{C)  —  d(ui,vi),  the  lemma  is 
proved.  CH 

Lemmas  4.1  and  4.2  are  crucial  in  proving  that 
E'  \  Eo  has  a  small  weight.  Using  techniques  almost 
identical  to  those  in  [4,  5],  we  can  derive  a  bound  on 
the  weight  of  E'  \  Eq,  which  is  asymptotically  the 
same  as  the  weight  of  the  spanner  produced  by  the 


greedy  algorithm.  We  omit  details  from  this  version 
of  the  paper. 

We  now  analyze  the  running  time  of  the  algo¬ 
rithm. 

Lemma  4.3  At  any  stage,  if  the  most  recently  ex¬ 
amined  edge  has  weight  L,  at  that  stage  G'  is  a 
{t,  L/t)-spanner. 

Proof  :  Consider  in  general  any  i-spanner  G'  and 
let  u  and  v  be  any  pair  of  vertices.  Then  spc'  (^i  < 

t  •  d{u,  v),  which  implies  that  any  edge  on  this  path 
has  weight  at  most  t  •  d(u,  v).  Thus  in  our  case,  the 
edges  to  be  examined  in  the  future  (weights  larger 
than  L)  cannot  contribute  to  short  spanner  paths 
between  points  whose  spatial  separation  is  at  most 
L/t.  □ 

The  above  lemma  implies  that  each  cluster-graph 
H  generated  by  the  CLUSTER-CRAPH  procedure 
(prior  to  the  examination  of  edges  in  Ei)  satisfies  all 
the  properties  and  lemmas  discussed  in  Section  3.1. 
During  the  processing  of  Ei,  new  inter-cluster  edges 
may  be  added  by  the  EXAMINE-EDCE  procedure, 
but  H  still  satisfies  those  properties. 

Lemma  4.4  FAST-CREEDY  runs  in  O(nlog^n) 
time. 

Proof  :  The  initial  stages  of  the  algorithm  involve 
a  call  to  the  spanner  algorithm  in  [8],  and  sorting, 
which  together  take  0(n  log  n)  time. 

Consider  EXAMINE-EDCE.  As  seen  earlier,  a 
call  to  CHECK-PATH  takes  constant  time.  Adding 
the  new  inter-cluster  edges  takes  constant  time,  be¬ 
cause  there  are  only  a  constant  number  of  clusters 
containing  either  of  the  end  points  of  the  processed 
edge.  Thus  each  call  to  EXAMINE-EDCE  takes 
constant  time.  EXAMINE-EDCE  is  itself  called 
0(n)  times. 

Consider  CLUSTER-CRAPH.  This  procedure 
first  calls  CLUSTER-COVER.  On  partial  spanners, 
the  latter  can  be  made  to  run  in  0(n  log  n)  time  be¬ 
cause  there  are  0(n)  edges  and  each  vertex  is  visited 
by  the  SINGLE-SOURCE  procedure  at  most  a  con¬ 
stant  number  of  times  (Lemma  3.5),  thus  there  are 
overall  0{n)  heap  operations,  each  taking  O(logn) 
time.  The  CLUSTER-COVER  procedure  can  simul¬ 
taneously  build  the  intra-cluster  edges  of  the  cluster- 
graph.  Inter-cluster  edges  are  built  in  two  stages.  In 
the  first  stage,  the  SINGLE-SOURCE  procedure  is 
run  from  every  cluster  center,  and  in  a  manner  sim¬ 
ilar  to  the  proof  of  Lemma  3.5,  we  can  show  that 
each  vertex  is  visited  at  most  a  constant  number  of 
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times.  This  stage  therefore  takes  0(n  log  n)  time.  In 
the  second  stage,  each  edge  (it,  d)  of  G'  is  checked; 
there  are  only  a  constant  number  of  clusters  con¬ 
taining  It  or  V,  thus  this  stage  can  be  processed  in 
linear  time.  Thus  each  call  to  CLUSTER-GRAPH 
takes  O(Tilogn)  time.  CLUSTER-GRAPH  is  itself 
called  logn  times. 

Hence  the  time  complexity  of  FAST-GREEDY  is 
O(nlog^n).  ^ 


5  Conclusions 

In  this  paper  we  show  how  a  simple  and  natural 
graph  clustering  technique  can  be  effectively  used  in 
speeding  up  an  existing  greedy  algorithm  for  con¬ 
structing  Euclidean  graph  spanners.  We  conclude 
by  listing  some  open  problems. 

1.  Is  it  possible  to  reduce  the  running  time  to 
O(nlogn)?  This  may  require  doing  some  sort 
of  clustering  which  exploits  earlier  clusterings. 
One  problem  with  such  an  approach  is  that 
the  error  introduced  in  path  length  estimations 
(Lemma  3.3)  may  multiply  beyond  a  constant 
factor. 

2.  The  original  greedy  algorithm  has  been  success¬ 
ful  in  producing  small  weight  spanners  for  ar¬ 
bitrary  weighted  graphs  [4].  There  exist  other 
spanner  algorithms  for  general  graphs  which  are 
faster,  but  none  produce  spanners  with  smaller 
weight.  It  would  be  interesting  if  we  can  effi¬ 
ciently  implement  the  greedy  algorithm  for  ar¬ 
bitrary  weighted  graphs.  Of  course,  we  would 
not  be  able  to  exploit  several  nice  properties 
which  arise  in  the  geometric  case,  such  eis  con¬ 
stant  degree  cluster-graphs. 

3.  The  conjecture  in  [5]  that  for  any  dimensions 
the  weight  of  the  spanner  produced  by  the 
greedy  algorithm  is  0{wt{M ST))  needs  to  be 
resolved.  If  proven  true,  it  is  likely  to  imply  that 
asymptotically  optimally  weighted  spanners  in 
any  dimensions  can  be  computed  in  0(n  log  n) 
time. 

4.  The  clustering  technique  used  in  this  paper  is 
simple  and  natural;  it  would  interesting  to  see 
whether  it  has  other  applications  in  problems 
involving  geometric  graphs,  especially  in  speed¬ 
ing  up  existing  algorithms. 
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Abstract 

This  paper  describes  an  efEcient  scheme  to  dynamically 
maintain  the  set  of  maximas  of  a  2-d  set  of  points.  Using 
the  fact  that  the  maximas  can  be  stored  in  a  Staircase 
structure,  we  use  a  technique  which  maintains  approx¬ 
imations  to  the  Staircase  structure.  We  first  show  how 
to  maintain  the  maximas  in  O(logn)  time  per  insertion 
and  deletion  when  there  are  n  insertions  and  deletions. 
0(logn)  is  charged  per  change  for  reporting  changes  to 
the  structure.  We  also  show  another  scheme  which  re¬ 
quires  0{logn)  amortized  time  per  insertion  and  deletion 
with  an  output  complexity  of  0(r)  steps  when  r  mcixi- 
mal  points  are  to  be  listed.  The  data  structures  require 
0(n)  space. 

1  Introduction 

Given  a  set  S  of  points  in  the  x  —  y  plane,  pi  =  {xi,yi)  £ 
S  is  a  maximal  point  iff  it  is  not  dominated  by  another 
point  pj  where  a  point  pj  dominates  pi  iff  a:,-  <  Xj  and 
yi  <  yj .  The  set  of  maximal  points  of  S  form  a  structure 
which  monotonically  decreases  in  the  y  direction  as  the 
X  coordinate  of  the  points  increases.  Such  a  structure  is 
called  a  Staircase  structure.  The  maxima  problem  occurs 
in  a  large  number  of  applications  in  statistics,  economics, 
operations  research  etc.  The  reader  is  referred  to  the 
book  by  Preparata  and  Shamos  for  further  details  [PS] 
In  this  paper  is  described  the  design  of  a  data  struc¬ 
ture  for  the  dynamic  maintenance  of  the  maximas  of  a 
point  set.  We  maintain  approximations  to  the  set  in  a 
structure  which  is  an  approximate  staircase  structure. 
For  the  static  case  it  has  been  shown  that  the  staircase 
structure  can  be  computed  in  0{nlogn)  time  eliminat¬ 
ing  dominated  points[KLP].  In  the  dynamic  case  however 
one  needs  to  keep  track  of  the  dominated  points. 
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granted  provided  that  the  copies  are  not  made  or  distributed  for 
direct  commercial  advantage,  the  ACM  copyright  notice  and  the 
title  of  the  publication  and  its  date  appear,  and  notice  is  given 
that  copying  is  by  permission  of  the  Association  of  Computing 
Machinery.  To  copy  othenwise,  or  to  republish,  requires  a  fee 
and/or  specific  permission. 
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In  this  case  Overmars  and  Van  Leeuwen  [OL]  have 
designed  a  data  structure  which  requires  splitting  and 
merging  balanced  trees  when  points  are  inserted  and 
deleted.  In  their  scheme  0[log^n)  operations  are  re¬ 
quired  for  each  insertion  and  deletion.  A  scheme  by 
Willard  and  Luekar  [WL]  gives  a  bound  of  0{logn) 
for  updates  but  the  set  of  maximas  is  not  maintained. 
Fredrickson  and  Rodger  [FR]  and  Janardan  [J]  have  de¬ 
signed  a  scheme  which  maintains  the  staircase  struc¬ 
ture  of  the  set  of  maximas  and  allows  for  insertions 
and  deletions  in  O[logn)  and  0(log^n)  operations,  re¬ 
spectively.  These  are  the  most  asymptotically  efEcient 
schemes  known.  Their  data  structure  requires  0(n) 
space.  Also  [HS]  gives  a  scheme  for  offline  maintenance. 

We  first  present  an  improved  data  structure  that  main¬ 
tains  the  staircase  structure  in  0{logn)  time  per  inser¬ 
tion  and  deletion.  A  point  can  be  tested  to  be  a  maxima 
in  0{logn)  time  in  our  representation.  And  the  staircase 
structure  representing  the  set  of  Maximas  can  be  listed 
out  in  time  linear  in  the  size  of  the  set  of  Maximas  but 
with  0{logn)  operations  to  be  spent  on  changes  in  the 
report  of  the  maximal  elements.  The  data  structure  is 
simple  and  requires  0(n)  space. 

Furthermore  it  is  shown  that  the  data  structure  can 
be  modified  to  give  a  bound  of  0{nlogn  -|-  mlogn  +  r) 
operations  when  there  are  n  insertions,  m  deletions  and 
r  maximal  points  are  reported. 

We  hope  that  the  methodology  that  we  have  described 
may  also  be  applicable  to  dynamic  maintenance  of  other 
geometric  structures,  i.e.  convex  hulls,  intersection  of 
half  planes  and  kernel  of  simple  polygons.  Since  the 
optimal  dynamic  maintenance  of  these  geometric  struc¬ 
tures  has  been  unsolved  for  some  time,  the  solution  in 
this  paper  may  be  of  interest  in  terms  of  technique. 

2  Outline  of  the  Paper. 

This  paper  first  describes  the  structural  changes  required 
for  maintaining  the  maximas  under  insertions  and  dele¬ 
tions.  We  then  show  how  to  implement  the  scheme  in  a 
data  structure  requiring  0(n)  space. 

The  underlying  data  structure  that  we  use  for  storing 
the  points  is  the  red-black  tree  [Tj.  It  is  called  the  Max- 
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ima  Balanced  Tree.  The  set  of  points,  5  are  stored  at  the 
leaves  of  a  red-black  tree,  which  we  denote  by  MT{S),  in 
order  of  increasing  ^-coordinate  values.  At  each  node  is 
stored  the  set  of  maximas  of  a  set  of  points  stored  in  the 
subtree  rooted  at  that  point.  This  set  is  not  stored  ex¬ 
plicitly  as  we  will  see.  The  initial  tree  can  be  constructed 
in  0(nlogn)  time  easily.  Let  M{q)  be  the  set  of  maxi- 
mas  at  node  q  in  the  tree.  Furthermore  assume  that  at 
each  node  the  meiximas  are  specified  in  a  staircase  sorted 
order  with  points  having  decreasing  y-coordinate  and  in¬ 
creasing  x-coordinate.  We  let  Top{M{q))  represent  the 
point  with  the  maximum  y-coordinate  in  the  staircase 
structure  representing  the  maximas  at  node  q.  We  let 
y{p)  refer  to  the  y-coordinate  of  point  p. 

The  first  scheme  that  is  proposed  uses  the  fact  that 
during  insertions  binary  search  is  performed  only  once 
after  which  the  inserted  point  is  dominated.  This  fact 
has  also  been  used  in  [FR],  [J]  in  their  data  structure. 
We  thus  obtain  a  solution  with  0{logn)  insertion  time 
and  0{log^n)  deletion  time.  Also  isolated  is  the  reason 
for  the  0[log^n)  behaviour  of  the  scheme.  This  scheme 
is  briefiy  described  in  section  3. 

In  section  4,  the  algorithm  is  refined  so  that  both 
insertions  and  deletions  can  be  done  in  0{logn)  time. 
We  do  so  by  maintaining  approximations  to  the  stair¬ 
case  structure.  This  allows  the  use  of  only  a  constant 
number  of  binary  searches  to  update  the  balanced  tree 
structure  storing  the  maximas.  The  improvements  are 
described  without  a  description  of  the  secondary  struc¬ 
tures  required  to  store  the  maximas  at  the  nodes  of  the 
balanced  binary  tree.  Finally,  in  section  5,  the  space  re¬ 
quirements  are  shown  to  0(n).  In  section  6  the  scheme 
is  modified  to  list  the  maximal  points  in  linear  amortized 
time. 

3  An  0{logn)  insertion  and 
0{log^n)  deletion  scheme. 

Let  us  first  consider  insertions.  Let  p  be  the  point  to  be 
inserted  into  the  tree  MT{S)  storing  the  set  of  maximas 
of  the  point  set  S.  Since  the  leaves  are  arranged  in  as¬ 
sorted  order  a  binary  search  gives  us  the  leaf  at  which 
the  point  p  is  to  be  inserted.  Let  P  be  the  path  from  the 
root  to  the  leaf.  Suppose,  on  the  path  P,  we  encounter  a 
triple  of  nodes  (u,  v,  w)  where  v  is  the  common  parent  of 
u  and  w.  u  is  the  left  child  and  is  referred  to  by  left{v) 
and  w  the  right  child  and  referred  to  by  right{v).  P  uses 
V  and  one  of  u  and  w. 

Firstly  suppose  the  path  P  uses  w.  Suppose  the  max¬ 
imas  of  the  points  in  the  subtree  at  w  has  been  recom¬ 
puted  because  of  the  insertion  of  the  new  point  p.  M (v) 
has  to  be  recomputed  from  M (u)  and  M (w)  by  a  merger. 
The  merger,  in  general,  involves  locating  Top{M{w))  in 
M  (u)  and  then  replacing  the  portion  of  M  (u)  dominated 


by  M{w)  by  M{w)  itself.  However  when  a  single  point, 
p,  is  inserted  Af(io)  changes  only  locally.  The  merger 
operation  can  now  be  accomplished  by  changes  to  the 
current  staircase  structure  instead  of  complete  recompu¬ 
tation.  There  are  a  number  of  cases  depending  on  the 
topmost  node  of  M'{w),  Top{M'{w))  where  M'{w)  is 
the  updated  staircase  structure  at  w.  We  assume  that 
Top{M'{w))  has  changed.  If  it  does  not,  then  no  change 
in  the  merged  staircase  structure  is  required. 

Case  1;  Top{M'{w))  is  above  Top{M{v)).  In  this  case 
the  merger  requires  constant  time  since  M'[y)  is  actually 
M'{w).  (Fig.  Ml) 

Case  2:  Top{M'{v}))  is  not  above  Top{M{v)).  In  this 
case  the  merger  is  done  by  a  binary  search  on  M  (w)  and 
requires  0{logn)  time.  Note  that  from  this  node  up- 
words  to  the  root  the  top  node  of  any  stairccise  structure 
does  not  change  and  new  merger  points  need  not  be  re¬ 
computed.  (Fig.  M2) 

Next  consider  the  case  when  P  uses  u.  Again,  suppose 
M{u)  has  been  modified  and  let  the  modified  structure 
be  M'{u).  There  are  two  cases: 

Case  1;  Top{M{u))  has  changed.  There  are  two  cases. 
If  Top{M{w))  <  Top{M'{u))  then  change  the  top  node 
of  M{v)  to  Top{M'{u))  if  necessary.  Otherwise,  change 
Top{M{v))  to  Top{M{w))  if  necessary. 

Case  2:  Top  node  of  M{u)  has  not  changed.  In  this 
case  we  have  to  determine  the  changes  that  are  to  be 
made  to  M{v)  since  the  top  of  M{w)  has  to  be  posi¬ 
tioned  with  respect  to  the  changes  in  M{u).  We  note 
that  this  can  be  done  in  constant  time  since  only  a  com¬ 
parison  of  Top{M{w))  with  the  point  inserted  into  M{u) 
and  points  adjacent  to  it  hcis  to  be  made,  provided  that 
node  is  in  the  set  of  maximas.  This  information  is  easily 
maintained  as  we  proceed  up  the  path  P.(Fig.  M3) 

We  thus  see  that  an  O(logn)  insertion  is  possible  since 
binary  search  is  performed  once  only  in  Case  2  when  P 
uses  w. 

Next  consider  the  affect  of  deletions.  Again  consider 
a  path  P  from  leaf  to  root  and  a  triple  of  nodes  (u,  v,  w) 
on  the  path.  The  path  may  use  either  u  or  w.  First 
consider  the  case  when  the  path  P  uses  w.  Suppose  the 
maximal  set  has  been  recomputed  at  w  and  is  M'{w). 
There  are  two  cases  depending  on  Top{M{w)):  Case  1, 
where  Top{M{w))  changes.  This  case  has  two  subcases. 
And  Case  2  when  Top{M{w))  does  not  change.  We  con¬ 
sider  the  subcases. 

Case  1.1:  Top{M'{w))  <  Top{M{u)).  Then  M{v)  can 
be  computed  in  0{logn)  time  by  locating  Top{M'{w)) 
in  Top{M (u)  by  a  binary  search. 

Case  1.2:  Top{M'{w))  >  Top{M[u)).  Update 
Top{M{v))  in  0(1)  time  since  Af(i;)  is  M'{w). 

Case  2:  Top{M{w))  does  not  change.  M{v)  changes  ex- 
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actly  as  M[w)  is  changed  in  the  portion  of  the  staircase 
common  to  both  M{y)  and  M{w).  In  this  case  the  list  at 
V  need  not  be  searched  but  only  changed  at  Top[M[w)) 
to  incorporate  M'(w'). 

Next  consider  the  case  when  the  path  P  uses  u.  M{y) 
is  obtained  from  M'{u^  and  M(w')  by  a  merge  operation 
and  requires  0{logn)  operations. 

It  is  the  first  sub-case  of  the  first  case  when  the 
path  uses  w  and  u  that  creates  the  first  problem,  i.e. 
it  gives  a  time  bound  of  0[log^n)  since  the  merge  of 
M'(u)  and  M{w)  requires  O(logn)  time  due  to  search 
for  Top[M(w))  in  M'{u)  and  there  may  be  Q[logn) 
such  merges  to  be  performed.  Also  Top(M'{w))  may 
be  changed  at  0{logn)  nodes  along  path  P.  Finally  we 
consider  the  rebalancing  operations.  They  can  be  per¬ 
formed  in  0{logn)  time  since  the  balancing  requires  at 
most  0(1)  locations  of  top  nodes  of  staircase  structures. 

Next  we  show  how  to  improve  the  performance  to 
0[logn)  per  deletions  also. 

4  An  Improved  Solution. 

4.1  Outline. 

In  order  to  improve  the  performance  we  attempt  to  re¬ 
move  the  bottleneck  introduced  by  the  repeated  mergers 
that  are  required  on  path  P  when  M‘{u)  and  M{w)  are 
merged  after  a  deletion.  This  is  done  by  performing  a 
constant  number  of  mergers  as  follows: 

Let  Mset{P,  u)  be  the  set  of  nodes  on  the  path  P  with 
the  following  property:  Top{M{wi)),  Wi  £  Mset{P,u) 
intersects  the  section  of  the  staircase  in  M{u)  that  is 
changed  due  to  the  deletion.  Note  that  Wi  is  an  ances¬ 
tor  of  u  and  |Mset(P,  it)|  =  0{logn).  Only  one  merger 
is  required  and  that  is  the  merger  with  the  maximal  set 
M{wk)  where  is  the  node  in  the  binary  tree  such  that 
Top{M{wk))  exceeds  the  top,  Top{M{wi)),  of  all  other 
nodes  in  the  set  Mset[P,u).  All  the  other  mergers  re¬ 
quired  are  deferred  by  actually  constructing  the  chains 
so  that  M{wi),Wi  £  Mset{P,u)  is  merged  with  M{ui), 
where  Ui  is  the  sibling  of  Wi  ,  by  joining  Top{M{wi)) 
to  either  the  node  in  M(u)  at  which  M{{wk))  is  merged 
or  to  Top[M(ui)).  This  construction  is  required  for  a 
constant  number  of  sets,  Mset{P,u)  corresponding  to 
nodes  it  on  the  path  P.  This  destroys  the  staircase  na¬ 
ture  of  the  maximal  set,  creating  non-horizontal  and  ver¬ 
tical  lines  called  ’’kinks”  but  the  deletion  now  requires 
0{logn)  steps. 

The  insertion  also  requires  0[logn)  steps.  It  may  so 
happen  that  during  insertion  or  deletion  the  node  at 
which  the  ’’kinks”  occur  changes.  The  kinks  thus  shift. 
However  we  show  that  the  shift  only  occurs  once  on  the 
path  and  there  are  only  0[logn)  edges  to  be  transferred 
at  the  kinks. 


4.2  Detailed  Algorithm. 

We  first  formally  define  the  semi-staircase  structure  at 
the  nodes. 

The  Semi- staircase  structure  is  a  sequence  of  nodes, 
simultaneously  sorted  in  increasing  x  order,  and  in  de¬ 
creasing  y-order  such  that  two  adjacent  vertices  are  ei¬ 
ther  connected  by  a  horizontal  and  vertical  Manhattan 
path  or  by  a  straight  line  (called  kink  line).  (Fig.  M4) 

A  Stairstep  is  the  horizontal  and  vertical  Manhattan 
path  that  connects  two  adjacent  vertices. 

The  kink  line  (u,  v)  is  an  approximation  to  a  staircase 
structure  which  connects  the  two  vertices  u  and  v. 

We  need  the  following  property  of  kinks  which  will  be 
proved  later. 

Property  Cover;  Every  kink  line  connecting  a  point  u, 
which  is  not  maximal,  to  another  point  v  is  dominated 
by  a  horizontal  edge  or  a  kink  edge,  i.e.  there  exists  a 
point,  c  with  greater  y  co-ordinate  than  u  and  greater  x 
CO-  ordinates  than  it  and  v. 

When  the  point  c  is  connected  by  a  horizontal  line  to 
a  point  s  with  y-coordinate  equal  to  or  just  above  that  of 
V,  s  is  called  the  kink-cover  of  the  kink  line.  Note  that 
such  a  point  need  not  always  exist,  e.g.  it  does  not  exist 
when  V  is  the  top  node  of  a  staircase  structure.  Note  also 
that  we  may  obtain  a  sequence  of  kink  lines.  The  Cover 
property  is  also  satisfied  for  each  edge  of  this  sequence 
and  hence  by  the  entire  sequence. 

We  next  show  how  to  dynamically  maintain  this  semi¬ 
staircase  structure  under  insertions  and  deletions. 

4.3  Insertions 

We  first  consider  Insertions.  For  inserting  point  p  let  P 
be  the  path  from  leaf  to  root.  As  we  proceed  up  the  path 
consider  the  triple  of  nodes  (u,  v,  w)  such  that  v  is  the 
parent  of  u  and  w. 

Suppose  the  path  P  uses  w.  The  cases  are  similar  to 
the  previous  section  and  involve  a  binary  search  with 
Top[M{w)  onto  a  semi-staircase  structure.  The  first 
change  in  the  algorithm  occurs  when  y  =  y{Top[M{w))) 
intersects  a  kink  line  joined  to  v.  In  this  case  a  new  kink 
line  is  introduced  from  Top{M(w))  to  v.  The  second 
change  is  when  at  a  node  w,  Top(M(v))  is  connected  by 
a  kink  edge  to  a  node  in  M  (u)  and  the  node  to  which  the 
kink  edge  is  connected  is  removed  by  the  insertion  of  a 
point.  Top(M(v))  is  then  located  within  the  new  stair¬ 
case  in  constant  time  by  computing  the  intersection  with 
the  stairstep  newly  introduced.  Thus  as  in  the  previous 
section  one  location  of  Top(Af(w))  in  a  semi-staircase 
structure  is  required. 

Before  detailing  the  location  of  Top(M(w))  we  de¬ 
scribe  in  general  the  procedure  for  the  location  of  a  point 
p  in  a  semi-staircase.  The  procedure  is  recursive.  At  any 
node  V  in  the  tree  we  check  if  the  point  is  located  in  the 
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part  of  the  semi-staircase  at  the  left  son  or  at  the  right 
son  and  we  recur  in  the  appropriate  sub-tree.  This  check 
is  done  cis  follows:  Suppose  w  and  u  are  the  right  and 
left  sons  at  v  and  M{w)  is  merged  with  M{u)  by  a  hori¬ 
zontal  edge,  h.  Then  the  procedure  branches  to  the  left 
if  y{p)  is  greater  than  the  y  co-ordinate  of  point  on  h 
otherwise  the  procedure  branches  to  the  right.  Alter¬ 
natively,  if  at  V  there  is  a  kink  edge  merging  the  right 
semi-staircase  with  the  left  then  the  procedure  recurs  at 
the  left.  The  procedure  stops  at  a  leaf  having  discovered 
on  the  staircase  a  point  with  y  co-ordinate  just  greater 
than  y{p)-  This  is  the  node  to  which  p  is  located.  The 
procedure  requires  0{logn)  steps.  Also  those  0(logn) 
kink  edges  on  the  search  path  which  intersect  y  =  y{p) 
are  obtained. 

To  update  the  staircase  structures  after  the  location  of 
a  point  is  done,  suppose  the  node  to  which  the  location 
is  done  is  q' .  Firstly  the  edge  joining  Top{M{w)  to  q' 
is  added  to  the  semi-staircase  structure  at  v.  Next  the 
kink  edges  intersected  by  y  =  y{s)  have  their  top  point 
changed  to  the  node  in  the  staircase  just  below  q'  if  such 
a  node  exists.  Else  the  kink  edges  become  horizontal 
edges. 

When  path  P  uses  u,  the  update  after  an  insertion 
is  again  in  constant  time  as  described  in  the  previous 
section. 

Thus  insertions  can  be  done  in  O(logn)  steps. 

4.4  Deletions 

Next  consider  deletions:  Again  consider  the  triple 
(u,  V,  w).  Suppose  P,  the  path  from  the  leaf  deleted  (cor¬ 
responding  to  point  q)  to  the  root,  uses  w.  We  consider 
the  case  depending  on  whether  M'{w)  has  it’s  topnode 
different  from  M{w)  or  not. 

Case  1.1:  Top{M'{w))  is  changed,  i.e.  the  topmost 
node  of  the  previous  maximal  set  has  been  deleted.  In 
this  case  Top{M'{w))  is  to  be  located  again  in  Af(u) 
to  construct  M{v).  There  are  two  subcases:  Either  the 
new  topmost  node  is  the  topmost  node  of  M(v).  In  this 
case  the  check  is  done  in  0(1)  time.  On  the  other  hand 
the  topmost  node  has  to  be  relocated  within  M{u)  in 
0{logn)  time.  However  we  need  to  do  this  only  when 
the  set  of  maximas  is  affected.  Before  we  consider  the 
number  and  type  of  such  relocations  required  we  describe 
in  detail  one  such  relocation:  There  are  two  possible  situ¬ 
ations  depending  on  whether  the  previous  topmost  node 
is  connected  by  a  kink  edge  or  not. 

We  first  consider  the  situation  when  the  topmost  node 
in  M{w)  is  not  connected  by  a  kink  edge  to  a  node  in 
M('u)  but  by  a  horizontal  edge.  Let  q'  be  the  node 
in  M{u)  to  which  the  topmost  node  in  M{w)  was  con¬ 
nected.  The  relocation  depends  on  whether  or  not  other 
nodes  were  connected  by  kink  edges  to  the  node  q.  If  no 
other  nodes  were  connected  then  the  relocation  is  simple. 


The  relocation  is  done  onto  a  staircase  structure  Af(u). 
When  kink  edges  are  present  at  the  node  q,  relocation  of 
the  topmost  node  may  destroy  the  semi-stairccise  struc¬ 
ture  and  changes  have  to  be  made  to  preserve  the  semi¬ 
staircase  structure  and  the  cover  property.  We  describe 
these  changes  next. 

When  other  kink  edges  are  present  relocating 
Top{M'{w))  involves  two  sub-cases.  Either  the  top  node 
of  M'(w)  is  relocated  below  the  kink  edge  (s,  5),  where 
s  is  the  point  at  the  first  descendant  node  of  u  attached 
to  5  by  a  kink  edge.  Or  onto  the  kink  edge.  Let  d  be  the 
node  at  which  s  is  attached  to  g  by  a  kink  edge.  Note 
that  s  €  M{right{d))  where  right{d)  is  the  right  son  of 
d.  s  is  located  in  the  staircase  structure  at  lefi{d).  Sup¬ 
pose  the  node  to  which  the  relocation  is  done  is  q'.  Next, 
all  the  kink  edges  intersected  by  y  =  y{s)  have  their  top 
point  changed  to  the  node  in  the  staircase  just  below  q' 
if  such  a  node  exists.  Also  Top{M'{w))  is  relocated  in 
the  changed  structure  M'{u).  All  the  operations  require 
0{logn)  time  since  atmost  two  locations  are  done  and  at- 
most  O{logn)  kink  edges  have  their  top  point  changed. 

In  the  second  sub-case,  i.e.  when  Top{M'{w))  is  lo¬ 
cated  onto  the  kink  edge,  Top{M'(w))  is  located  in  M{u) 
by  a  procedure  similar  to  the  one  which  locates  s  in  the 
above  case.  And  all  the  kink  edges  intersected  by  the 
horizontal  edge  are  changed.  In  this  case  too  0{logn) 
steps  are  required. 

Next  consider  the  situation  when  Top{M{w))  was  con¬ 
nected  by  a  kink  edge  to  a  node  q  in  M{u).  Again,  let 
s  be  the  point  first  attached  to  g  by  a  kink  edge  at  a 
descendant  node  of  u  (if  such  a  node  exists).  Either 
the  y-coordinate  of  Tcypi^M'iw))  is  less  than  that  of  s  or 
not.  Firstly  consider  the  case  when  Top{M'{w))  <  y{s). 
Then  Top{M'{w)  is  relocated  in  the  staircase  structure 
at  M(u).  Furthermore  s  may  be  located  also,  modify¬ 
ing  kink  edges  intersected  by  y  =  y(s)  as  in  the  above 
case.  Next,  consider  the  case  when  Top[M'{w))  >  y(s). 
Then  a  kink  edge  from  Top{M'{w))  to  g  may  be  retained 
in  the  merge  step  or  Top{M'[w)  may  be  located  in  the 
staircase  structure  modifying  the  kink  edges  from  s  and 
other  nodes  as  described  in  the  previous  paragraph. 

We  next  characterize  when  the  relocation  of  points  in 
the  semi-staircases  is  necessary:  Let  Uset  be  the  set  of 
nodes,  whose  Top  has  changed  due  to  the  deletion  of  the 
particular  node.  Note  that  the  Top  value  of  these  nodes 
is  in  increasing  order  of  y-coordinate.  A  relocation  is 
done  at  the  node,  say  t,  with  the  highest  Top  value. 
Furthermore,  consider  a  node  u,u  ^  t  and  u  £  U set. 
u  is  not  relocated  but  Top{M{right{u)))  is  joined  to 
Top{M{left{u)))  by  a  kink  edge. 

Case  1.2;  Top{M'{w))  has  not  changed.  In  this  case 
no  changes  are  required  in  the  merger  step  and  the  com¬ 
position  of  M'{w)  and  M{v)  is  carried  upwords  in  0(1) 
time. 
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Next  suppose  the  path  P  uses  u.  We  assume  by  in¬ 
duction  that  M'{u)  has  a  semi-staircase  structure. 

Case  2.1:  M{w)  intersects  M{u)  at  the  stairstep  asso¬ 
ciated  with  the  removed  node.  Let  R(q)  be  the  semi¬ 
staircase  that  replaces  the  node  q  deleted.  R{q)  is  ob¬ 
tained  as  we  traverse  up  the  path  P.  We  let  S  be 
the  set  of  nodes  on  the  path  P  such  that  if  u  e  S 
then  Top{M{v)  intersects  the  stairstep  associated  with 
q  and  thus  R{q).  Let  w  be  the  node  in  5  such  that 
Top(M(w))  >  Top(M(u)),u  e  S.  Then  Top{M{w)  is 
located  in  R{q)  to  give  a  node  z  which  forms  the  node 
above  Top{M{w))  in  the  semi-staircase  structure  at  the 
parent  of  w.  Otherwise  a  direct  line  (kink  line)  is  drawn 
from  Top(^M[u'j),u  E  S  to  a.  node  t  in  R{q)  where  t  is 
the  node  in  R{q)  with  y-coordinate  just  less  than  ^  and 
greater  than  Top(^M(^w)).  If  such  a  t  does  not  exist  then 
either  a  normal  staircase  is  constructed  with  z  being  the 
node  next  in  the  semi-staircase  structure  at  the  parent 
of  u.  Also  iff  is  not  present  in  M{w),  where  w  is  the  sib¬ 
ling  of  u,  then  Top{M{u))  is  connected  via  a  kink  line  to 
Top{^M{wYj.  Note  that  this  procedure  involves  delaying 
the  construction  of  the  straight  lines  until  ^  is  obtained. 
Ail  the  merges  required  can  be  done  in  Oilogji)  time. 
(Fig.  M5) 

Case  2.2:  M{w)  does  not  intersect  M{u)  at  the 
stairstep  associated  with  the  changed  node.  Then  no 
change  in  the  merger  need  be  done. 

The  above  cases  complete  the  description  of  the 
changes  required  after  a  deletion.  It  is  easily  seen  that 
0{logn)  steps  are  required.  Furthermore,  balancing  re¬ 
quires  0{logn)  steps  as  each  rotation  or  double  rotation 
requires  at  most  0(1)  relocations  of  topmost  nodes  giv¬ 
ing  a  bound  of  0(logri)  steps. 

Note  however  that  at  the  end  of  the  deletion  some 
kinks  may  be  left  on  the  final  staircase  structure  at  the 
root. 

We  next  show  that  the  insertion  and  deletion  proce¬ 
dures  are  correct.  It  suffices  to  show  that  a  semi-staircase 
is  maintained  at  each  of  the  nodes  and  also  that  the  cover 
property  is  maintained. 

Lemma  1  Phe  iuseriion  and  deletion  proceduve  main¬ 
tains  semi-  staircases  at  each  node  of  the  Maxima  Bal¬ 
anced  Tree.  Moreover  the  semi-staircase  structures  sat¬ 
isfy  the  cover  property. 

Proof:  The  proof  is  by  induction.  It  is  trivially  true 
when  there  is  only  one  point.  Thus  assume  that  there 
are  n  points  and  the  data  structure  contains  lists  which 
are  staircase  structures. 

If  an  insertion  is  made  the  operation  only  introduces  a 
kink  edge  when  the  inserted  node  is  located  onto  a  kink 
edge  in  the  staircase  structure.  The  new  kink  edge  added 
maintains  the  semi-staircase  structure  and  also  the  cover 
property  since  either  the  kink  edge  is  the  dominating 
edge  or  the  dominating  edge  is  unchanged. 


When  a  deletion  is  performed,  the  operations  firstly 
introduces  kink  edges  by  finding  the  node  closest  to  the 
root  which  intersects  the  modified  staircase  and  locating 
it  in  the  staircase  structure.  All  other  edges  added  at 
descendant  nodes  are  kink  edges  and  valid,  i.e.  joining  a 
point  in  the  right  subtree  to  a  point  in  the  left  subtree 
unless  the  point  is  at  the  top. 

These  also  include  kinks  introduced  by  adding  edges 
from  the  Top  of  staircases  at  the  left  son  to  the  Top  of 
staircases  at  the  right  sons.  The  cover  property  is  main¬ 
tained  since  each  kink  edge  (u,  v)  is  covered  if  it  has  been 
determined  if  u  is  not  in  the  maximal  set.  Also  when 
kink  and  cover  edges  are  modified  by  locations  the  semi¬ 
staircase  structure  and  cover  property  are  maintained  as 
detailed  by  the  construction  procedure.  This  is  also  true 
after  balancing  of  the  tree.  ■ 

Since  the  cover  property  is  maintained  at  the  nodes 
of  the  tree  kinks  in  the  output  are  important  in  that 
these  kinks  have  to  be  removed  to  obtain  the  actual  out¬ 
put.  These  kinks  can  be  removed  at  the  time  of  report- 
ing  along  with  other  kinks  that  are  discovered  when  the 
kinks  are  removed  resulting  in  an  output  complexity  of 
0(m  -I-  {chng)logn)  when  there  are  chng  changes  in  the 
set  of  maximas  to  be  reported.  Details  of  the  report- 
ing  will  be  discussed  in  the  next  section.  Once  removed 
these  kinks  do  not  occur  again  in  the  output. 

To  summarize 

LsinniH  2  The  structural  changes  required  to  maintain 
the  semi- staircase  structures  at  each  node  of  the  Max¬ 
ima  Balanced  Tree  under  insertions  and  deletions  can 
be  carried  out  in  0(logn)  operations.  Moreover  0[logn) 
operations  are  required  for  every  change  to  be  reported  in 
the  maximal  set. 


4.5  An  0(n)  Space  Solution 

In  this  section  we  show  how  to  implement  the  scheme 
for  insertion  and  deletion  in  0{ri)  space.  This  is  accom¬ 
plished  by  maintaining  0(1)  information  at  the  nodes  of 
the  balanced  binary  search  tree. 

Firstly  at  each  node,  u,  is  maintained  left  and  right 
pointers,  left{y)  and  right[v),  to  the  sons  of  the  node, 
say  u  and  w.  We  also  maintain  two  variables  TOP 
and  INTER  which  contain  points.  TOP  is  the  top¬ 
most  point  in  the  semi-staircase  structure  at  v  (This 
point  is  also  referred  to  by  Top{M{v)))  before).  And 
INTER  is  the  node  at  which  Top{M{w))  is  located  in 
the  semi-staircase  structure  at  u,  i.e.  INTER  is  the 
point  in  the  staircase  structure  at  v  just  above  the  point 
TOP  at  node  w  in  the  staircase  structure  at  v.  Note 
that  Top(M(ti;))  may  be  attached  to  INTER  by  a  kink 
edge.  In  the  description  below  the  value  of  INTER  for 
nodes  where  a  kink  line  is  drawn  is  assumed  to  be  the 
y-coordinate  of  the  upper  point  of  the  kink  line.  To 
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list  the  points  more  efficiently  we  need  another  variable 
CHANGE  which  we  detail  below. 

We  next  describe  the  listing  and  search  procedures  on 
the  data  structure  without  using  CHANGE. 

4.6  Listing  the  maximas 

To  list  all  the  maximas  we  proceed  as  follows:  We  start 
at  the  root.  At  the  root  node  TOP  gives  the  first  node  in 
the  stairccise  structure.  To  obtain  nodes  contributed  by 
the  left  subtree  we  test  if  the  topmost  node  of  the  right 
subtree,  w,  is  located  strictly  below  the  topmost  node  of 
the  left  subtree,  u.  If  so  then  we  list,  recursively,  those 
maximal  elements  in  the  subtree  at  u  not  yet  reported 
and  with  y  co-ordinates  greater  than  the  j/-coordinate  of 
TOP{right{w)).  We  then  list  the  maximal  elements  in 
the  subtree  rooted  at  w  recursively.  When  we  apply  this 
procedure  recursively  we  note  that  at  a  node  the  left  son 
may  be  merged  with  the  right  son  using  a  kink  edge. 

We  thus  first  remove  this  kink  edge  before  recurring. 
This  procedure  is  formalized  as  follows: 

LIST{v,  stop,  last)-, 

(This  procedure  lists  the  maxima  in  the 
subtree  at  v  with  y-cordinate  greater 
than  stop,  last  is  the  y-coordinate 
of  the  last  maximal  point  output.) 

Begin 

If  last  >  y{TOP)  >  stop  then 
begin  output  TOP]  last  :=  TOP  end; 
else 
begin 

if  (right  son  is  connected  to  left 
son  by  a  kink  edge)  then 

LOCATEY(  Top{right{v)))] 
if  y(TOP)  >  y{INTER)  then 
begin 

LIST{left,  y{INTER),  last); 

LIST\right,  stop,  last) 

end 

else  LIST{right,  stop,  last) 
end 
end. 

We  now  describe  the  implementation  of  the  locate  pro¬ 
cedure,  LOCATEY.  The  locate  algorithm  has  been  de¬ 
scribed  before. 

Suppose  we  are  at  a  node  v  and  wish  to  search  for  the 
horizontal  location  of  p  in  the  semi-staircase  structure 
at  V.  There  are  two  cases:  Suppose  y{INTER)  is  less 
than  y{TOP).  If  y(p)  >  y{INTER)  then  we  recursively 
search  in  the  left  subtree.  And  if  y(p)  <  y{TOP{w)) 
where  w  is  the  right  son  of  the  current  tree  node  then 
we  recur  to  the  right.  Otherwise  we  get  two  subcases.  If 
INTER  is  TOP  because  of  a  kink  edge  from  the  right 


son  to  the  left  son  then  locate  y(p)  in  the  left  subtree 
again.  Otherwise  we  return  with  INTER  as  the  node 
required. 

Note  that  this  procedure  requires  0{logn)  time.  A  list 
of  kink  edges  along  the  path  s.t.  y  =  y(p)  intersects  the 
kink  edge  is  also  obtained.  These  edges  are  modified  so 
that  their  top  end  is  y(p)  thus  modifying  the  staircase 
structure  as  described  before. 

The  procedure,  LIST,  above  is  invoked  at  the  root 
with  stop  being  the  least  y-  coordinate.  We  omit  the 
proof  of  correctness.  The  procedure  requires  time  pro¬ 
portional  to  0[mlogn)  where  m  is  the  number  of  maxi¬ 
mal  points. 

To  list  the  maximal  points  in  0(m)  time  we  make 
the  following  modification  to  the  data  structure:  At 
each  node,  say  v,  we  store  a  pointer  called  CHANGE. 
CHANGE  points  to  the  first  preorder  descendant  node 
where  a  change  to  the  staircase  structure  at  v  is  made. 
\iINTER{v)  <  TOP{v)  then  CHANGE  points  to  the 
first  node  ,  say  u,  in  a  preorder  scan  of  the  left  sub¬ 
tree  rooted  at  v  where  TOP[right[u))  >  INTER[v). 
If  TOP{v)  =  TOP{right{v))  then  CHANGE{v)  is  the 
same  as  C H AN GE{right{v)). 

It  is  ecisy  to  see  that  using  these  pointers  we  can  re¬ 
cursively  list  the  maximas  in  linear  time.  This  includes 
the  time  required  to  remove  kinks  along  the  staircase 
structure.  The  following  scheme  does  so: 

LIST{y,  stop,  last)] 

(This  procedure  lists  the  maxima  in  the  subtree 
at  V  with  y-cordinate  greater  than  stop) 

Begin 

If  last  >  y(TOP)  >  stop  then 
begin  output(TOP);  last:=  TOP  end; 
else 
begin 

if  (right  son  is  connected  to  left 
son  by  a  kink  edge)  then 
LOCATEY(  Top{right{v)))  and  update 
CHANGE  using  LOCATE  CHANGE. 
if  y(TOP)  >  y{INTER)  then 
begin 

LIST{CHANGE,  y{INTER),  last)] 

LISTiright,  stop,  last) 

end 

else  LIST{CHANGE,  stop,  last) 
end 
end. 

Note  that  the  procedure  LOCATE  CHANGE  is  in¬ 
voked  to  find  the  value  of  CHANGE  at  every  node. 
This  is  required  since  this  value  is  not  updated  at  ev¬ 
ery  node  during  the  deletion  operation  when  kink  edges 
are  constructed.  The  procedure  LOCATE  CHANGE 
will  be  described  later  in  the  section.  Before  LOCATE 
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CHANGE  is  invoked  the  kink  is  removed  by  locat¬ 
ing  TOP[right[v)).  This  requires  0{logn)  steps  for 
every  kink  removal.  Since  the  corresponding  point  is 
not  in  the  previously  reported  output  (else  the  kink 
would  not  be  present)  the  complexity  of  the  procedure  is 
0(m  +  chng{logn))  where  m  is  the  number  of  maximas 
and  chng  is  bounded  by  the  number  of  changes  in  the 
set  of  maximas  since  the  Icist  report  where  every  change 
involves  removal  of  a  kink. 

We  next  show  that  the  pointers  CHANGE  can  be 
assigned  at  each  node  initially  in  0(1)  amortized  time. 
This  requires  the  use  of  a  stack  to  maintain  a  list  of 
nodes  at  which  relevant  changes  to  the  current  staircase 
structure  have  taken  place.  The  top  of  the  stack  gives 
a  candidate  to  be  assigned  to  CHANGE  at  the  cur¬ 
rent  node,  say  v,  as  we  traverse  up  the  tree  to  the  root. 
It  is  the  first  descendant  node  in  preorder  at  which  a 
change  in  the  staircase  structure  takes  place.  Note  that 
at  this  node,  say  u,  Top[rigkt[u))  was  located  within 
M{left[u)).  The  next  item  in  the  stack  has  the  same 
property,  i.e.  it  is  the  first  descendant  node  v  of  the  cur¬ 
rent  node  with  the  property  that  INTER[v)  is  greater 
than  the  value  of  INTER  at  the  current  node.  The  stack 
contains  nodes,  v,  such  that  INTER[y),  is  in  increcising 
order  for  the  nodes  as  we  go  from  top  to  bottom  in  the 
stack. 

To  obtain  a  stack  and  to  assign  a  value  to  CHANGE 
at  the  current  node,  v,  we  adopt  the  following  proce¬ 
dure.  At  node  u,  TOP{rigkt[v))  is  located  within  the 
staircase  structure  at  left{y).  If  the  location  is  such  that 
TOP{right{v))  exceeds  TOP{left[v))  then  the  value  of 
CHANGE  is  the  same  as  the  one  at  right(y)  and  the 
stack  at  the  current  node  is  the  same  as  the  stack  on  the 
right  son.  If  TOP  {right  (^v))  is  below  TOP{left{v))  then 
CHANGE  is  obtained  from  the  stack  of  CHANGE  val¬ 
ues  at  left{v)  as  follows:  Let  Can  be  the  node  on  top 
of  the  stack.  If  the  y-coordinate  of  INTER{v)  is  >  y- 
coordinate  of  INTER  at  Can  then  Can  is  removed  from 
the  stack  and  a  new  candidate  is  obtained  from  the  top 
of  the  stack.  This  removal  is  repeated  until  we  find  a 
node  on  the  stack  at  which  the  y-coordinate  of  INTER 
is  above  the  y-coordinate  of  INTER{v).  CHANGE  at 
V  is  then  assigned  to  be  this  node.  Moreover  the  modi¬ 
fied  stack  obtained  from  the  left  son  is  updated  with  v 
and  the  stack  at  right{v)  is  removed  from  further  consid¬ 
eration.  The  initialization  of  CHANGE  can  be  done  in 
constant  amortized  time  at  the  nodes  since  once  a  node 
is  deleted  from  a  stack  it  is  not  considered  again. 

We  finally  describe  the  procedure  LOCATE 
CHANGE.  This  procedure  is  used  when  the  merge  of 
the  right  son  with  the  left  son  at  v  is  done  by  a  non¬ 
kink  edge  using  LOCATEY .  In  this  procedure  let  yp 
be  the  y-coordinate  of  the  located  point.  A  candidate, 
say  Can  for  CHANGE  is  obtained  from  the  value  of 
CHANGE  at  the  staircase  structure  at  the  left  son  of 


V.  If  yp  >  y{INTER{Can))  a  new  value  of  Can  is  ob¬ 
tained  from  the  CHANGE  value  at  the  node  currently 
assigned  to  Can  and  the  process  repeated  until  we  find 
a  node  at  which  INTER  has  greater  y-coordinate  than 
yp.  Such  a  node  exists  when  this  procedure  is  called 
and  CHANGE  is  assigned  this  node.  At  each  step  the 
height  of  Can  decreases.  Thus  this  procedure  requires 
0{logn)  steps. 

4.7  Searching  for  Dominance 

In  this  section  we  show  how  to  determine  if  a  given  point, 
p,  is  dominated  by  the  set  of  maximas  or  not.  We  do  so 
by  a  search  for  the  y-coordinate  of  the  given  point  in  the 
set  of  points  with  greater  x-coordinates.  Let  P  be  the 
path  from  the  root  to  the  leaf  where  the  x-coordinate  of 
the  point  p  is  located.  Let  R  be  the  set  of  roots  of  the 
subtrees  to  the  right  of  path  P.  It  suffices  to  check  if 
y{p)  >  y{TOP{v))  for  all  v  £  R.  If  it  is,  then  the  point 
is  not  dominated  otherwise  it  is. 

4.8  Insertions  and  deletion 

To  bound  the  time  for  insertions  and  deletions  in  the 
data  structure  we  note  that  the  crucial  step  while  per¬ 
forming  insertions  and  deletions  is  that  of  locating  the 
topmost  node  of  a  semi-staircase  structure  in  another 
semi-staircase  structure.  This  is  problem  LOCATEY . 
An  0{logn)  solution  to  this  problem  has  been  presented 
above  in  the  0(n)  space  data  structure. 

Moreover  the  0(1)  rotations  required  can  also  be 
done  in  O(logn)  steps  since  they  require  0(1)  mergers, 
each  requiring  0{logn)  steps.  The  variables  TOP  and 
INTER  stored  at  the  nodes  at  which  relocations  are 
performed  can  also  be  modified  in  0{logn)  steps.  Note 
that  the  value  of  INTER  can  be  obtained  by  using 
LOCATEY .  The  value  of  TOP  is  immediately  avail¬ 
able. 

We  would  also  like  to  update  the  value  of  CHANGE 
at  the  nodes.  We  consider  the  insertion  path  P{p)  for  a 
point  p.  Let  Cp  be  the  node  at  which  p  is  located  within  a 
staircase  structure.  Below  that  node,  p  is  at  the  TOP  of 
the  staircase  structures.  Also  let  Cu  be  the  node  at  which 
p  is  removed  from  the  staircase  structures.  Above  this 
node  p  is  no  longer  on  the  staircase  and  will  not  affect 
the  value  of  CHANGE  at  these  nodes.  The  procedure 
which  modifies  CHANGE  values  is  similar  to  the  ini¬ 
tialization  procedure.  For  nodes  below  Cp  the  value  of 
CHANGE  is  not  modified.  The  value  of  CHANGE  at 
Cp  is  computed  from  the  left  son  of  Cp  by  the  procedure 
LOCATE  CHANGE  which  has  been  described  before. 
For  nodes  above  Cp  upto  the  value  of  CHANGE  is  Cp 
unless  a  change  in  the  current  staircase  structure  takes 
place.  Again  if  the  change  is  below  the  y-coordinate  of 
p  the  change  is  stacked.  And  if  the  change  is  above  the 
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3/-coordinate  of  p  then  Cp  is  removed  from  further  consid¬ 
eration  and  the  value  of  CHANGE  need  not  be  changed 
at  this  node  and  above  since  the  value  of  INTER  is  un¬ 
changed. 

Furthermore  the  entire  insertion  procedure  requires 
0{logn)  steps  since  at  most  0{logn)  values  are  stacked 
or  popped  while  recomputing  CHANGE  on  P(p).  And 
LOCATE  CHANGE  is  called  only  once. 

A  similar  procedure  to  update  CHANGE  is  used 
when  a  point  is  deleted.  Firstly,  note  that  only  0(1)  lo¬ 
cations  which  require  a  binary  search  are  performed  dur¬ 
ing  a  deletion.  Thus  0(1)  LOCATE  CHANGE  calls 
which  requires  0{logn)  operations  are  required.  When 
kink  edges  are  added  only  0(1)  work  is  required  to  up¬ 
date  CHANGE.  Modification  of  CHANGE  along  the 
deletion  path  is  done  using  stack  manipulations  as  in  the 
insertion  case  and  requres  0{logn)  time. 

We  next  show  how  to  rebalance.  We  consider  single 
rotations  only.  The  procedure  is  similar  for  double  rota^ 
tions.  Let  v  be  the  node  at  which  a  rebalancing  opera¬ 
tion  is  performed.  At  the  nodes  affected  by  the  rebalanc¬ 
ing  CHANGE  is  obtained  by  the  procedure  LOCATE 
CHANGE  described  above.  Also  the  rebalancing  may 
affect  the  value  of  CHANGE  above  v.  Note  that  rebal¬ 
ancing  changes  the  value  of  INTER  at  two  nodes.  The 
procedure  for  updating  CHANGE  above  v  is  similar  to 
that  for  updating  CHANGE  at  nodes  above  Cp  in  the 
insertion  case  when  point  p  is  inserted.  Note  that  the 
initial  nodes  to  be  stacked  to  compute  CHANGES  are 
chosen  from  the  nodes  affected  by  the  balancing.  Thus 
updates  to  CHANGE  after  rebalancings  can  be  done  in 
0[logn)  operations. 

We  state  our  final  result: 

Theorem  1  The  maximas  of  a  set  of  point  can  he  main¬ 
tained  in  a  data  structure  which  uses  0(n)  space  and 
which  requires  0(logn)  operations  per  update  when  0(n) 
points  are  deleted  or  inserted.  The  output  complexity  is 
0{m+[chng)logn)  whenm  points  are  reported  and  when 
there  have  been  chng  changes  since  the  last  report. 

5  An  improved  amortized 
Bound 

In  this  section  we  show  how  the  maximcis  can  be  reported 
in  linear  amortized  time. 

To  improve  the  reporting  bound  we  note  that  the 
O(logn)  factor  in  the  reporting  time  bound  essentially 
arises  because  a  point  repeatedly  occurs  in  the  changes 
in  the  maximal  set,  i.e.  it  occurs,  is  removed  from  the 
maximal  set  by  an  insertion,  and  then  re-occurs  in  the 
set  of  maximas.  Thus  its  location  in  the  staircase  is  to 
be  determined  at  each  step.  To  remedy  this  we  keep 
this  location  as  follows.  Let  p  be  a  point  and  let  v  be  a 
node  in  the  tree  where  the  point  p  first  occurs  within  a 


semi-staircase,  i.e.  not  at  the  top  of  the  semi-staircase 
at  V.  At  this  node  the  point  has  to  be  located  within  a 
staircase  structure.  This  location,  LOC{p),  is  important 
since  it  may  need  to  be  redetermined  during  deletions. 
So  we  keep  this  location  with  the  point.  However  this  lo¬ 
cation  is  not  updated  at  every  insertion  or  deletion  since 
only  the  top  points  in  a  staircase  are  manipulated.  Thus 
when  this  point  is  to  be  output  the  value  of  LOC{p) 
may  need  to  be  updated.  If  the  location  requires  a  bi¬ 
nary  search  the  time  for  this  search  is  charged  to  a  point 
that  is  inserted  or  deleted  and  changes  this  value.  We 
prove  this  below: 

Lemma  3  Every  update  of  LOG {p)  for  p  €  M{S),  the 
set  of  maximas  of  a  point  set  S,  can  be  charged  uniquely 
to  an  insertion  or  a  deletion. 

Proof:  To  prove  the  lemma,  suppose  LOC{p)  for  point  p 
is  modified.  Let  u  be  the  node  at  which  LOC{p)  is  mod¬ 
ified  and  let  v  be  the  son  of  u  on  P{p)  where  P(p)  is  the 
path  from  the  leaf  to  the  root.  The  relocation  is  required 
when  p  is  added  to  the  set  of  maximas.  The  modification 
must  be  due  to  an  insertion  or  a  deletion  since  the  last 
time  p  was  added  to  the  set  of  maximcis.  Since  the  Top  of 
every  node  is  updated  during  the  insertion  procedure,  p 
cannot  be  at  the  top  of  the  staircases  at  v  when  insertions 
affect  V.  Thus  p  must  have  become  the  topmost  node  of 
a  staircase  structure  due  to  a  deletion  and  moreover  the 
top  node  of  the  staircase  structure  at  v  just  prior  to  this 
event  must  be  above  the  topmost  node  of  the  sibling 
else  p  would  have  been  located,  (we  are  referring  here 
to  the  deletion  procedure  described  in  section  4).  Thus 
if  p  is  to  be  relocated  by  a  search  requiring  more  than 
0(1)  time  the  corresponding  section  of  the  staircase  at 
w,  the  sibling  of  v  must  have  been  modified.  The  loca¬ 
tion  is  charged  to  the  deletion  or  insertion  that  modified 
the  section.  Note  that  other  points  which  have  the  same 
LOC  value  as  p  are  also  affected  by  the  same  deletion 
or  insertion.  Let  this  set  be  called  OTHER.  The  points 
in  OTHER  are  dominated  by  p  since  p  is  maximal  and 
the  location  of  p  introduces  kink  edges  from  these  points 
to  the  located  point.  Moreover  the  points  in  OTHER 
can  be  ordered  by  domination  since  they  lie  along  the 
path  to  the  leaf  where  LOC{p)  is  present  in  the  Maxima 
Tree.  The  topmost  kink  edge  becomes  important  and  is 
removed  either  when  p  is  deleted  or  when  the  maximas 
are  to  be  reported.  Either  the  first  point  in  OTH ER  is 
in  the  set  of  maximeis  or  some  other  point  dominating  it 
is.  The  location  at  this  step  is  charged  to  p  and  points  in 
OTHER  associated  with  the  point  located  at  this  step. 

To  summarize  the  details:  With  each  point,  p  we  asso¬ 
ciate  a  set  OTHER{p).  This  is  the  set  of  points  whose 
LOC  will  be  affected  due  to  the  deletion  of  p.  The  relo¬ 
cation  of  a  point  occurs  when  the  point  becomes  a  maxi¬ 
mal  element  and  there  is  a  kink  line  from  the  point  in  the 
staircase  structure.  This  happens  in  two  ways.  Firstly 
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when  the  point,  p  becomes  the  Top  point  at  a  node  dur¬ 
ing  a  deletion  of  a  node  which  was  the  old  Top  point  of 
the  staircase  structure  containing  p  at  the  node  and  when 
LOC{p)  has  changed.  The  other  event  happens  when  the 
point,  p,  is  to  be  located  onto  a  staircase  structure  on 
which  the  deleted  point  was  located.  In  the  first  case 
the  location  is  charged  to  the  deleted  point  which  intro¬ 
duced  the  modified  semi-staircase  at  that  node.  Note 
that  only  one  location  is  done.  Other  points  connected 
by  kink  lines  and  which  are  encountered  during  the  loca¬ 
tion  are  assigned  to  the  located  point,  p,  by  adding  them 
to  the  set  OTHER[p)  since  these  points  will  be  affected 
due  to  the  deletion  of  p.  In  the  second  case  the  location 
of  p  is  charged  to  the  deleted  point  that  forced  the  loca¬ 
tion,  say  q,  and  the  elements  of  OTHER{q)  are  assigned 
to  OTHER[p).  Thus  a  deleted  point  is  charged  twice, 
once  in  the  first  case  and  once  in  the  second.  The  second 
type  of  charging  is  part  of  the  deletion  process  whereas 
in  the  first  case  a  kink  is  removed  from  the  output  and 
a  previous  deletion  is  charged. 

Finally,  note  that  the  LOC  values  do  not  change  due 
to  rotations  and  double  rotations.  The  lemma  follows. 
■ 

Note,  while  maintaining  LOC{p),  CHANGE  values 
at  tree  nodes  also  need  to  be  maintained.  This  in¬ 
volves  computing  CHANGE  by  a  call  to  LOCATE 
CHANGE  and  updating  CHANGE  on  the  path  from 
the  leaf  to  the  root.  This  is  done  by  a  procedure  simi¬ 
lar  to  that  which  updates  CHANGE  during  insertions 
and  requires  O(logn)  steps.  We  thus  have  the  following 
result: 

Theorem  2  The  set  of  Maximas  of  a  set  of  points  in 
the  plane  can  be  maintained  in  0{nlogn  -f  mlogn)  oper¬ 
ations  when  there  are  n  insertions  and  m  deletions.  The 
data  structure  requires  linear  space.  Moreover,  r  maxi¬ 
mal  points  can  be  listed  out  in  0(r)  operations. 

6  Conclusions  and 

Acknowledgements. 

We  have  shown  efficient  schemes  for  dynamically  main¬ 
taining  the  set  of  maximas  of  a  set  of  points  in  2- 
dimensions.  We  do  so  by  maintaining  approximate  in¬ 
formation.  This  technique  can  also  be  applied  to  main¬ 
taining  maximas  in  higher  dimensions  with  less  effective 
results.  Maintaining  the  maximas  efficiently  under  both 
deletions  and  insertions  in  higher  dimensions  is  a  chal¬ 
lenging  problem.  This  technique  has  also  been  applied 
to  convex  hulls  by  the  author  but  the  results  do  not  give 
optimal  insertion  and  deletion  time  as  yet.  The  results 
will  be  reported  elsewhere.  Other  applications  of  this 
technique  would  be  interesting.  I  would  like  thank  A. 
Dutta  and  S.  Sen  for  helpful  discussions.  I  would  also 
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support  during  the  final  part  of  the  work. 
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Abstract 

We  present  a  method  for  maintaining  biased  search 
trees  so  as  to  support  fast  finger  updates  (i.e.,  updates 
in  which  one  is  given  a  pointer  to  the  part  of  the  tree 
being  changed).  We  illustrate  the  power  of  such  biased 
finger  trees  by  showing  how  they  can  be  used  to  derive 
an  optimal  O(nlogn)  algorithm  for  the  3-dimensional 
layers-of-maxima  problem  and  also  obtain  an  improved 
method  for  dynamic  point  location. 


1  Introduction 

Binary  search  trees  are  one  of  the  most  useful  data 
structures,  and  are  ubiquitous  throughout  the  design 
and  analysis  of  efficient  algorithms  [14].  In  some  cases 
they  serve  as  a  stand-alone  structure  (e.g.,  implement¬ 
ing  a  dictionary  or  a  heap),  while  in  many  cases  they 
are  used  in  tandem  with  other  structures,  either  as  pri¬ 
mary  or  secondary  structures  (or  both,  as  in  the  range 
tree  [36]).  In  many  dynamic  computational  geometry 
algorithms  they  may  even  be  found  as  tertiary  struc¬ 
tures,  e.g.,  Goodrich  and  Tamassia  [19]. 
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1.1  Background  and  Motivation 


When  a  binary  search  tree  T  is  maintained  dynamically 
as  a  primary  structure  it  is  appropriate  to  count,  as  a 
part  of  the  update  time,  the  time  to  perform  a  top-down 
search  for  the  node(s)  in  T  being  changed.  This  may 
not  be  appropriate  when  T  is  used  in  tandem  with  other 
structures,  however,  for  one  may  be  given,  as  part  of  the 
input  to  an  update  operation,  pointers,  or  “fingers”  [20, 
25,  22],  directly  into  the  part  of  T  being  changed.  Such 
a  pointer  could  come,  for  example,  from  a  query  in 
some  auxiliary  data  structure.  This  may,  in  fact,  have 
been  a  prime  motivating  factor  behind  the  method  of 
Huddleston  and  Mehlhorn  [22]  for  designing  a  dynamic 
search  tree  that  has  an  0(1)  update  time  performance 
for  insertions  and  deletions  when  the  search  time  is  not 
counted,  where  we  use  “O(-)  time”  to  refer  to  a  worst- 
case  time  bound  that  is  amortized  over  a  sequence  of 
updates. 

Another  important  variant  concerns  the  case  when 
each  item  i  in  the  search  tree  is  given  a  weight,  Wi. 
This  weight  is  a  positive  integer  that  may  be  propor¬ 
tional  to  an  access  probability,  as  in  an  optimal  binary 
search  tree  structure  [3,  24].  Or  it  may  represent  the 
size  of  some  auxiliary  structure  associated  with  item  i, 
as  in  a  link-cut  structure  [40]  (which  itself  has  many 
applications  [12,  17,  18])  or  in  a  point  location  struc¬ 
ture  built  using  the  trapezoid  method  [8,  35,  38].  In 
cases  with  weighted  items  such  as  these  one  desires  a 
search  tree  satisfying  a  bias  property  that  the  depth  of 
each  item  i  in  the  tree  be  inversely  proportional  to  w^. 
Bent,  Sleator  and  Tarjan  [5]  give  a  method  for  main¬ 
taining  such  a  structure  subject  to  update  operations, 
such  as  insertions,  deletions,  joins,  and  splits,  as  well  as 
predecessor  query  operations.  Most  of  their  update  and 
query  operations  take  0(log  W/wi)  time  (in  some  cases 
as  an  amortized  bound),  with  the  rest  taking  slightly 
more  time,  where  W  is  the  sum  of  all  weights  in  the 
tree. 
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1.2  Our  Results 

In  this  paper  we  examine  a  framework  for  achieving 
fast  finger-tree  updates  in  a  biased  search  tree,  and  we 
refer  to  the  resulting  structure  as  a  biased  finger  tree. 
We  know  of  no  previous  work  for  a  structure  such  as 
this.  We  show  that  insertions  and  deletions  in  a  biased 
finger  tree  can  be  implemented  in  0(logw;i)  time,  not 
counting  search  time,  while  still  maintaining  the  prop¬ 
erty  that  each  item  i  is  at  depth  0(logW/wi)  in  the 
tree.  Moreover,  we  show  that,  while  split  operations 
will  take  OilogW/wi)  time  (which_is  unavoidable),  we 
can  implement  join  operations  in  0(1)  time. 

Our  structure  is  topologically  equivalent  to  that 
given  by  Bent,  Sleator,  and  Tarjan  [5].  In  fact,  if  each 
item  i  has  weight  rcj  =  1 ,  then  our  structure  is  topologi¬ 
cally  equivalent  to  a  red-black  tree  [14,  21,  41].  It  is  our 
update  methods  and  amortized  analysis  that  are  differ¬ 
ent,  and  this  is  what  allows  us  to  achieve  running  times 
that  are  significant  improvements  over  those  obtained 
by  Bent,  Sleator,  and  Tarjan,  even  if  one  ignores  the 
search  times  in  their  update  procedures.  Moreover,  we 
provide  an  alternative  proof  that  red-black  trees  sup¬ 
port  constant-time  amortized  finger  updates  (which  is 
a  fact  known  to  folklore) . 

We  show  the  utility  of  the  biased  finger  tree  structure 
by  giving  an  optimal  0(n  log  n)-time  space-sweeping 
algorithm  for  the  well-known  3-dimensional  layers-of- 
maxima  problem  [2,  7,  15,  27].  We  also  give  improved 
methods  for  dynamic  point  location  in  a  convex  subdi¬ 
vision  [35,  8],  and  present  a  method  for  dynamic  point 
location  in  staircase  subdivision  with  logarithmic  query 
and  update  times.  We  note  that  although  the  stair¬ 
case  subdivision  we  consider  is  a  very  restricted  form 
of  subdivision,  it  is  the  only  subdivision  we  know  of  for 
which  there  is  a  method  achieving  logarithmic  query 
and  update  times.  In  addition,  the  optimal  algorithm 
for  three-dimensional  layers  of  maxima  problem  uses 
our  dynamic  data  structure  for  staircase  subdivision. 

2  Biased  Finger  Trees 

Suppose  we  are  given  a  totally  ordered  universe  U  of 
weighted  items,  and  we  wish  to  represent  a  collection 
of  disjoint  subsets  of  U  in  binary  search  trees  subject 
to  the  “standard”  tree  search  queries,  as  well  as  item 
insertion  and  deletion  in  a  tree,  and  join  and  split  oper¬ 
ations  on  trees  (consistent  with  the  total  order).  Aho, 
Hopcroft,  and  Ullman  [3]  refer  to  these  as  the  concaten- 
able  queue  operations. 

In  this  section  we  describe  a  new  data  structure  that 
efficiently  supports  all  of  the  these  operations.  So  as 
to  concentrate  on  the  changes  required  by  an  update 
operation,  we  will  assume  that  each  update  operation 
comes  with  a  pointer  to  the  node(s)  in  the  tree(s)  where 
this  update  is  to  begin.  Formally,  we  define  our  update 
operations  as  follows; 


Insert(f, , T)  :  Insert  item  i  with  weight  Wi  into 
T,  where  p^-  is  a  pointer  to  the  predecessor,  , 
of  i  in  T  (if  i  has  no  predecessor  in  T,  then  we  let 
this  point  to  i’s  successor). 

Delete(f,pi,T)  :  Remove  item  i  from  the  tree  T,  given 
a  pointer  pi  to  the  node  storing  i. 

Split(f,T)  :  Partition  T  into  three  trees;  T,  which  con¬ 
tains  all  items  in  T  less  than  i,  the  item  i  itself, 
and  Tr,  which  contains  all  items  in  T  greater  than 

i. 

Join(T2;,  Ty)  ;  Construct  a  single  tree  from  and  Ty, 
where  all  the  items  in  are  smaller  than  the  items 
in  Ty. 

Change-weight(f,r(;',T,pi)  ;  Change  the  weight  of  the 
item  f  in  T  from  to  Wi' ,  given  the  pointer  p,  to 
the  node  storing  the  item  i. 

Slice{i,T,x,ii,i2)  '■  Break  the  item  f  in  T  into  two 
items  ii  and  12  such  that  i~  <  ii  <12  <  and 
the  weight  of  item  ii  is  x  *  Wi  and  the  weight  of 
item  Z2  is  (1  —  x)  *  Wi,  where  0  <  a;  <  1,  and  i~ 
and  are  predecessor  and  successor  items  of  i  in 
T  respectively. 

Fuse(ii,  12, b T)  :  Combine  the  items  ii  and  ^2  in  T 
into  a  single  item  i  of  weight  Wi^  +  Wi^  such  that 
fi  <  f  <  *2  in  T.  The  items  ii  and  12  need  not  be 
siblings,  but  should  be  adjacent  in  the  total  order. 

As  mentioned  above,  our  structure  is  topologically 
similar  to  the  biased  search  tree^  of  Bent,  Sleator  and 
Tarjan  [5].  Our  methods  for  updating  and  analyzing 
these  structures  are  significantly  different,  however,  and 
achieve  run  times  better  than  those  of  Bent  et  al.  in 
most  cases  (see  Table  1). 

We  assume  that  items  are  stored  in  the  leaves,  and 
each  internal  node  stores  two  items,  left  and  right, 
which  are  pointers  to  the  largest  item  in  the  left  subtree 
and  the  smallest  item  in  the  right  subtree,  respectively. 
In  addition,  the  root  maintains  pointers  to  the  mini¬ 
mum  and  maximum  leaf  items.  Every  node  x  of  the 
tree  stores  a  non-negative  integer  rank  r{x)  that  satis¬ 
fies  the  natural  extensions  of  red-black  tree  rank  [41]  to 
weighted  sets  [5]; 

1.  If  a;  is  a  leaf,  then  r{x)  =  [logWiJ,  where  i  is  the 
item  X  stores. 

2.  If  node  x  has  parent  y,  then  r(x)  <  r{y);  if  x  is 
a  leaf,  then  r(x)  <  r{y)  -  1.  Node  x  is  major  if 
r(x)  =  r{y)  —  1  and  minor  if  r(x)  <  r{y)  —  1. 

3.  If  node  x  has  grandparent  y,  then  r(x)  <  r{y)  —  1. 

In  addition  to  the  above  rank  conditions,  we  also  require 
that  a  node  be  minor  if  and  only  if  its  sibling  or  a  child 

^Bent,  Sleator  and  Tarjan  actually  introduce  two  kinds  of  bi¬ 
ased  search  trees;  our  biased  finger  trees  are  structurally  equiva¬ 
lent  to  the  ones  they  call  locally  biased. 
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Update  Operation 

Previous  Biased  Trees  [5] 

Biased  Finger  Trees 

Search(i,T) 

Insert(i,  Wi,p,- ,  T) 

Delete/,  T,  p,) 

Split(i,  T) 

Join(U,Tj,) 

Change-Weight  (I,  wu  ,T,p,) 

Slice/,  T,x,ii,i2) 

Fuse/i,/, i,T) 

0/ogW/w,) 

^  min(u;__  .tu.)  ) 

0/ogW/w,) 

O/ogW/wf 

0{\lOgWx/Wy\) 

^(lo  (maxQ^)) 

O/ogW/wi) 

0(|log^|  +  l) 
0/ogw,) 

O/ogW/wf 

0(1) 

0(1  log  wfwuO 

O(log  —r^ - r) 

0(min(logWjj,logu;jJ) 

Table  1:  Summary  of  Time  Bounds  for  Biased  Tree  Operations.  W  and  W  are  the  sum  of  the  weights  of 
all  the  items  before  and  after  the  update  operation  respectively,  (Wy)  denotes  the  sum  of  weights  in  Tj,  (Ty), 
and  i~  (i+)  denotes  the  predecessor  (successor)  of  i.  All  the  complexities  for  update  operations  only  count  the 
time  to  perform  the  update,  and  do  not  include  search  times. 


of  its  sibling  is  a  major  leaf  [5].  We  refer  to  this  as  the 
bias  property. 

In  the  remainder  of  this  section  we  provide  algo¬ 
rithms  for  various  update  operations  on  biased  finger 
trees,  and  also  analyze  their  amortized  complexities. 

2.1  Rebalancing  a  Biased  Finger  Tree 

We  begin  our  discussion  by  analyzing  the  time  needed 
to  rebalance  a  biased  tree  after  an  update  has  occurred. 
We  use  the  banker’s  view  of  amortization  [42]  to  ana¬ 
lyze  the  rebalancing  and  update  operations.  With  each 
node  a;  of  a  biased  finger  tree  we  associate^  a  value, 
C{x),  of  “credits”,  with  0  <  C(x).  Moreover,  we  par¬ 
tition  these  credits  into  three  types — one  type  that  is 
similar  to  those  used  in  the  analysis  of  Bent,  Sleator, 
and  Tarjan  [5],  one  type  that  assigns  1  credit  to  the 
nodes  on  the  spine  of  the  tree,  and  one  type  suggested 
by  Kosaraju  [26].  We  omit  details  here. 

After  an  update  operation,  we  perform  promote  or 
demote  operations  on  the  ranks  of  some  of  the  nodes 
of  the  biased  finger  tree,  which  increase  or  decrease 
the  rank  of  the  nodes,  respectively.  These  operations 
locally  preserve  the  rank  properties,  but  may  cause  vi¬ 
olation  of  the  rank  property  on  other  nodes,  which  may 
require  further  promotions  or  demotions  or  may  even 
require  rebalancing  to  globally  preserve  the  rank  prop¬ 
erties.  We  show  that  the  total  complexity  of  promotion, 
demotion  and  rebalancing  operations  due  to  a  single 
promote  or  demote  operation  is  0(1).  The  structure  of 
our  case  analysis  follows  closely  that  Tarjan  [41]  used 
for  red-black  trees.  We  again  omit  the  details  here,  and 
in  the  full  version  prove  the  following: 

Lemma  2.1  The  total  complexity  of  promotion,  demo¬ 
tion  and  rebalancing  operations  on  a  biased  finger  tree 

^This  credit  notion  is  only  used  for  analysis  purposes.  No 
actual  credits  are  stored  anywhere. 


due  to  a  single  promote/ demote  operation  is  0(1)  (ac¬ 
tually  at  most  8  credits).  Also,  each  operation  adds  at 
most  two  pairs  of  equal  rank  siblings  to  the  tree. 

2.2  Update  Operations 

We  now  discuss  the  methods  for  various  update  opera¬ 
tions.  We  begin  with  the  join  operation.  We  describe  a 
“bottom-up”  strategy,  which,  contrasts  with  the  “top- 
down”  approach  of  Bent,  Sleator  and  Tarjan  [5]. 

Join:  Consider  the  join  of  two  biased  trees  U  and  Ty. 
Let  u  and  v  be  the  rightmost  leaf  and  the  leftmost  leaf 
of  Tj;  and  Ty  respectively.  Let  w  and  I  be  the  parent  of 
u  and  V  respectively  (see  Fig.  1).  The  nodes  u  and  v 
can  be  accessed  using  the  pointers  in  the  root  nodes  x 
and  y  respectively.  We  have  the  following  cases: 

Case  1.  r{x)  =  r{y).  In  this  case  we  create  a  new 
node  2:  with  U  as  the  left  subtree  and  Ty  as  the 
right  subtree,  and  we  assign  a  rank  of  r{x)  -b  1  to 
2.  We  then  proceed  up  the  tree  as  in  the  promote 
operation. 

Case  2.  r{x)  <r(y).  In  this  case  we  traverse  the  right¬ 
most  path  of  Tx  and  the  leftmost  path  of  Ty,  both 
bottom  up,  in  the  increasing  order  of  ranks  of  the 
spine  nodes.  As  we  proceed  we  store  the  pointers 
to  the  nodes  encountered  and  also  tags  indicating 
whether  they  are  from  U  or  Ty  in  an  array  A  or¬ 
dered  by  node  ranks.  We  also  keep  track  of  the 
nodes  of  equal  ranks  last  encountered  in  the  two 
paths  and  we  terminate  the  traversal  when  reach¬ 
ing  the  root  x.  Suppose  t  is  the  smallest  rank  node 
along  the  leftmost  path  of  Ty  having  rank  greater 
than  X.  Suppose  a  and  b  are  the  last  encountered 
equal  rank  nodes  during  the  traversal,  and  note 
that  the  node  x  was  stored  as  the  last  node  in  ar¬ 
ray  A.  We  attach  x  along  with  its  left  subtree  to 
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Figure  1:  Join  of  trees  and  Ty. 


t  as  the  left  child  of  t.  For  the  other  nodes  in  A, 
we  proceed  as  follows  (see  Fig.  1):  Consider  the 
next  node,  c,  in  the  array  A.  Suppose  c  is  part  of 
Tx-  If  the  successor  of  c  in  A  is  a  node  d  from  T^, 
we  attach  c  and  its  left  subtree  as  a  right  child  of 
d.  If  the  successor  of  c  in  ^  is  a  node  d  from  Ty, 
we  attach  c  and  its  left  subtree  as  a  left  child  of  d. 
Suppose  c  is  part  of  Ty.  If  the  successor  of  c  in  >1  is 
a  node  d  from  Tx ,  we  attach  c  and  its  right  subtree 
as  a  right  child  of  d.  If  the  successor  of  c  in  ^  is  a 
node  d  from  Ty,  we  attach  c  and  its  right  subtree 
as  a  left  child  of  d.  We  continue  this  process  until 
the  nodes  a  and  b  are  encountered.  Then,  we  cre¬ 
ate  a  new  node  z  with  To  and  Tb  as  left  and  right 
subtrees  respectively.  We  assign  a  rank  of  r'(o)  -f  1 
to  z  and  rebalance  if  required  through  a  promote 
operation.  This  terminates  the  join.  If  there  are 
no  equal  rank  spine  nodes  a  and  b,  then  we  join  all 
the  nodes  in  the  array  A  in  the  above  manner. 

Case  3.  r{y)  <  r{x).  Symmetrical  to  above  case. 

Analysis:  We  show  in  the  full  version  that  the  to¬ 
tal  number  of  credits  needed  to  perform  this  update  is 
0(1).  This,  and  Lemma  2.1,  establishes  that  the  run¬ 
ning  time  for  a  join  is  0(1). 

Split:  We  perform  the  split  operation  as  in  [5].  We 
show  that  with  the  same  complexity  we  can  preserve 
all  three  types  of  credits  in  the  nodes  of  the  resulting 
trees. 

Insertion:  Consider  the  insertion  of  an  item  i  with 
weight  Wi  to  a  biased  finger  tree  T.  Recall  that  i~ 
denotes  the  immediate  predecessor  of  item  z  in  T  if  it 
exists,  and  immediate  successor  of  z  in  T  otherwise.  We 
provide  a  pointer  to  z“.  In  the  full  version  we  describe 
how  to  perform  a  bottom-up  insertion  from  i~ .  The 
most  interesting  case  occurs  when  the  new  item  has 
weight  much  larger  than  i~,  for  we  must  then  splay  i 
up  the  tree  to  its  proper  position  while  joining  together 


the  trees  whose  roots  become  siblings  in  this  process 
(so  as  to  maintain  our  minor-node  bias  property).  We 
show  that  this  can  be  done  in  0{\  logWi/wi-  |)  time. 
Deletion:  Consider  the  deletion  of  an  item  z  with 
weight  Wi  from  the  biased  finger  tree  T  for  which  a 
pointer  to  the  item  z  is  provided.  This  operation  is  simi¬ 
lar  to  that  of  insertion  and  we  have  different  cases  based 
on  whether  the  node  deleted  is  minor/major.  We  use 
different  operations  to  preserve  bias  property  in  each 
case.  We  give  details  in  the  full  version  where  we  show 
the  complexity  of  deletion  to  be  O(logzni). 

We  summarize: 

Theorem  2.2  One  can  maintain  a  collection  of  biased 
search  trees  subject  to  tree  searches,  element  insertion 
and  deletion,  change  of  weight  of  element,  slicing  and 
fusing  of  elements,  as  well  as  tree  joining  and  splitting, 
in  the  bounds  quoted  in  Table  1  for  biased  finger  trees, 
not  counting  the  search  times. 

Bent  et  al.  [5]  show  that  repeated  single-node  joins 
on  the  right  hand  side  can  construct  a  biased  finger  tree 
in  0(n)  worst-case  time.  In  our  case,  however,  we  can 
show  the  following: 

Theorem  2.3  Any  sequence  of  joins  that  constructs  a 
biased  finger  tree  ofn  items  can  be  implemented  in  0{n) 
worst-case  time. 

Proof:  The  proof  follows  immediately  from  the  fact 
that  our  join  algorithm  on  biased  finger  trees  takes  0(1) 
time.  H 

Let  us  now  turn  our  attention  to  some  non-trivial 
applications. 

3  The  Layers- of-Maxima  Prob¬ 
lem 

In  this  section,  we  use  the  biased  finger  tree  data  struc¬ 
ture  to  solve  an  open  (static)  computational  geometry 
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Figure  2:  Computation  of  Layer  for  a  New  Point  p. 


problem:  the  3-dimensional  layers-of-maxima  problem. 
Before  we  describe  our  method,  however,  we  introduce 
some  definitions.  A  point  p  £  dominates  a  point 
q  €  if  x{q)  <  x{p),  y{q)  <  y{p),  and  z{q)  <  z{p). 
Given  a  set  S  of  points  in  5?®,  a  point  p  is  a  maximum 
point  in  S,  if  it  is  not  dominated  by  any  other  point  in 
S.  We  define  the  dominance  region  of  a  point  p  in  as 
a  set  Z?  C  3?®  such  that  p  dominates  every  point  q  £  D. 
The  maxima  set  problem  is  to  find  all  the  maximum 
points  in  S.  Kung,  Luccio,  and  Preparata  [27]  showed 
that  this  problem  can  be  solved  in  0(n  log  n)  time.  In 
the  related  layers-of-maxima  problem,  one  imagines  an 
iterative  process,  where  one  finds  a  maxima  set  M  in 
S,  removes  all  the  points  of  M  from  5,  and  repeats  this 
process  until  S  is  empty.  The  iteration  number  in  which 
a  point  p  is  removed  from  S  is  called  p’s  layer,  and  we 
denote  it  by  l{p),  and  the  layers-of-maxima  problem 
is  to  determine  the  layer  of  each  point  p  in  S.  This 
is  related  to  the  well-known  convex  layers  problem  [7], 
and  it  appears  that  it  can  be  solved  for  a  3-dimensional 
point  set  S  in  0(n  log  n  log  log  n)  time  [1]  using  the  dy¬ 
namic  fractional  cascading  technique  of  Mehlhorn  and 
Naher  [32].  We  show  how  to  solve  the  3-dimensional 
layers-of-maxima  problem  in  0(n log n)  time,  which  is 
optimal 

We  solve  this  problem  using  a  three-dimensional 
sweep,  and  a  dynamic  method  for  point  location  in  a 
staircase  subdivision.  Given  a  set  5  of  n  points  in  5R^, 
we  first  sort  the  points  along  the  2  axis,  and  then  sweep 
the  points  in  the  decreasing  order  of  their  2:  coordinates 
to  construct  the  maxima  layers.  When  we  sweep  across 
a  point,  we  compute  its  layer  using  the  information 
about  the  layers  computed  so  far.  The  information  we 
maintain  for  each  layer  is  the  union  of  the  dominance 

simple  linear  time  reduction  can  be  shown  from  sorting 
problem  to  three-dimensional  layers-of-maxima  problem,  thereby 
showing  a  lower  bound  of  n(nlogn)  for  three-dimensional  layers 
of  maxima  problem. 


regions  of  the  points  in  that  layer.  We  denote  this  by 
T){1),  for  a  layer  1.  We  show  that  to  correctly  compute 
the  layers,  it  is  sufficient  to  maintain  for  each  layer  I,  the 
boundary  of  the  intersection  of  the  sweep  plane,  say  tt, 
with  'D{1).  The  intersection  region  is  two  dimensional, 
and  we  call  its  boundary  a  staircase. 

The  shape  of  the  staircase  representing  a  layer 
changes,  as  we  continue  the  sweep.  During  the  sweep, 
we  maintain  only  a  subset  of  points  that  belong  to  each 
layer.  This  is  because,  if  points,  say  p  and  q,  belong  to 
a  layer  I,  and  if  the  projection  of  dominance  region  of 
p  onto  TT  dominates  that  of  q,  then  point  q  will  not  be 
part  of  the  boundary  of  intersection.  This  simplifies  the 
identification  of  layer  for  a  new  point.  Let  5'  C  5  be  the 
current  set  of  points  maintained  by  the  algorithm.  We 
show  that  S'  has  the  property  that  the  the  staircases 
corresponding  to  the  layers  of  points  in  S',  subdivide 
the  xy-plane  into  disjoint  regions  (as  defined  earlier). 
We  call  this  a  staircase  subdivision.  We  show  that  this 
property  is  preserved  at  each  step  of  the  sweep,  when 
we  compute  the  layer  for  a  new  point.  Hence,  at  any  in¬ 
stant  the  current  set  of  maxima  layers  form  a  staircase 
subdivision,  and  we  reduce  the  computation  of  layer 
for  a  new  point  to  operations  on  the  staircase  subdivi¬ 
sion.  We  call  the  region  in  the  subdivision  between  two 
staircases,  a  face.  Hence,  if  there  are  m  layers,  they 
subdivide  the  a:y-plane  into  m  -I-  1  faces.  The  projec¬ 
tion  of  each  new  point  onto  the  xy-plane  belongs  to  a 
unique  face  among  these  m  H-  1  faces.  We  work  with 
the  staircase  subdivision,  and  the  projection  of  a  new 
point,  to  identify  the  point’s  layer. 

The  algorithm  for  computing  the  layer  number  of  a 
new  point  p  is,  then,  as  follows: 

1.  Identify  the  two  staircases  in  the  staircase  subdi¬ 
vision  between  which  the  new  point  p  lies.  Assign 
p  to  higher-numbered  layer  of  these  two.  For  ex¬ 
ample,  in  Figure  2,  p  lies  between  the  layers  1  and 
2,  and  gets  assigned  to  layer  2.  If  p  lies  below  the 
highest-numbered  layer,  say  m,  then  assign  p  to  a 
new  layer  m  -H  1 . 

2.  Compute  the  horizontal  segment  h,  and  the  verti¬ 
cal  segment  v  from  p,  which  hit  the  boundary  or 
some  layer  (of  course  the  layer  hit  by  h  is  the  same 
layer  that  is  hit  by  v,  and  has  same  number  as  p’s 
just  computed  layer). 

3.  Insert  the  segment  h  and  the  segment  v  into  the 
subdivision. 

4.  Delete  the  segments  in  the  layer  /(p),  which  are 
dominated  by  p  in  the  2:p-plane.  For  example,  in 
Figure  2,  we  delete  segments  in  portion  A  of  layer 
2. 

Correctness  and  Analysis:  We  now  show  that  each 
new  point  p  is  identified  with  its  correct  layer.  We  use 
Figure  2  to  illustrate  the  idea.  Consider  layer  2.  The 


154 


staircase  of  layer  2  is  the  boundary  of  the  intersection  of 
the  union  of  the  dominance  regions  of  the  points  in  the 
layer  2  with  the  sweep  plane  tt.  Since  the  points  are  pro¬ 
cessed  in  decreasing  order  of  their  2;  coordinates,  when 
we  insert  p,  the  dominance  region  of  point  p  does  not 
dominate  any  of  the  points  in  layer  2.  Also  the  points 
in  layer  2  do  not  dominate  p  along  x  and  y  coordinates. 
So,  point  p  belongs  to  layer  2  (i.e.,  l{p)  =  2),  as  identi¬ 
fied  by  the  algorithm.  Now  to  update  the  boundary  of 
intersection  with  tt,  we  delete  the  portion  A  from  layer 
2,  and  include  the  segments  h  and  v  into  the  subdivi¬ 
sion.  To  see  why  only  the  boundary  is  maintained,  we 
observe  that  if  point  q  is  introduced  later,  the  algorithm 
will  identify  q  with  layer  l{p).  But  q  does  not  belong  to 
layer  l{p)  ,  since  q  is  dominated  by  p.  So,  it  should  ei¬ 
ther  initiate  a  new  layer,  or  belong  to  an  existing  layer 
(see  Figure  2).  In  any  case,  l{q)  =  l{p)  +  1.  Also,  we 
observe  that  after  deleting  points  in  portion  A,  the  new 
boundary  for  layer  2  is  in  the  form  of  a  staircase,  thus 
preserving  the  property. 

Suppose  location,  insertion  and  deletion  of  a 
vertex/edge  in  a  staircase  subdivision  take  time 
Q{n),  I{n),  and  D{n)  time  respectively.  We  implement 
step  1  as  a  point  location  in  staircase  subdivision,  and 
it  takes  0{Q{n))  time.  We  represent  the  staircase  cor¬ 
responding  to  each  layer  by  a  dictionary,  for  ordering 
along  X  and  y  axes  (since  both  are  same  ordering).  Us¬ 
ing  these  data  structures,  we  compute  in  O(logn)  time 
the  horizontal  segment  h,  and  the  vertical  segment  v 
for  a  new  point  p.  Hence,  step  2  takes  O(logn)  time. 
Therefore,  the  total  time  for  computing  the  layer  of 
each  point  is  0(logn  -I-  Q{n)  -b  I{n)  +  k*  D{n)),  where 
k  is  the  number  of  points  deleted  in  step  4.  Since  each 
point  is  deleted  at  most  once,  and  is  not  inserted  back, 
we  amortize  the  cost  of  k  deletions  on  each  of  the  k 
points.  Hence,  the  complexity  of  computing  layer  of 
each  new  point  is  (5(logn  +  Q{n)  +  I{n)  +  D{n)).  In 
the  next  section,  we  show  a  method  which  achieves 
Q{n)  =  I{n)  =  D{n)  =  O(Iogn),  resulting  in  a 
0{logn)  algorithm  for  computing  the  layer  of  a  single 
point.  This  gives  us  an  O(nlogn)  algorithm  for  com¬ 
puting  the  three  dimensional  layers-of-maxima,  and  is 
optimal. 

4  Dynamic  Point  Location 

In  this  section,  we  address  the  general  problem  of  dy¬ 
namic  point  location  in  a  convex  subdivision.  So, 
suppose  we  are  given  a  connected  subdivision  S  of 
the  plane  such  that  S  partitions  the  plane  into  two- 
dimensional  cells  bounded  by  straight  line  segments. 
The  point  location  problem  is  to  construct  a  data 
structure  that  allows  one  to  determine  for  any  query 
point  p  the  name  of  the  cell  in  S  that  contains  p  (see 
[13,  15,  16,  23,  28,  29,  35,  36,  39]).  It  is  well-known 
that  one  can  construct  a  linear-space  data  structure  for 


answering  such  queries  in  O(logn)  time  [13,  16,  23,  39]. 

These  optimal  data  structures  are  static,  however, 
in  that  they  do  not  allow  for  any  changes  to  S  to  oc¬ 
cur  after  the  data  structure  is  constructed.  There  has, 
therefore,  been  an  increasing  interest  more  recently  into 
methods  for  performing  point  location  in  a  dynamic 
setting,  where  one  is  allowed  to  make  changes  to  S, 
such  as  adding  or  deleting  edges  and  vertices.  It  is 
easy  to  see  that,  by  a  simple  reduction  from  the  sort¬ 
ing  problem,  a  sequence  of  n  queries  and  updates  to 
S  requires  U(nlogn)  time  in  the  comparison  model, 
yet  there  is  no  existing  fully  dynamic  framework  that 
achieves  O(logn)  time  for  both  queries  and  updates 
(even  in  an  amortized  sense) .  The  currently  best  meth¬ 
ods  are  summarized  in  Table  2.  The  results  in  that  ta¬ 
ble  are  distinguished  by  the  assumptions  they  make  on 
the  structure  of  S.  For  example,  a  cont;ea;  subdivision  is 
one  in  which  each  face  is  convex  (except  for  the  external 
face),  a  staircase  subdivision  is  one  in  which  each  face 
is  a  region  bounded  between  two  infinite  staircases,  a 
rectilinear  subdivision  is  one  in  which  each  edge  is  par¬ 
allel  to  the  X-  or  y-axis,  a  monotone  subdivision  is  one 
in  which  each  face  is  monotone  with  respect  to  (say) 
the  a;-axis,  a  connected  subdivision  is  one  which  forms 
a  connected  graph,  and  a  general  subdivision  is  one  that 
may  contain  “holes.”  The  interested  reader  is  referred 
to  the  excellent  survey  by  Chiang  and  Tamassia  [9]  for 
a  discussion  of  these  and  other  results  in  dynamic  com¬ 
putational  geometry. 

4.1  Our  Data  Structure 

Suppose  we  are  given  a  convex  subdivision  S  that  we 
would  like  to  maintain  dynamically  subject  to  point  lo¬ 
cation  queries  and  edge  and  vertex  insertions  and  dele¬ 
tions.  As  mentioned  above,  our  method  for  maintain¬ 
ing  S  is  based  upon  a  dynamic  implementation  of  the 
“trapezoid  method”  of  Preparata  [35]  for  static  point 
location.  Incidentally,  this  is  also  the  approach  used  by 
Chiang  and  Tamassia  [8],  albeit  in  a  different  way.  Let 
us  assume,  for  the  time  being,  that  the  a;-coordinates 
of  the  segment  endpoints  are  integers  in  the  range  [1 ,  n] 
(we  will  show  later  how  to  get  around  this  restriction  us¬ 
ing  the  the  BB[a]  tree).  We  define  our  structure  recur¬ 
sively,  following  the  general  approach  of  Preparata  [35] . 
Our  structure  is  a  rooted  tree,  T,  each  of  whose  nodes 
is  associated  with  a  trapezoid  t  whose  parallel  bound¬ 
ary  edges  are  vertical.  We  imagine  the  trapezoid  r  as 
being  a  “window”  on  S,  with  the  remaining  task  being 
that  of  locating  a  query  point  in  S  restricted  to  this 
trapezoidal  window.  With  the  root  of  T  we  associate  a 
bounding  rectangle  for  S. 

■^Cheng  and  Janardan’s  update  method  is  actually  a  de- 
amortization  of  an  amortized  scheme  via  the  “rebuild-while-you- 
work”  technique  of  Overmars  [34]. 

^Our  method  can  actually  used  for  any  dynamic  point  location 
environment  satisfying  a  certain  pseudo-edge  property. 
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Type 

Queries 

Insert 

Delete 

Reference 

general 

0(lognlog  logn) 

0  (logn  log  logn) 

O(log^n) 

Baumgarten  et  al.  [4] 

connected 

(9(log^  n) 

O(logn) 

O(logn) 

Cheng- Janardan  [11]'^ 

connected 

O(logn) 

0(log®  n) 

O(log'n) 

Chiang  et  al.  [10] 

monotone 

O(logn) 

0(log^  n) 

O(log^n) 

Chiang-Tamassia  [8] 

monotone 

0(log^  n) 

O(logn) 

O(logn) 

Goodrich-Tamassia  [18] 

rectilinear 

0  (logn  log  logn) 

0(lognlog  logn) 

0  (logn  log  logn) 

Mehlhorn-Naher  [32] 

convex 

0(logn  logN") 

0  (logn  log  N") 

0(lognlog7V) 

Preparata-Tamassia  [38] 

convex® 

O(logn) 

O(logn) 

0(!og^  n) 

this  paper 

staircase 

O(logn) 

O(logn) 

O(logn) 

this  paper 

Table  2:  Previous  and  New  results  in  dynamic  point  location.  N  denotes  the  number  of  possible  y-coordinates  for 
edge  endpoints  in  the  subdivision. 


Let  V  therefore  be  a  node  in  T  with  trapezoid  r  asso¬ 
ciated  with  it.  If  no  vertex  or  edge  of  S  intersects  the 
interior  of  r,  then  we  say  that  r  is  empty,  in  which  case 
u  is  a  leaf  of  T.  Note  that  in  this  case  any  point  deter¬ 
mined  to  be  inside  r  is  immediately  located  in  the  cell 
of  S  containing  r.  Let  us  therefore  inductively  assume 
that  r  contains  at  least  one  endpoint  of  a  segment  in 
S.  There  are  two  cases: 

1.  There  is  no  face  of  S  that  intersects  r’s  left  and 
right  boundaries  while  not  intersecting  r’s  top  or 
bottom  boundary.  In  this  case  we  divide  r  in  two 
by  a  vertical  line  down  the  “middle”  (we  choose  a 
vertical  line  which  balances  the  height  of  the  tree 
on  both  sides)  of  r,  an  action  we  refer  to  as  a  ver¬ 
tical  cut.  This  creates  two  new  trapezoids  r;  and 
Tt,  which  are  ordered  by  the  “right  of”  relation. 
We  create  two  new  nodes  vi  and  Vr,  which  are 
respectively  the  left  and  right  child  of  v,  with  vi 
associated  with  n  and  Vr  associated  with  Tr- 

2.  There  is  at  least  one  face  of  S  that  intersects  both 
the  left  and  right  boundaries  of  r  and  does  not 
have  a  spanning  edge  of  r  as  its  top  or  bottom 
boundary.  In  this  case  we  “cut”  r  through  each 
of  the  faces  of  S  that  intersect  t’s  left  and  right 
boundaries.  This  creates  a  collection  of  trapezoids 
Ti,T2,  ■  ■  ■  ,Tk  ordered  by  the  “above”  relation.  We 
refer  to  this  action  as  a  collection  of  horizontal 
pseudo-cuts  (even  though  it  would  be  more  accu¬ 
rate  to  call  them  “non- vertical  pseudo-cuts”).  We 
associate  a  node  in  T  with  each  and  make 
this  set  of  nodes  be  the  children  of  u  in  T,  ordered 
from  left-to-right  by  the  “above”  relation  on  their 
respective  associated  trapezoids. 

Repeating  the  above  trapezoidal  cutting  operations 
recursively  at  each  child  of  v  creates  our  tree  T  (see 
Figure  3).  The  tree  T,  of  course,  cannot  yet  be  used  to 
perform  an  efficient  point  location  query,  since  a  node 
u  in  T  may  have  many  children  if  its  associated  action 
forms  a  collection  of  horizontal  cuts.  To  help  deal  with 


this  issue  we  define  the  weight  of  a  node  w  G  T  to  be  the 
number  of  leaf  descendents  of  v  in  T,  and  we  use  w{v)  to 
denote  this  quantity.  Given  this  weight  function,  then, 
we  store  the  children  of  each  node  u  in  T  as  leaves  in  a 
biased  finger  tree  Ty  and  doubly  link  all  the  leaves  of  T„. 
Of  course,  such  a  biased  finger  tree  is  a  trivial  tree  for 
each  node  v  corresponding  to  a  vertical  cut,  but  this  is 
not  a  problem,  for  it  gives  us  a  way  to  efficiently  search 
the  children  of  a  node  whose  corresponding  action  is  a 
collection  of  horizontal  cuts. 

The  structure  of  T  satisfies  an  invariant  that  if  a  face 
/  spans  a  trapezoid,  then  either  it  has  a  spanning  edge 
e  of  the  subdivision  on  its  top  or  bottom  boundary  or 
it  is  split  into  two  by  a  pseudo-cut.  In  either  case,  the 
face  /  has  a  bounding  spanning  edge  if  it  spans  r.  We 
say  that  a  face  /  or  an  edge  e  of  the  subdivision  covers 
a  trapezoid  r  if  it  spans  r  horizontally  and  it  does  not 
span  any  ancestor  of  r  in  T.  The  structure  of  T  has  the 
property  that  any  face  or  edge  covers  at  most  O(logn) 
trapezoids  and  also  each  face  or  edge  covers  at  most 
two  nodes  at  any  level  of  T.  These  properties  follow 
easily  from  segment  tree  like  arguments  [31]. 

We  now  describe  the  point-location  query  algo¬ 
rithm  for  our  data  structure.  Consider  the  operation 
query{T,x,y),  where  x  and  y  represent  the  coordinates 
of  the  query  point  and  r  is  a  current  trapezoid  in  the 
subdivision  (which  represents  a  node  in  our  primary 
data  structure).  We  alternately  make  comparisons  with 
nodes  in  the  primary  and  secondary  data  structures. 
In  the  primary  data  structure  (triangular  nodes  rep¬ 
resenting  trapezoids),  we  compare  the  x  value  of  the 
point  against  the  x  value  of  the  vertical  cut  at  t.  This 
identifies  the  left  or  right  secondary  data  structure  of  r 
containing  the  query  point.  We  then  use  the  secondary 
data  structure  to  identify  a  trapezoid  containing  the 
query  point  among  the  several  trapezoids  separated  by 
horizontal  cuts.  In  the  secondary  data  structure  i.e., 
in  the  biased  finger  tree  stored  in  the  trapezoid  r,  we 
compare  the  {x,y)  value  of  the  point  against  the  sup- 
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Figure  3:  Trapezoidal  Decomposition  of  a  Convex  Subdivision  with  Pseudo-Edge  Cuts.  The  triangular  nodes 
denote  vertical  cuts  and  the  circular  nodes  denote  horizontal  cuts. 


porting  line  of  the  spanning  edge  or  pseudo-cut  ® .  This 
identifies  a  leaf  node,  say  q,  in  the  biased  finger  tree 
that  represents  a  trapezoid,  say  r,,  in  the  primary  data 
structure.  We  now  recursively  locate  the  point  by  call¬ 
ing  query  {Tg,x,y). 

The  arguments  in  the  previous  section  on  the  struc¬ 
ture  of  our  data  structure  imply  that,  starting  from  the 
root  of  T,  we  can  perform  the  point  location  query  in 
0(logu;(r)  +  depth{T))  time  in  the  worst  case,  where 
r  denotes  the  root  of  T.  This  is  because  the  times  to 
perform  the  biased  merge  tree  queries  down  a  path  in 
T  form  a  telescoping  sum  that  is  0{logw{r)).  Noting 
that  w{r)  is  O(nlogn)  [35]  and  depth{T)  is  O(logn) 
(since  our  primary  data  structure  is  kept  balanced) 
gives  us  the  desired  result  that  a  point  location  query 
takes  (9(logn)  time. 

Our  method  for  updating  this  structure  is  rather  in¬ 
volved;  hence,  for  space  reasons,  we  can  only  sketch 
the  main  ideas  in  this  extended  abstract.  In  the  case 
of  an  insertion  of  an  edge  e  into  a  face  /  our  method 
traverses  down  the  tree  for  the  endpoints  of  e.  At  each 
node  that  /  covers  and  such  that  e  cuts  so  that  the  new 
faces  no  longer  cover  we  must  perform  0(1)  join  oper¬ 
ations.  Since  there  are  O(logn)  such  nodes,  the  total 
time  for  these  updates,  then,  is  O(logn).  In  addition, 
there  are  also  O(logn)  places  in  the  tree  where  we  must 
insert  this  new  edge,  each  of  which  can  be  implemented 
in  0(1)  time,  given  a  pointer  to  the  location  for  this  in¬ 
sertion  (which  we  obtain  from  the  A  list  for  /).  Thus, 
the  total  time  for  an  insertion  is  O(logn).  The  case  of 

®When  we  use  the  query  algorithm  to  locate  edges,  if  the  edge 
spans  the  trapezoid  t  then  we  compare  the  y- value  of  the  point 
of  intersection  of  the  edge  with  the  left  boundary  of  t  against  the 
y- values  of  the  points  of  intersections  of  the  horizontal  cuts  with 
the  left  boundary  of  r. 


a  deletion  is  essentially  the  reverse  of  the  above  opera¬ 
tions.  This  operation  runs  in  0(log^  n)  time,  however, 
since  we  now  may  have  to  perform  O(logn)  split  op¬ 
erations  (and  their  complexity,  unfortunately,  does  not 
form  a  telescoping  sum) . 

4.2  Rebalancing  the  Primary  Structure 

In  this  section  we  show  how  to  relax  the  constraint  that 
the  endpoints  have  x  coordinates  in  the  range  [1,  nj.  We 
use  a  S5[a]-tree  as  a  primary  tree  for  vertical  cuts  with 
biased  finger  tree  as  a  secondary  structure  in  each  node. 
We  briefly  review  the  properties  of  BB[a]-tree.  Let  /(/) 
denote  the  time  to  update  the  secondary  structures  af¬ 
ter  a  rotation  at  a  node  whose  subtree  has  I  leaves. 
Also,  assume  that  we  perform  a  sequence  of  n  update 
operations,  each  an  insertion  or  a  deletion,  into  an  ini¬ 
tially  empty  BB[a]-txee.  Now,  we  have  the  following 
times  for  rebalancing  [30]: 

•  If  f{l)  =  0{l  log"^  Z),  with  c  >  0,  then  the  rebalanc¬ 
ing  time  for  an  update  operation  is  ©(log"^"^^  n). 

•  If  /(Z)  =  0(1°'),  with  a  <  1,  then  the  rebalancing 
time  for  an  update  operation  is  0(1). 

In  our  case,  we  show  /(n)  =  0{n)  and  so  the  rebal¬ 
ancing  cost  is  O(logn).  We  give  the  details  in  the  full 
version. 

Thus  we  show. 

Theorem  4.1  Given  a  convex  subdivision  S  of  n  ver¬ 
tices,  there  exists  a  data  structure  which  allows  point 
location  queries  in  O(logn)  time,  vertex/edge  insertion 
in  O(logn)  time,  and  vertex/edge  deletion  in  O(log^n) 
time. 
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a.  A  Spanning  Face  Without  any  Pseudo-cut  b.  A  Spanning  Face  With  a  Pseudo-cut 

Figure  4:  Pseudo-Cut  for  a  staircase  spanning  face. 


4.3  Dynamic  Point  Location  in  Stair¬ 
case  Subdivisions 

In  this  section,  we  construct  an  O(logn)  depth  trape¬ 
zoidal  structure  for  staircase  subsubdivisions  similar  to 
the  one  for  convex  subdivision,  which  allows  us  to  per¬ 
form  query  in  O(logn)  time,  and  updates  in  O(logn) 
time.  We  always  require  that  deletion  of  an  edges  (ver¬ 
tices)  should  always  be  accompanied  by  an  insertion  of 
an  edge  (vertices). 

The  trapezoidal  structure  for  staircase  subdivision 
satisfies  a  slightly  different  invariant  from  that  of  con¬ 
vex  subdivision.  Here  we  use  an  invariant  that  if  a  face 
/  spans  a  trapezoid  r  and  /  can  be  split  into  two  faces 
by  a  (truly)  horizontal  spanning  segment,  say  e,  of  r, 
then  we  split  r  into  two  trapezoids  using  e.  So,  unlike 
convex  subdivision,  not  every  spanning  face  of  r  will 
have  a  pseudo-cut  (see  Figures  4. a  and  4.b)  here.  Us¬ 
ing  this  invariant,  we  can  easily  do  an  analysis  similar 
to  that  of  convex  subdivision  to  show  that  the  insert  op¬ 
eration  in  a  staircase  subdivision  takes  takes  O(logn) 
time.  The  details  are  omitted. 

Using  this  invariant,  we  can  easily  do  an  analysis 
similar  to  that  of  convex  subdivision  to  show  that  the 
insert  operation  in  a  staircase  subdivision  takes  takes 
(5(logn)  time.  The  details  are  omitted.  We  observe 
that,  each  edge  deleted  is  always  bounded  on  the  top 
and  on  the  right  by  “long”  edges.  This  implies  that  we 
already  have  a  horizontal  spanning  pseudo  edge  for  the 
face  resulting  after  deletion,  and  hence  deletion  of  an 
edge  does  not  require  an  introduction  of  a  new  pseudo¬ 
cut  into  the  subdivision.  This  eliminates  the  costlier 
case  of  deletion  which  takes  0(log^  n),  and  all  the  other 
cases  take  O(logn)  time.  We  give  details  in  full  ver¬ 
sion.  Thus,  this  observation  results  in  a  O(logn)  time 
method  for  deletion  of  an  edge  in  the  staircase  subdi¬ 
vision. 

Thus  we  show. 


Theorem  4.2  Given  a  staircase  subdivision  S  of  n 
vertices,  there  exists  a  data  structure  which  allows  point 
location  queries  in  O(logn)  time,  edge  (vertex)  inser¬ 
tion  in  O(logn)  time,  and  edge  (vertex)  deletion  in 
O(logn)  time  (the  deletions  are  always  coupled  with  in¬ 
sertions,  however). 

The  edge  insertion  and  deletions  performed  on  stair¬ 
case  subdivision  in  our  three-dimensional  layers  of  max¬ 
ima  algorithm  satisfy  the  constraints  specified  in  The¬ 
orem  4.2.  Therefore  we  have, 

Theorem  4.3  Given  a  set  S  of  n  points  in  5R^,  one 
can  construct  the  layers  of  maxima  for  S  in  0(n  log n) 
time,  which  is  optimal. 
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Abstract 

This  paper  gives  an  algorithm  for  approximately  solv¬ 
ing  the  post  office  problem:  given  n  points  (called 
sites)  in  d  dimensions,  build  a  data  structure  so  that, 
given  a  query  point  q,  a  closest  site  to  q  can  be  found 
quickly.  The  algorithm  is  also  given  a  relative  error 
bound  e,  and  depends  on  a  ratio  p,  which  is  no  more 
than  the  ratio  of  the  distance  between  the  farthest 
pair  of  sites  to  the  distance  between  the  closest  pair 
of  sites.  The  algorithm  builds  a  data  structure  of 
size  in  time  0{n‘^Tj)0{l/e)^^~^\ 

Here  rj  =  log(p/e).  With  this  data  structure,  a  site  is 
returned  whose  distance  to  a  query  point  q  is  within 
1  +  e  of  the  distance  of  the  closest  site.  A  query  needs 
0(log time,  with  high  probability. 

1  Introduction 

The  post-office  problem  is  the  following:  given  a  set 
5  of  n  points  (called  sites)  in  d  dimensions,  build 
a  data  structure  so  that  given  a  query  point  q,  the 
closest  site  to  q  can  be  found  quickly.  Many  data 
structures  have  been  proposed  for  this  problem;  their 
query  times  generally  are  faster  than  the  trivial  0{n), 
and  often  are  O(logu),  as  n  ^  oo.  The  dependence 
of  the  query  time  on  the  dimension  d  is  generally 
very  steep  for  the  nontrivial  algorithms,  at  least 
Even  heuristic  algorithms  that  are  fast  in  practice, 
such  as  bucketing  and  kd-trees,  also  have  this  expo¬ 
nential  dependence.  This  is  unfortunate,  since  many 
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of  the  most  interesting  potential  applications  have  d 
at  least  10,  and  often  much  greater.  Thus  the  promise 
of  a  query  time  of  0(log  n)  is  crushed  by  the  “curse 
of  dimensionality.” 

This  paper  gives  a  modest  but  significant  improve¬ 
ment  in  the  dimension  dependence  of  one  recent  algo¬ 
rithm,  that  of  Arya  and  Mount. [AM93]  They  attack 
the  problem  of  approximate  solution  to  the  post-office 
problem;  their  procedure  returns  an  e-c/osesf  site,  one 
whose  distance  to  the  query  point  q  is  within  1  -f  e  of 
closest.  Here  e  >  0  is  input  when  their  data  structure 
is  built.  The  algorithm  given  here  takes  their  ap¬ 
proach  to  what  is  arguably  its  logical  conclusion,  re¬ 
duces  the  query  time  from  0(log®  n)  to  0(log  n),^  and 
reduces  the  “constant”  factors  for  the  dimension  from 
0{l/eY  to  about  0(l/e)(‘'“i)/2.  (The  latter  reduc¬ 
tion  applies  to  the  storage  and  query-time  bounds.) 
The  new  algorithm  has  an  additional  factor  in  storage 
and  preprocessing  time  of  log  p,  where  p  is  bounded 
above  by  the  ratio  of  the  distance  between  the  sites 
farthest  apart  to  the  distance  of  the  sites  closest  to¬ 
gether.  (In  fact  p  is  roughly  the  maximum,  over  all 
sites  s,  of  the  ratio  of  the  distance  to  s  to  its  farthest 
Delaunay  neighbor,  to  the  distance  of  s  to  its  closest 
Delaunay  neighbor.  Hence  logp  is  negligible  relative 
to  other  factors.) 

Arya  et  al.  have  more  recently  described  a  dif¬ 
ferent  approximation  algorithm,  based  on  quadtree 
techniques,  that  has  better  storage  and  preprocess¬ 
ing  bounds  than  those  given  here,  but  with  a  query 
time  that  has  a  dependence. [AMN+94] 

The  new  algorithm  uses  a  technique  previously 
used  for  polytope  approximation  [Cla93]  in  order  to 
approximate  a  certain  Voronoi  region  for  each  site  by 
a  simpler  Voronoi  region  for  the  site.  The  approxi¬ 
mation  problem  is  solved  using  randomization.  The 
algorithm  also  builds  a  data  structure  similar  to  a 
skip  list,  and  uses  randomization  for  that.[Pug90] 
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The  following  section  first  describes  the  general  ap¬ 
proach,  and  then  develops  that  approach  in  the  fol¬ 
lowing  subsections. 

2  The  algorithm 

Following  Arya  and  Mount,  the  general  idea  is  to  find, 
for  each  site  s,  a  list  of  sites  N,  with  the  following 
property:  if  s  is  not  the  closest  site  to  the  query  point 
q,  then  there  is  a  site  in  N,  closer  to  q  than  s.  With 
this  property,  a  simple  search  procedure  will  lead  to 
the  closest  site:  pick  any  site  s;  if  a  site  t  E  N,  is 
closer  to  g,  assign  <  to  s  and  repeat;  otherwise  return 
s  as  closest. 

This  approach  is  not  so  interesting  just  yet.  The 
list  N,  must  be  the  set  of  Delaunay  neighbors  of  s,  as 
the  interested  reader  can  easily  show.  This  makes  for 
a  space  requirement  of  in  the  worst  case,  for 

d  >  2.  Also,  the  query  time  is  Q(n)  in  the  worst  Ccise: 
there  is  no  speedup  over  the  obvious  algorithm.  (For 
uniformly  distributed  points,  the  query  time  is  more 
like  so  this  approach  is  not  entirely  useless, 

however.) 

For  more  interesting  results,  make  the  problem  eas¬ 
ier:  instead  of  the  closest  site,  find  an  e-closest  site. 
(Again,  such  a  site  heis  a  distance  to  the  query  point 
that  is  within  1  -l-e  of  the  distance  of  the  closest  site’s, 
for  £  >  0.)  This  is  the  approximate  query  problem 
solved  by  Arya  and  Mount.  They  used  a  collection 
of  narrow  cones  to  obtain  their  lists,  in  a  way  similar 
to  Yao’s  use  of  them  for  finding  minimum  spanning 
trees[Yao82].  Here  the  approach  is  to  go  from  the  de¬ 
sired  conditions  on  the  lists  to  a  problem  similar  to 
polytope  approximation. 

The  modified  construction  begins  as  follows.  For 
each  site  s,  consider  a  list  L,  with  the  following  prop¬ 
erty:  for  any  q,  if  there  is  b  E  S  with 

d{q,s)  >  {l  +  €)d{q,b), 
then  there  is  b'  E  L,  with 

d{q,s)  >  (1  -b£')d(g,6'), 

where  e'  =  e/2.  (Note  that  if  e  =  0,  then  Lj  =  N,.) 
Using  such  lists,  the  search  procedure  starts  at  any 
site  s.  If  there  is  t  E  Ls  with  d(g,  s)  >  (1  -|-  e')d{q,  t), 
then  assign  t  to  s  and  repeat.  Otherwise,  return  s. 
With  Lg  as  defined,  the  returned  s  is  e-closest.  Note 
that  the  procedure  makes  progress  at  each  step:  the 
distance  of  the  current  site  decreases  by  1/(1  -f-  e'). 

Consider  the  condition  satisfied  by  L,  in  a  contra¬ 
positive  way.  Fixing  s  E  S,  let  Afe{S)  be  defined  by 

AfciS)  =  {g  I  d{q,  s)  <  (1  -b  e)d{q,  b)  for  all  b  E  S}, 


so  that 

Nt’{Lg)  =  {g  I  d{q,s)  <  {l  +  e')d{q,b)  for  all  6  G  LJ. 

The  condition  on  L,  is  equivalent  to  M(.i{Lg)  C 
Nt{S).  The  set  A// (5)  is  the  Voronoi  region  of  s  in 
a  certain  multiplicatively  weighted  Voronoi  diagram. 
The  Lg  we  want  is  a  small  one  that  such  that 
is  inside  A/ (5);  the  problem  of  finding  such  an  L,  can 
be  solved  by  techniques  previously  applied  to  poly¬ 
tope  approximation. [Cla93] 

The  following  subsection  discusses  the  problem  of 
finding  L,;  §2.2  bounds  the  size  of  such  a  list;  §2.3 
shows  how  to  use  the  lists  to  obtain  fast  query  times. 
Finally,  §3  makes  some  concluding  remarks. 

2.1  Finding  Lg 

To  find  Ls ,  we’ll  translate  the  problem  to  d  +  1  di¬ 
mensions  using  standard  “lifting  map”  techniques. 

If  we  put  s  at  the  origin,  the  region  is  the 

intersection  of  all  regions  of  the  form 

{z\z^  <  (l  +  e)2(z-6)^}, 

where  b  E  S.  The  condition  here  is  z^/(H-£)^  <  (^- 
6)^,  or  az^  >  2b'Z  —  b^,  where  a  =  1  — l/(l-|-£)^  2e. 

Now  let  {z,y)  denote  a  point  in  with  z  E 

and  y  E  R-  We  have 

^fs{S)  =  {z  \  ay  >  2b  ■  z  -  b^  and  y  =  z^}. 

So  for  b  E  S,  let  denote  the  halfspace 

ne,b  =  [{z>y)  I  ay  >  26  -  z  -  6^}, 

and  'Pc{S)  denote  the  poly  tope  which  is  the  intersec¬ 
tion  of  such  halfspaces, 

v,{s) = n 

bes 

Then 

K{S)  =  {z  I  (z,  y)  E  VeiS)  and  y  =  z^}. 

Let  'F  =  {(z,  y)  I  y  >  z^}.  Then  for  Mt'{Ls)  C  VeiS), 
it  is  enough  that 

T’c-(L,)n’F  C'P£(5)n’F.  (1) 

The  problem  of  finding  L,  satisfying  this  con¬ 
dition  can  be  solved  using  techniques  as  for  poly¬ 
tope  approximation[Cla93],  which  are  very  simi¬ 
lar  to  techniques  described  long  ago  for  linear 
programming[Cla88].  The  general  idea  is  this:  give 
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each  site  an  integral  weight,  initially  one.  Take  a  ran¬ 
dom  subset  R  of  S,  choosing  each  site  with  probabil¬ 
ity  proportional  to  its  weight,  and  making  R  about 
the  right  size.  Test  if  R  satisfies  (1);  if  so,  return 
it.  Otherwise,  we  can  derive  from  the  testing  of  R 
a  subset  of  S  that  we  can  expect  to  be  small,  and 
that  must  contain  a  site  of  an  appropriate  L,.  Dou¬ 
ble  the  weights  of  the  sites  in  that  subset.  Repeat 
this  process  until  done. 

Here  are  a  few  more  details  of  the  algorithm:  fix 
some  optimal  L  satisfying  (1),  where  the  size  c  of  C 
is  as  small  as  possible.  Assume  for  the  moment  that 
c  is  known.  (By  using  exponentially  increasing  esti¬ 
mates  of  c,  we  lose  only  a  small  factor  in  the  size  of 
the  returned  set,  and  in  running  time.)  Give  each 
p  £  S  a.  weight,  initially  equal  to  one.  Choose  a  ran¬ 
dom  subset  R  C  S  of  size  cCidlogc,  where  Ci  is  a 
small  constant,  choosing  a  site  with  probability  pro¬ 
portional  to  its  weight.  For  each  b  £  S,  check  that 

C7fe,6.  (2) 

This  is  a  convex  programming  problem,  and  as  shown 
by  Adler  and  Shamir,  it  is  solvable  in  0{n)  ex¬ 
pected  time  using  a  randomized  procedure  similar  to 
one  for  linear  programming. [AS90]  (The  base  case, 
with  0{d^)  constraints,  can  be  solved  in  polynomial 
time.[Vai89])  If  all  6  e  5  satisfy  (2),  then  return  R  as 
L,.  Suppose  the  condition  fails  for  some  b  £  S.  Then 
there  will  be  some  vertex  v  of  Ve'iR)  n  ^  not  in 
such  a  vertex  can  be  returned  as  part  of  the  output 
of  the  convex  programming  algorithm.  Find  all  the 
halfspaces  that  do  not  contain  v,  for  b  £  S.  This 
set  V  of  halfspaces  will  have  relatively  few  members, 
with  high  probability,  and  will  contain  a  member  of 
the  optimal  set  C.  If  the  size  of  set  V  is  less  than 
C2dn/r,  then  double  the  weights  of  the  correspond¬ 
ing  sites.  (Here  ^2  is  another  small  constant.)  Repeat 
this  procedure  until  a  list  Lg  is  returned. 

An  analysis  of  this  procedure  appears  in  [Cla93], 
and  will  appear  in  the  full  paper. 

2.2  The  size  of  Ls 

This  algorithm  returns  a  set  Lg  of  size  within 
0((ilogc)  of  the  best  possible  size  c,  as  can  be  shown 
as  in  [Cla93].  How  large  can  c  be?  We  have  the  fol¬ 
lowing  bound,  which  this  subsection  will  prove. 

Theorem  2.1  The  optimal  size  c  of  Lg  is 
0(l/e)(^-i)/2dlog(p/e). 

We’ll  need  the  following  lemma,  due  to 
Dudley. [Dud74] 


Lemma  2.2  Lei  P  C  R‘^  be  compact  and  convex,  and 
contained  in  a  ball  of  radius  1.  For  e  with  0  <  e  <  1, 
there  is  a  convex  polytope  P'  £)  P  with 
facets,  and  with  P'  within  Haussdorf  distance  e  of  P. 

We’ll  apply  this  lemma  to  bound  the  size  of  L, .  To 
do  so,  split  into  slabs 

=  {(^.2/)  \di  <y  <  di+i}, 

for  i  =  0...m,  where  ^/d^  =  (1/2)  minjgs  ||6||,  and 
di  =  3c/,_i/2,  and  m  is  large  enough  that  y/dff,  > 
(2/a) maxjgs  ll&ll-  We  have  m  =  logO{p/a).  The 
following  lemma  implies  that  only  the  points  of  these 
slabs  need  be  considered. 

Lemma  2.3  Every  halfspace  7ie,b  contains  all  points 
not  in  some 

Proof.  We  need  to  show  that  y  >  z'^  and  either 
y  >  dm  OT  y  <  do  imply  for  any  b  £  S.  Since  b  ■  z 
is  maximized  for  given  ||2||  when  z  =  6||z||/||6||,  when 
showing  that  (z,y)  £  Hc^b  we  can  assume  z  =  yb 
for  some  7.  If  z^  >  dm,  the  minimum  value  of  y 
to  consider  is  z^,  so  it’s  enough  to  show  that  az^  > 
2b  ■  z  —  b^,  or  ay^b'^  >  2yb'^  —  b^.  This  is  implied 
by  7  >  (2/a).  If  z^  <  dm,  then  similarly  we  need 
ay  >  4b^la  >  2yb'^  —  6^,  which  holds  for  7  <  2/a. 
When  y  <  do,  we  have  <  y  <  6^/4,  and  so  we  need 
az^  >  276^  -  6^,  where  7  <  1/2.  Since  az^  >  0,  the 
result  follows.  □ 

Consider  a  given  slab  .  We  have 

5 

adi>-a'di+i,  (3) 

for  sufficiently  small  e.  Consider  the  sets 

=  f]{iz,y)\a'di+,>2b-z-b^}, 
bes 

and 

P’’  =  Pi  {(-z,  y)  I  adj  >  26  •  z  -  6^}. 

b£S 

Then  (considering  only  points  in  ^,) 

V,>(S)  CV^  CV^  C  Pc(S). 

Moreover,  condition  (3)  implies  that  there  is  a  gap 
between  and  that  we’ll  use  shortly.  Now  let 
ball 

B'  =  {z\z^<  di+i}, 

so  that  all  points  in  $,■  have  z  projections  in  B' ,  and 
let 

B  =  {z  I  z^  <  (1  -p  2f)^d,.(,i}. 
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Let  Vp  =  {z  \  {z,y)  G  'P“},  and  similarly  define  Vp. 
Then  Vp  n  B'  CVpOB,  and  by  (3),  every  point  of 
the  former  is  at  least 

a'di+i/4 

m\ 

from  any  point  not  in  the  latter.  If  a  plane  7it',b  does 
not  contain  and  so  is  relevant  here,  then  as  in  the 
proof  of  Lemma  2.3,  we  have  di+i  >  6^/4.  This  says 
that  gap  between  Vp^B'  and  VpDB  is  proportional  to 
e^/di+i,  and  so  by  appropriate  scaling  we  can  apply 
Lemma  2.2  to  find  a  polytope  Vp  such  that 

v^  n  J5'  c  v;  c  :Pp  n  5 

and  has  facets.  This  implies  that 

there  is  a  poly  tope  in  of  the  same  complexity, 

such  that  relative  to  'I';, 

VAS)  CV^  CV,{S). 

It  not  hard  to  show  that  there  is  a  “coarsening”  poly¬ 
tope  Ve'{Ll),  with  at  most  d  times  as  many  facets  as 
■P',  such  that  Vf,i{L\)  n  C  Vc(S). 

By  putting  the  lists  Lj  together  into  Lj ,  we  obtain 
the  size  bound  of  the  theorem  of  this  subsection. 

2.3  Solving  closest-point  problems 

How  can  a  fast  query  time  be  obtained  using  the  lists 
Lj?  Just  as  with  Arya  and  Mount’s  work,  a  skip- 
list  approach  is  helpful. [Pug90]  Choose  a  family  of 
subsets  of  5  as  follows:  let  Ro  =  S;  to  obtain  Rj+i 
from  Rj,  pick  each  element  of  Rj  to  be  in  Rj  +  i  with 
probability  1/2.  Repeat  this  process  until  an  empty 
Rk  is  obtained.  If  s  G  Rj  but  not  Rj  +  i,  say  that  s 
has  level  j.  Construct  the  lists  L,j  for  each  Rj,  and 
so  s  G  S’  has  lists  for  each  subset  up  to  its  level.  To 
answer  a  query,  start  with  some  s  G  Rk-i,  and  find 
the  f-closest  site  tk-i  in  Rk-i  using  the  lists  Ls^k-i- 
Now  find  an  c-closest  site  tk-2  in  Rk-2,  starting  with 
tk-i-  Repeat  until  to  is  found,  and  return  to  as  an 
f-closest  site  in  S. 

The  correctness  of  this  procedure  should  be 
clear.  How  much  time  does  it  take?  Since  each 
list  is  bounded  in  size  by  0(dc  log  c),  where  c  is 
0(l/e)(‘^“^)/^dlog(p/e),  the  query  time  is  equal  to 
0(dc  log  c)  times  the  number  of  sites  visited  in  the 
procedure. 

It  is  worthwhile  to  compare  this  procedure  with 
one  that  finds  the  closest  site  in  Rj  at  stage  j,  not 
just  the  f-closest.  Suppose  we  have  tj  as  the  f-closest 
at  some  stage,  but  indeed  a  site  t  is  closest  in  Rj. 
When  finding  the  f-closest  in  Rj+i,  the  approximate 


procedure  will  in  two  steps  find  a  site  t'  in  Rj+i  such 
that  d{q,t')  <  d{q,t)/{l  +  e')'^.  (Here  we  assume  that 
the  search  in  Rj+i  takes  at  least  two  steps.)  Since 
(1  -t-  f')^  >  (1  +  e)  for  f  >  0,  we  know  that  t'  is  closer 
to  q  than  t.  The  number  of  sites  visited  at  stage 
j  -f-  1  for  the  exact  procedure  is  proportional  to  the 
number  of  sites  of  Rj+i  closer  to  q  than  t;  hence  the 
number  of  sites  visited  for  the  approximate  procedure 
in  Rj+i  is  no  more  than  2  plus  the  number  for  the 
exact  procedure. 

To  analyze  the  exact  search  procedure,  we  can 
follow  Sen’s  analysis  of  skip  lists. [Sen]  Look  at  the 
search  procedure  “backwards”:  starting  at  the  clos¬ 
est  site  to  q  in  Rj,  visit  sites  in  Rj  in  order  of  in¬ 
creasing  distance,  until  a  site  also  in  Rj+i  is  encoun¬ 
tered.  Call  this  a  level  jump.  Once  the  level  jump 
occurs,  only  sites  in  Rj  +  i  are  visited.  The  proba¬ 
bility  of  a  level  jump  at  a  given  visited  node  is  1/2. 
Thus  the  probability  that  at  least  k  level  jumps  occur 
in  V  node  visits  is  the  probability  that  a  binomially 
distributed  random  variable  hfis  at  least  k  successes 
in  V  trials.  The  query  time  can  be  greater  than  V 
only  if  either  the  number  of  level  jumps  exceeds  K 
or  if  fewer  than  K  level  jumps  occur  in  V  attempts; 
the  former  probability  is  no  more  than  n/2^,  which 
we’ll  need  less  than  some  probability  Pi.  This  im¬ 
plies  K  >  lg(n/Pi).  The  probability  of  fewer  than  K 
level  jumps  in  V  trials  can  be  bounded  using  Cher- 
noff  bounds  for  the  binomial;  letting  7  =  2K/V, 
it  is  exp(-7(l  -  7)^/2).  The  probability  that  the 
query  time  exceeds  21g(n/Pi)/7  for  a  given  point  q 
is  therefore  at  most  Pi  -f  exp(-lg(n/Pi)(l  -  7)^/7)- 
Hence,  setting  Pi  =  l/rA,  an  0{Q)\ogn  query  time 
is  achievable  with  failure  probability  0(1/71*^). 

This  analysis  applies  only  to  a  single  given  point 
q-,  what  about  arbitrary  points?  As  with  similar  situ¬ 
ations  in  randomized  geometric  algorithms,  a  good 
query  time  holds  for  all  points  because  there  are 
combinatorially  distinct  classes  of  points.  That 
is,  in  an  exact  search  algorithm,  two  points  91  and 
92  will  have  the  same  sequence  of  visited  sites,  and 
so  the  same  query  time,  if  the  distance  order  on  the 
sites  induced  by  the  two  points  is  the  same.  In  other 
words,  whether  we  sort  the  sites  in  order  of  distance 
from  9i,  or  sort  them  in  order  of  distance  from  92, 
we  get  the  same  sorted  order.  How  many  classes  of 
points  are  distinct  in  this  way?  Let  B  be  the  set 
of  (2)  perpendicular  bisector  hyperplanes  of  pairs  of 
sites,  and  let  A{B)  be  the  subdivision  of  R‘^  induced 
by  those  bisectors.  Then  all  points  in  one  cell  (block) 
of  A{B)  induce  the  same  distance  orders,  and  so  have 
the  same  query  time.  The  number  of  cells  of  A{B) 
is  ((=^)  <  rP^.  Thus  a  query  time  for  any  point  of 
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O(logn)  occurs  with  probability  1  — 

Queries  can  be  made  a  bit  faster  by  splitting  up 
the  each  list  L,  j  into  lists  where  the  superscript 

corresponds  to  the  slabs  in  §2.2.  When  searching 
for  a  given  site  s  at  a  given  stage  j,  the  list  j  with 
i  =  21gd(s,g)  can  be  used.  This  gives  a  query  time 
independent  of  p. 

3  Concluding  Remarks 

It  should  be  possible  to  have  an  algorithm  that  is 
polynomial  in  d,  by  making  e  a  sufficiently  large  con¬ 
stant,  as  in  Bern’s  note.[Ber93] 
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New  techniques  for  exact  and  approximate  dynamic  closest-point 

problems 


Sanjiv  Kapoor*  Michiel  Smid^ 


Abstract 

Let  5  be  a  set  of  n  points  in  IR^.  It  is  shown 
that  a  range  tree  can  be  used  to  find  an  Loo" 
nearest  neighbor  in  S  of  any  query  point,  in 
0((logn)-^“^  loglogn)  time.  This  data  structure 
has  size  0(n(logn)^“^)  and  an  amortized  update 
time  of  ©((logn)-^”^  loglogn).  This  result  is  used 
to  solve  the  (1  +  e)-approximate  L2-nearest  neigh¬ 
bor  problem  within  the  same  bounds.  In  this  prob¬ 
lem,  for  any  query  point  p,  a  point  q  £  S  is  com¬ 
puted  such  that  the  euclidean  distance  between  p 
and  q  is  at  most  (1-1-e)  times  the  euclidean  distance 
between  p  and  its  true  nearest  neighbor.  This  is  the 
first  dynamic  data  structure  for  this  problem  hav¬ 
ing  close  to  linear  size  and  poly  logarithmic  query 
and  update  times. 

New  dynamic  data  structures  are  given  that 
maintain  a  closest  pair  of  S.  For  L>  >  3,  a 
structure  of  size  0(n)  is  presented  with  amortized 
update  time  0((logn)^~^  loglogn).  For  D  =  2 
and  any  non-negative  integer  constant  fc,  struc- 
tmres  of  size  C)(nlogn/(loglogn)*')  (resp.  0(n)) 
are  presented  having  an  amortized  update  time  of 
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O (log n loglogn)  (resp.  O((logn)^/(loglogn)*=)). 
Previously,  no  deterministic  linear  size  data  struc¬ 
ture  having  polylogarithmic  update  time  was 
known  for  this  problem. 

1  Introduction 

Closest-point  problems  are  among  the  basic  prob¬ 
lems  in  computational  geometry.  In  such  problems, 
a  set  5  of  n  points  in  is  given  and  we  have  to 
store  it  in  a  data  structure  such  that  a  point  in 
S  nearest  to  a  query  point  can  be  computed  effi¬ 
ciently,  or  we  have  to  compute  a  closest  pair  in  5,  or 
for  each  point  in  S  another  point  in  5  that  is  closest 
to  it.  Distances  are  measured  in  the  Lt-metric,  for 
a  fixed  t,  1  <  t  <  00.  These  problems  £ire  known  as 
the  nearest-neighbor  problem,  the  closest  pair  prob¬ 
lem,  and  the  all-nearest-neighbors-problem.  In  the 
dynamic  version,  the  set  S  is  changed  by  insertions 
and  deletions  of  points. 

The  planar  version  of  the  nearest-neighbor  prob¬ 
lem  can  be  solved  optimally,  i.e.,  with  O(logn) 
query  time  using  0[n)  space,  by  means  of  Voronoi 
diagrams.  In  higher  dimensions,  however,  the  sit¬ 
uation  is  much  worse.  The  best  results  known  are 
due  to  Clarkson  [3]  and  Arya  et  cd.[l].  In  [3],  a 
randomized  data  structure  is  given  that  finds  a 
nearest-neighbor  of  a  query  point  in  O(logn)  ex¬ 
pected  time.  This  structure  has  size  0{n^^l‘^^^^), 
where  8  is  an  arbitrarily  small  positive  constant. 
In  [1],  the  problem  is  solved  with  an  expected 
query  time  of  0(n^“^/r('^+^)/'^l(logn)'^(^))  using 
0(n loglogn)  space.  It  seems  that  in  higher  di¬ 
mensions  it  is  impossible  to  obtain  polylogarithmic 
query  time  using  0(n(logn)°(^))  space.  Moreover, 
even  in  the  planar  case  there  is  no  dynamic  data 
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structure  known  that  has  poly  logarithmic  query 
and  update  times  and  that  uses  0(n(logn)‘^(^)) 
space. 

Therefore,  it  is  natural  to  ask  whether  the  ap¬ 
proximate  nearest-neighbor  problem  allows  more  ef¬ 
ficient  solutions.  Let  e  >  0.  A  point  q  E  S  is  called 
a  (1  -1-  €)-approximate  neighbor  of  a  point  p  £  IR-^, 
if  dt{p,q)  <  (1  +  e)dt{p,p*),  where  p*  £  S  is  the 
true  nearest-neighbor  of  p. 

This  approximate  neighbor  problem  was  consid¬ 
ered  in  Arya  et  al.[l,  2].  In  the  latter  paper,  the 
problem  is  solved  optimally:  They  give  a  deter¬ 
ministic  data  structure  of  size  0(n)  that  can  find 
a  (1  -f  e)-approximate  neighbor  in  O(logn)  time, 
for  any  positive  constant  e.  At  this  moment,  how¬ 
ever,  no  dynamic  data  structures  are  known  for 
this  problem.  In  this  paper,  we  prove  the  following 
results. 

Theorem  1  Let  S  be  a  set  of  n  points  in 
IR'^ .  There  exists  a  dynamic  data  struc¬ 
ture  of  size  0(n(logn)'^~^)  that,  given  a  query 
point  p  £  IR'^,  finds  an  L^o -neighbor  of  p 

in  S  in  0((logn)'®”^  loglogn)  time.  This 
data  structure  has  an  amortized  update  time  of 
0((logn)^“Moglog  n). 

Theorem  2  Let  S  be  a  set  of  n  points  in  IR'^  and 
let  e  be  a  positive  constant.  There  exists  a  dynamic 
data  structure  of  size  0(n(log  that,  given  a 

query  point  p  £  IR"^,  finds  a  {l-\-e)-approximate  L2- 
neighbor  of  p  in  S  in  O  ((log  log  log  n)  time. 

This  data  structure  has  an  amortized  update  time 
of  0 {{log  n)^~^  log  logn). 

We  also  consider  the  dynamic  closest  pair  prob¬ 
lem,  in  which  we  have  to  maintain  a  closest  pair 
under  insertions  and  deletions.  This  problem  has 
been  investigated  oidy  recently.  In  Table  1,  we  give 
an  overview  of  the  currently  best  known  data  struc¬ 
tures  for  maintaining  a  closest  pair  under  insertions 
and/or  deletions  of  points.  For  an  up-to-date  sur¬ 
vey,  we  refer  the  reader  to  Schwarz’s  Ph.D.  The¬ 
sis  [9]. 

Note  that  aU  data  structures  of  Table  1  are  deter¬ 
ministic  and  can  be  implemented  in  the  algebraic 
computation  tree  model.  K  we  add  randomization 
to  this  model,  then  there  is  a  data  structure  of  size 
0{n)  that  maintains  a  closest  pair  in  0((logn)^) 
expected  time  per  update.  (See  Golin  et  al.[5].) 


In  this  paper,  we  give  new  deterministic  data 
structures  for  the  dynamic  closest  pair  problem. 
These  structures  are  based  on  our  solution  to  the 
Too -neighbor  problem,  on  data  structures  for  main¬ 
taining  boxes  of  constant  overlap,  and  on  a  new 
transformation.  Given  any  dynamic  closest  pair 
data  structure  having  more  than  linear  size,  this 
transformation  produces  another  dynamic  closest 
pair  structure  that  uses  less  space.  The  complexity 
boimds  of  the  new  structures  are  shown  in  the  last 
three  lines  of  Table  1.  The  results  of  the  last  two 
lines  are  obtained  by  applying  the  transformation 
repeatedly.  Note  that  we  obtain  the  first  linear  size 
deterministic  data  structure  that  maintains  a  clos¬ 
est  pair  in  polylogarithmic  time.  As  a  by-product, 
we  prove: 

Theorem  3  Let  S  he  a  set  of  n  points  in  IR^. 
There  exists  a  dynamic  data  structure  of  size 
0(n(logn)'^~^)  that  maintains  an  Loo-neighbor  of 
each  point  in  S .  This  data  structure  has  an  amor¬ 
tized  update  time  o/G((logn)^“’- loglogn). 

An  our  algorithms  use  classical  and  weU- 
imderstood  data  structuring  techniques,  such  as 
range  trees,  segment  trees  and  dynamic  fractional 
cascading. 

The  rest  of  this  paper  is  organized  as  follows.  In 
Section  2,  we  show  how  range  trees  can  be  used 
to  solve  the  Too-neighbor  problem.  We  also  give 
the  data  structure  for  solving  the  approximate  Tj- 
nearest  neighbor  problem. 

Our  first  data  structure  for  maintaining  the  clos¬ 
est  pair  stores  a  dynamically  changing  set  of  boxes 
that  are  of  constant  overlap,  i.e.,  there  is  a  constant 
c  such  that  each  box  contains  the  centers  of  at  most 
c  boxes  in  its  interior.  We  have  to  maintain  such 
boxes  under  insertions  and  deletions  such  that  for 
any  query  point  p  £  IR^,  we  can  find  aU  boxes  that 
contain  p.  In  Section  3,  we  give  two  data  structures 
for  this  problem.  The  first  one  is  based  on  segment 
trees.  Therefore,  its  size  is  superlinear.  We  also 
give  a  linear  size  solution  having  a  slightly  worse 
update  time  in  the  planar  case.  This  solution  is 
based  on  skewer  trees.  (See  [4,  11].) 

In  Section  4,  we  use  the  results  obtained  to 
give  a  new  data  structure  that  maintains  the  clos¬ 
est  pair  in  a  point  set.  This  structure  has  size 
0{n{logn)^~^)  and  an  amortized  update  time  of 
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Table  1:  Deterministic  data  structures  for  the  dynamic  closest  pair  problem.  Distances  are  measured  in 
the  if-metric,  for  a  fixed  1  <  t  <  oo.  In  the  last  two  lines,  k  is  an  arbitrary  non-negative  integer  constant. 
The  update  times  are  either  worst-case  (w)  or  amortized  (a). 

0((logn)-^“^  loglogn).  It  immediately  gives  the  whose  first  coordinate  is  at  least  equal  to  pi.  Dur- 

dynamic  data  structure  for  the  all-Too-neaxest-  ing  this  search,  each  time  we  move  from  a  node  v 

neighbors  problem.  In  Section  5,  we  give  the  trans-  to  its  left  son,  add  the  right  son  of  v  to  the  set 

formation  that  reduces  the  size  of  a  dynamic  clos-  M.  Let  v  be  the  leaf  in  which  this  search  ends, 

est  pair  structure.  Applying  this  transformation  If  the  point  stored  in  this  leaf  has  a  first  coordi- 

repeatedly  to  the  structure  of  Section  4  gives  the  nate  that  is  at  least  equal  to  pi,  then  add  v  to 

new  results  mentioned  in  Table  1.  the  set  M.  It  is  well  known  that  the  final  set  M 

satisfies  {r  G  5  :  ri  >  pi}  =  UveAf^"-  Here, 
S„  denotes  the  set  of  points  that  are  stored  in  the 
2  The  problem  subtree  of  u.  Range  trees  can  be  maintained  under 

insertions  and  deletions  of  points.  The  update  algo- 
Let  5  be  a  set  of  n  points  in  IR  .  We  want  to  rithms  use  dynamic  fractional  cascading.  (See  [6].) 

store  this  set  into  a  d^a  structure  such  that  for  ^  D-dimensional  range  tree  for  a  set  of  n  points 

any  query  point  p  G  IR  ,  we  can  find  a  point  in  S  0(n(logn)^-i)  size  and  can  be  maintained  in 

having  minimal  Loo-histance  to  p.  Such  a  point  is  0((logn)^“^  loglog  n)  amortized  time  per  inser- 
caUed  an  L^-neighbor  of  p  in  5.  deletion. 

Let  q  be  an  L^-neighbor  of  p  in  the  set  {s  G  foUowing  notations.  For  any  point 

5  :  Si  >  pi}.  We  caU  q  a  right- L^- neighbor  of  ^  ^  (pi,p2, . . . ,pp)  G  IR^,  we  denote  by  p'  the 
p  in  S.  Similarly,  a  point  r  is  called  a  left-L^-  5  ^  of  points  in 

neighbor  of  p  in  S,  if  it  is  an  Loo-neighbor  of  p  in  5/  ^  .  p  g  g}.  In  Figure  1,  our  recur- 

the  set  {s  G  5  :  Si  <  pi}.  In  order  to  guarantee  algorithm  is  given  that  finds  an  i«,-neighbor 

that  both  neighbors  always  exist,  we  add  the  2D  p  G  IR^. 

artificial  points  (ui, .  . . ,  cld),  where  ail  are  zero, 
except  for  one  which  is  either  00  or  —00,  to  the  set 

5  Lemma  1  1-  If  the  variable  stop  has  value  false 

The  data  structure  that  solves  the  ioo-neighbor  after  the  while-loop  of  Stage  2  has  been  com- 

problem  is  just  a  D-dimensional  rarape  tree  storing  pleted,  then  the  set  C  contains  a  right-L^- 

the  points  of  5  and  the  artificial  points.  (See  [7].)  neighbor  of  p  in  S. 

We  recall  an  important  property  of  range  trees: 

Let  p  be  any  point  in  IR"^.  Consider  the  set  2.  If  the  variable  stop  has  value  true  after  the 

{r  G  5  :  ri  >  pi},  i.e.,  the  set  of  all  points  in  while-loop  of  Stage  2  has  been  completed,  then 

S  having  a  first  coordinate  that  is  at  least  equal  to  the  set  C  U  S^  contains  a  right- Loo -neighbor  of 

p’s  first  coordinate.  Using  the  range  tree,  we  can  p  in  S . 

decompose  this  set  into  O(logn)  canonical  subsets; 

Initialize  M  :=  0.  Starting  in  the  root  of  the  main  3.  During  the  while-loop  of  Stage  3,  the  setCuS^ 
tree,  search  for  the  leftmost  leaf  storing  a  point  contains  a  right-L 00-neighbor  of  p  in  S . 
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Algorithm  Neighbor(p,  5,  D)  (*  returns  an  ioo-neighbor  of  p  in  5  *) 

begin 

1.  Assume  D  —  1. 

■=  maximal  element  in  the  1-dimensional  range  tree  that  is  less  than  p; 
q^  :=  minimal  element  in  the  1-dimensional  rcinge  tree  that  is  at  least  equal  to  p; 
if  Ip  -  I  <  Ip  -  then  return  else  return  q^  fi; 

2.  Assume  D  >2. 

2a.  Compute  a  right-ioo-neighbor: 

Stage  1.  Compute  the  set  M  of  nodes  of  the  main  tree,  see  above. 

Number  these  nodes  vi,V2, . .  .,Vm,  m  =  |M|,  where  Vi  is  closer  to 
the  root  than  ni_i,  2  <  i  <  m. 

Stage  2.  (*  one  of  the  sets  5„.  contains  a  right- ioo -neighbor  of  p  *) 

C  :=  0;  i  :=  1;  stop  false; 
while  i  <  m  and  stop  =  false 
do  x'  :=  Neighbor(p',  5'.,  D  -  1); 

r  :=  the  point  stored  in  the  rightmost  leaf  of  the  subtree  of  vp, 

if  d^{p',x')  >  |pi  -  7-1 1 

then  C  :=  C  LI  {s};  i  :=  t  -f  1 

else  V  :=  Vi]  stop  true 

fi 

od; 

if  stop  =  false 

then  q^  :=  a  point  of  C  having  minimal  ioo-distance  to  p; 
goto  2b 

fi; 

stage  3.  (*  the  set  C  U  Sy  contains  a  right-Xoo-neighbor  ofp  *) 
while  V  is  not  a  leaf 
do  w  :=  left  son  of  v; 

x'  Neighbor(p',  5^,  D  -  1); 

r  :=  the  point  stored  in  the  rightmost  leaf  of  the  subtree  of  w; 

if  d^{p',x')  >  Ipi  -  7-1 1 

then  C  :=  C  U  {a:};  v  :=  right  son  of  v 

else  V  :=  w 

fi 

od; 

q^  :=  a.  point  of  C  U  having  minimal  hoo-distance  to  p; 

2b.  Compute  a  left-Xoo-neighbor: 

In  a  completely  symmetric  way,  compute  a  left-hgo-neighbor  q^  ofp; 

2c.  if  doo(p,  q^)  <  doo{p,  q^)  then  return  q^  else  retiun  q^  fi 
end 


Figure  1:  Finding  an  hoo -neighbor. 
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This  lemma  implies  that  point  which  is  com¬ 
puted  in  part  2a  is  a  right-Xoo-neighbor  of  p  in  5. 
Similarly,  point  computed  in  part  2b  is  a  left- 
ioo-neighhor  of  p.  This  proves  that  the  point  that 
is  returned  in  part  2c  is  an  Too'Jiciglibor  of  p. 

Algorithm  Neighbor(p,  S,D)  can  be  implemented 
using  fractional  cascading,  such  that  its  running 
time  is  bounded  by  0((logn)-*^'"^).  Hence,  we  have 
a  data  structure  for  the  problem  that 

has  a  query  time  of  0((logn)^“^)  and  that  uses 
0(n(logn)'®“^)  space.  Range  trees  can  be  main¬ 
tained  under  insertions  and  deletions  of  points.  Be¬ 
cause  of  dynamic  fractional  cascading,  the  query 
time  increases  by  a  factor  of  O (log log  n).  This 
proves  Theorem  1. 

Let  5  be  a  set  of  n  points  in  IR^  and  let  e  >  0 
be  a  fixed  constant.  For  any  point  p  G  IR^,  we 
denote  by  p*  its  L2'ii6ighbor  in  5.  A  point  q  E  S 
is  called  a  (1  -}-  e)- approximate  L2-neighbor  of  p  if 
d2{p,<l)  <  (1  +  ^)d2{p,P*)-  We  want  to  store  the 
set  S  in  a  data  structure  such  that  for  any  query 
point  p  G  IR'^  we  can  find  a  (1  -}-  e)- approximate 
Li-neighbor  of  it. 

Let  p  G  IR^  and  let  g  be  an  ioo-neighbor  of  p  in 
S.  It  is  easy  to  show  that  g  is  a  v^-approximate 
L2-neighbor  of  p.  We  extend  this  as  follows.  Let 
{!Fi)  be  a  family  of  orthonormal  coordinate  systems 
all  sharing  the  same  origin.  Assiune  that  for  any 
point  a:  in  IR^  there  is  an  index  i  such  that  in  Ti 
all  coordinates  of  x  are  almost  equal,  i.e.,  within 
1  -t-  e  from  each  other.  In  Yao  [15],  it  is  shown  how 
such  a  family  consisting  of  0((c/e)'^“^)  coordinate 
systems  can  be  constructed.  In  the  planar  case, 
this  family  is  obtained  by  rotating  the  X-  and  Y- 
axes  over  an  angle  of  i  ■  (j),  0  <  i  <  27r/(^,  where 
0<(/><7r/4is  such  that  tang!)  =  c/(2  -f  e). 

The  data  structure  for  approximate  Li- 
neighbor  queries:  For  each  index  i,  let  Sj  denote 
the  set  of  points  in  S  with  coordinates  in  the  sys¬ 
tem  Xi.  The  data  structure  consists  of  a  collection 
of  range  trees;  the  z-th  range  tree  stores  the  set  Si. 
Finding  an  approximate  L2-neighbor:  Let  p  G 
IR'^  be  a  query  point.  For  each  index  i,  use  the  i- 
th  range  tree  to  find  an  Loo -neighbor  g(*)  of  p  in 
Si.  Report  an  Xqo -neighbor  that  has  minimal  Li- 
distance  to  p. 

It  can  be  shown  that  this  query  algorithm  reports 
a  (l-l-e)-approximate  jL2-neighbor  ofp.  This  proves 
Theorem  2. 


3  The  containment  problem  for 
boxes  of  constant  overlap 

A  box  is  a  H- dimensional  axes-paraJlel  cube,  i.e., 
it  is  of  the  form  [oi  :  ai  -(-  ^]  X  [02  :  02  -f  X 
. . .  X  [ao  :  an  -f  ^],  for  real  nmnbers  oi,  02, . . . ,  a© 
and  (5  >  0.  The  center  of  this  box  is  the  point 
(ai  -|-  ^/2, 02  -f  b  12, . . . ,  ud  -|-  b  12). 

Let  S  be  a  set  of  n  boxes  in  IR^  that  are  of 
constant  overlap,  i.e.,  there  is  an  integer  constant 
CD  such  that  each  box  B  of  S  contains  the  centers 
of  at  most  CD  boxes  in  its  interior.  (Here,  we  also 
count  the  center  of  B  itself.  Note  that  many  boxes 
may  have  their  center  on  the  boundary  of  B.) 

We  want  to  store  these  boxes  in  a  data  structure 
such  that  for  any  query  point  p  G  IR'^,  we  can  find 
aU  boxes  that  contain  p. 

Remark:  Note  that  we  distinguish  between  “be¬ 
ing  contained  in  a  box”  and  “being  contained  in  the 
interior  of  a  box”. 

In  the  full  paper  it  is  shown  that  any  point  p  G 
IR'^  is  contained  in  at  most  boxes  of  S. 

(The  proof  is  technical.) 

3.1  A  solution  based  on  segment  trees 

We  start  with  the  one- dimensional  case.  Let  S  be 
a  set  of  n  intervals  [aj  :  bj],  1  <  j  <  n,  that  are 
of  constant  overlap  with  constant  c.  We  store  the 
intervals  in  the  leaves  of  a  balanced  binary  search 
tree  T,  sorted  by  their  right  endpoints.  The  leaves 
of  this  tree  are  threaded  in  a  doubly- linked  list. 
Let  p  G  IR  be  a  query  element.  Search  in  T  for 
the  leftmost  leaf  containing  a  right  endpoint  that 
is  at  least  equal  to  p.  Starting  in  this  leaf,  walk 
along  the  leaves  to  the  right  and  report  all  inter¬ 
vals  encountered  that  contain  p.  Stop  walking  as 
soon  as  4c  intervals  have  been  reported,  or  c  inter¬ 
vals  have  been  encountered  that  do  not  contaun  p. 
It  is  clear  that  this  data  structrue  solves  the  one¬ 
dimensional  problem  in  0{n)  space  with  a  query 
time  of  O(logn).  Moreover,  intervals  can  be  in¬ 
serted  and  deleted  in  O(logn)  time. 

We  next  consider  the  B-dimensional  case,  where 
D  >2.  Let  5  be  a  set  of  n  boxes  in  IR^  that  are 
of  constant  overlap  with  constant  C£).  If  B  =  Bi  X 
B2  X  . . .  X  Bd  is  a  box  in  IR'^,  then  B'  denotes  the 
box  B2  X  . . .  X  Bd  in  IR'^"^.  Similarly,  S'  denotes 
the  set  {B'  :  B  G  5}.  Recall  that  we  use  a  similar 
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notation  for  points. 

The  D-dimensional  structure  for  D  >  2:  Let 
ai  <  02  <  . . .  <  Cm,  where  m  <  2n,  be  the  sorted 
sequence  of  all  distinct  endpoints  of  the  intervals 
of  the  first  coordinates  of  the  boxes  in  S.  We  store 
the  elementary  intervals  (  —  00  :  ai),[ai  :  ai],(ai  : 
^2)?  [^2  •  02] ,  * . . ,  [o^  I  (^m  •  00) 

in  this  order  in  the  leaves  of  a  balanced  binary 
search  tree,  called  the  main  tree.  Each  node  v  of 
this  tree  has  associated  with  it  an  interval  being 
the  union  of  the  elementary  intervals  of  the  leaves 
in  the  subtree  of  v.  Let  5„  be  the  set  of  all  boxes 
B  =  Bi  X  B2  X  ...  X  Bd  in  S  such  that  C  Bi 
and  g  Bi.  {f{v)  is  the  father  of  v.) 

Each  node  v  of  the  main  tree  contains  (a  pointer 
to)  an  associated  structure  for  the  set  Partition 
this  set  into  S^i,  Sy^  and  Syr,  consisting  of  those 
boxes  of  Sy  whose  centers  lie  to  the  left  of  the  “ver¬ 
tical”  slab  ly  X  in  or  on  the  boimdary  of  this 

slab,  and  to  the  right  of  this  slab,  respectively. 

The  associated  structure  of  v  consists  of  three 
recursively  defined  (D  —  1)- dimensional  structures 
for  the  sets  5',,  5'^  and  S',.. 

The  query  algorithm:  Let  p  =  (pi,p2,  •  ■■,Pd)  G 
IR'°  be  a  query  point.  Search  in  the  main  tree  for 
the  elementary  interval  that  contains  pi.  For  each 
node  V  on  the  search  path,  recursively  perform  a 
{D  -  1)- dimensional  query  with  the  point  p'  in  the 
three  structures  that  are  stored  with  v. 

At  the  last  level  of  the  recursion,  a  one¬ 
dimensional  query  is  performed  in  a  binary  search 
tree.  In  this  tree,  the  algorithm  stops  if  it  has  re¬ 
ported  2^'^^  C£)  boxes  of  S  that  contain  p,  or  if  it 
has  encountered  cd  boxes  of  S  that  do  not  contain 
P- 

Lemma  2  The  query  algorithm  is  correct. 

Proof:  It  is  well  known  that  the  set  of  ail  boxes 
B  =  Bi  X  ...  X  Bd  such  that  pi  6  Bi  is  exactly 
the  umon  of  all  sets  Sy,  where  u  is  a  node  on  the 
search  path  to  pi.  Hence,  we  only  have  to  consider 
the  nodes  on  this  search  path. 

Let  w  be  a  node  on  the  path  to  pi.  We  have 
to  find  aU  boxes  in  5„  whose  last  D  —  1  inter¬ 
vals  contain  {p2,  ■  ■  ■  ,Pd)-  We  claim  that  the  re¬ 
cursive  queries  in  the  three  structures  stored  with 
V  find  these  boxes.  We  know  that  there  are  at  most 
boxes  that  contain  p.  Hence,  at  the  last 


level  of  the  recursion,  the  query  algorithm  can  stop 
as  soon  as  it  has  reported  this  many  boxes.  It  re¬ 
mains  to  prove  that,  at  the  last  level,  the  query 
algorithm  can  stop  if  it  has  encountered  cd  boxes 
that  do  not  contain  p. 

Consider  such  a  last  level.  That  is,  let 
Vi,V2, . .  be  nodes  such  that  (1)  Vi  =  v,  (2) 

Vi  is  a  node  of  the  main  tree  of  one  of  the  three 
structures  that  are  stored  with  u,_i,  and  (3)  Vi  lies 
on  the  search  path  to  pi. 

The  algorithm  makes  one- dimensional  queries 
with  p£)  in  the  three  structures — binary  search 
trees — that  are  stored  with  Consider  one 

such  query.  The  algorithm  searches  for  pxj.  Let  r 
be  the  leaf  in  which  this  search  ends.  Starting  in  r, 
the  algorithm  walks  along  the  leaves  to  the  right. 
During  this  walk,  it  encounters  boxes  that  do  or  do 
not  contain  p.  Assume  that  in  leaf  s,  the  c/p-th  box 
is  encountered  that  does  not  contain  p.  We  have 
to  show  that  the  algorithm  can  stop  in  s.  That  is, 
we  must  show  that  all  leaves  to  the  right  of  s  store 
boxes  that  do  not  contain  p. 

Assume  this  is  not  the  case.  Then,  there  is  a  box 
A  =  [ai  :  oi  -t-  a]  X  [02  :  02  -f  a]  X  . . .  X  [aj5  :  ajo  -f-  a] 
that  is  stored  in  a  leaf  to  the  right  of  s  and  that 
contains  p.  Let  B^^l  =  [6^1  :  bji  +  /3j]  x  [bj2  : 
bj2  +  /3j]  X  ...  X  [bjD  :  bjD  -1  /3j],  I  <  j  <  cd, 
be  the  encountered  boxes  between  r  and  s  that  do 
not  contain  p.  In  the  full  paper,  we  prove  that  A 
contains  the  centers  of  all  these  boxes  in  its  interior. 
This  is  a  contradiction.  ■ 

Theorem  4  Let  S  be  a  set  of  n  boxes  in  IR^  of 
constant  overlap.  There  exists  a  dynamic  data 
structure  of  size  0(n(logn)'°“^)  such  that  for  any 
point  p  G  IR'^,  we  can  find  all  boxes  of  S  that 
contain  p  in  0((logn)®“Moglogn)  time.  This 
data  structure  has  an  amortized  update  time  of 
0((logn)^-i  log  logo). 

3.2  A  solution  based  on  skewer  trees 

In  this  section,  we  give  a  linear  space  solution  to 
the  box  containment  problem.  This  solution  is 
based  on  skewer  trees,  introduced  by  Edelsbrmmer 
et  al.  [4]. 

Let  ^  be  a  set  of  n  boxes  in  IR"^  that  are  of 
constant  overlap  with  constant  cd-  For  D  =  1, 
the  data  structure  is  the  same  as  in  the  previous 
subsection. 
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The  i)-dimensional  structure  for  Z)  >  2:  If 
S  is  empty,  then  the  data  structure  is  also  empty. 
Assume  that  5  is  non-empty.  Let  7  be  the  median 
of  the  set  {(ai-t-6i)/2  :  [ai  ;  61]  X  [02  :  62]  X . .  .X  [a£)  : 
bo]  6  5},  and  let  e  be  the  largest  element  that  is 
less  than  7  in  the  set  of  all  elements  ci,  (oi  +  bi)/2 
and  bi,  where  [ai  :  61]  X  . . .  X  [an  ;  6d]  ranges  over 
S.  Let  7i  :=  7  -  e/2  and  let  a  be  the  hyperplane 
in  with  equation  xi  =  7i. 

Let  5<,  So  and  5>  be  the  set  of  boxes  [ai  : 
61]  X  ...  X  [aj)  :  bn]  in  S  such  that  61  <  71, 
ai  <  7i  <  an<i  7i  <  respectively.  The  D- 
dimensional  data  structure  for  the  set  S  is  an  aug¬ 
mented  binary  search  tree — called  the  main  tree — 
having  the  following  form: 

1.  The  root  contains  the  hyperplane  a. 

2.  The  root  contains  pointers  to  its  left  and  right 
sons,  which  are  D-dimensional  structmes  for  the 
sets  S<  and  5>,  respectively. 

3.  The  root  contains  (a  pointer  to)  an  associated 
structure  for  the  set  Sq:  Partition  this  set  into  Sqi 
and  5or,  consisting  of  those  boxes  in  Sq  whose  cen¬ 
ters  lie  to  the  left  of  the  hyperplane  a,  and  to  the 
right  of  CT,  respectively.  The  associated  structme  of 
the  root  consists  of  two  {D  —  1)- dimensional  struc¬ 
tures  for  the  sets  5q;  and  Sq^. 

The  query  algorithm  and  its  analysis  is  similar 
to  that  of  the  previous  subsection.  The  details  can 
be  found  in  the  full  paper.  (See  also  Smid  [11].) 

Theorem  5  Let  S  be  a  set  of  n  boxes  in  IR'^  of 
constant  overlap.  There  exists  a  data  structure  of 
size  0{n)  such  that  for  any  point  p  £  IR'^,  we  can 
find  all  boxes  of  S  that  contain  p  in  C)((logn)^“^) 
time.  This  static  data  structure  can  be  built  in 
0(n  log  n)  time. 

For  the  dynamic  version  of  the  problem,  the 
query  time  is  0((logn)'^“^  loglogn)  and  the  size 
of  the  structure  is  0[n).  Boxes  can  be  inserted  and 
deleted  in  amortized  time  0((logn)^loglogn). 

4  Maintaining  the  closest  pair 

In  this  section,  we  apply  the  results  obtained  so 
far  to  maintain  a  closest  pair  of  a  point  set  under 
insertions  and  deletions.  Let  S  be  a  set  of  n  points 
in  IR^  and  let  1  <  t  <  00  be  a  real  number.  We 
denote  the  Zt-distance  between  any  two  points  p 
and  q  in  IR^  by  d{p,  q).  The  pair  P,Q  £  S  \s  called 


a  closest  pair  of  S  if  d{P,  Q)  =  min{d(p,  q)  ■  p,q  £ 
S,p^  ?}• 

We  introduce  the  following  notations.  For  any 
point  p  £  IR^,  box{p)  denotes  the  smallest  box 
centered  at  p  that  contedns  at  least  (2D+2)'^  points 
of  5\  {p}.  In  other  words,  the  side  length  of  box{p) 
is  twice  the  Zoo-distance  between  p  and  its  (1  -f 
(2D  -h  2)^)-th  resp.  (2D  -|-  2)^-th  Zoo-ueighbor,  if 
p  £  S  resp.  p  ^  S. 

Let  N{p)  be  the  set  of  points  of  5  \  {p}  that  are 
contained  in  the  interior  of  box{p).  Note  that  N{p) 
has  size  less  than  (2D  -|-  2)^.  In  fact,  N{p)  may 
even  be  empty. 

Lemma  3  The  set  {{p,q)  :p£  S,q£  N{p)}  con¬ 
tains  a  closest  pair  of  S . 

Proof:  Let  {P,Q)  be  a  closest  pair  of  5.  We  have 
to  show  that  Q  £  N{P).  Assume  this  is  not  the 
case.  Let  6  be  the  side  length  of  box{P).  Since  Q 
lies  outside  or  on  the  boundary  of  this  box,  we  have 
d{P,Q)>S/2. 

Partition  box{P)  into  (2D  -f-  2)®  subboxes  with 
sides  of  length  S/{2D  +  2).  Since  box{P)  contains 
at  least  1  +  (2D  -|-  2)^  points  of  5,  one  of  these 
subboxes  contains  at  least  two  points.  These  two 
points  have  distance  at  most  D  •  S f{2D  -|-  2)  <  S/2, 
which  is  a  contradiction,  because  {P,Q)  is  a  closest 
pair  of  5.  ■ 

The  set  {box{p)  :  p  G  5}  is  of  constant  overlap: 
each  box  contains  the  centers  of  at  most  (2D  4-  2)^ 
boxes  in  its  interior.  These  centers  are  precisely 
the  points  of  N{p)  U  {p}.  This  fact  and  the  above 
lemma  suggest  the  following  data  structure. 

The  closest  pair  data  structure: 

1.  The  points  of  5  are  stored  in  a  range  tree. 

2.  The  distances  of  the  multiset  {d{p,q)  :  p  £ 
S,q  £  N{p)}  are  stored  in  a  heap.  With  each  dis¬ 
tance,  we  store  the  corresponding  pair  of  points. 
(Note  that  both  d{p,q)  and  d{q,p)  may  occm  in 
the  heap.) 

3.  The  points  of  S  are  stored  in  a  dictionary.  With 
each  point  p,  we  store  a  list  containing  the  ele¬ 
ments  of  N{p).  For  convenience,  we  also  caU  this 
list  N{p).  With  each  point  q  in  N{p),  we  store  a 
pointer  to  the  occurrence  of  d{p,  q)  in  the  heap. 

4.  The  set  {6oa:(p)  :  p  G  5}  is  stored  in  the  dy¬ 
namic  data  structure  of  Theorem  4.  This  structure 
is  called  the  box  tree. 
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It  follows  from  Lemma  3  that  the  pair  of  points 
that  is  stored  with  the  minimal  element  of  the  heap 
is  a  closest  pair  of  S.  The  update  algorithms  are 
rather  straightforward.  We  only  give  the  insertion 
algorithm. 

The  insertion  algorithm:  Let  p  G  IR^  be  the 
point  to  be  inserted.  Assume  p  ^  S. 

1.  Using  the  range  tree,  find  the  {2D  +  2)®  Loo- 
neighbors  of  p  in  5.  The  point  among  these  neigh¬ 
bors  having  maximal  Loo-distance  to  p  determines 
box{p).  The  neighbors  that  are  contained  in  the 
interior  of  hox{p)  form  the  list  N{p). 

2.  Insert  p  into  the  range  tree  and  insert  the  dis¬ 
tances  d{p,q),  q  G  N {p)  into  the  heap.  Then,  in¬ 
sert  p — together  with  the  list  N{p) — into  the  dic¬ 
tionary.  With  each  point  q  in  N{p),  store  a  pointer 
to  d{p,q)  in  the  heap.  Finally,  insert  box{p)  into 
the  box  tree. 

3.  Using  the  box  tree,  find  all  boxes  that  contain 
p.  For  each  reported  element  box{q),  q  ^  p,  that 
contains  p  in  its  interior,  do  the  following: 

3. a.  Search  in  the  dictionary  for  q.  Insert  p  into 
N{q),  insert  d{q,p)  into  the  heap  and  store  with  p 
a  pointer  to  d{q,p). 

3.b.  If  N{q)  has  size  less  than  {2D  +  2)^,  then 
the  insertion  algorithm  is  completed.  Otherwise,  if 
N{q)  has  size  {2D  +  2)^,  let  ri, . . . ,  r;  be  aU  points 
in  N{q)  that  have  maximal  Loo-distance  to  q.  For 
each  I  <  i  <  I,  delete  r*  from  N{q)  and  delete 
d{q,ri)  from  the  heap.  Finally,  delete  box{q)  from 
the  box  tree  and  insert  the  box  centered  at  q  having 
ri  on  its  boundary  as  the  new  box{q). 

During  the  update  algorithms,  we  perform  a  con¬ 
stant  number  of  query  and  update  operations  in  the 
range  tree,  the  box  tree,  the  heap  and  the  dictio¬ 
nary.  Therefore,  Theorems  1  and  4  imply: 

Theorem  6  Let  S  be  a  set  of  n  points  in  IR'^  and 
let  1  <  t  <  oo.  There  exists  a  data  structure  of 
size  0(n(logn)'^“^)  that  maintains  an  Lfclosest 
pair  of  S  in  0((logn)®“^  loglogn)  amortized  time 
per  insertion  and  deletion. 

Consider  again  our  data  structure.  The  box 
box{p)  that  is  associated  with  point  p  contains  an 
Loo-neighbor  of  p  in  5  \  {p}.  Therefore,  the  data 
structure  can  easily  be  adapted  such  that  it  main¬ 
tains  for  each  point  in  S  its  Loo -neighbor.  This 
proves  Theorem  3. 


5  A  transformation  for  reducing 
the  space  complexity 

The  closest  pair  data  structure  of  the  previous  sec¬ 
tion  uses  more  than  linear  space.  This  raises  the 
question  if  the  same  update  time  can  be  obtained 
using  only  linear  space.  Note  that  we  have  a  lin¬ 
ear  space  solution  for  maintaining  the  set  {box{p)  : 
p  G  5}.  (See  Theorem  5.)  For  the  Loo-neighbor 
problem,  however,  no  linear  space  solution  having 
polylogarithmic  query  and  update  times  is  known. 
Hence,  in  order  to  reduce  the  space  complexity,  we 
should  avoid  using  the  range  tree. 

We  wiU  give  a  transformation  that,  given  any 
dynamic  closest  pair  data  structure  having  more 
than  linear  size,  produces  another  dynamic  closest 
pair  structure  that  uses  less  space. 

Let  DS  be  a  data  structure  that  maintains  a  clos¬ 
est  pair  in  a  set  of  n  points  in  IR^  under  inser¬ 
tions  and  deletions.  Let  S{n)  and  U{n)  denote  the 
size  and  update  time  of  DS,  respectively.  We  as¬ 
sume  that  S{n)  and  U{n)  are  non-decreasing.  Let 
f{n)  be  a  non- decreasing  integer  function  such  that 
1  <  f{n)  <  n/2. 

Let  5  C  IR-^  be  the  current  set  of  points.  The 
cardinality  of  S  is  denoted  by  n.  Our  transformed 
data  structure  wiU  be  completely  rebuilt  after  a 
sufficiently  long  sequence  of  updates.  Let  So  be 
the  set  of  points  at  the  moment  of  the  most  recent 
rebuilding  and  let  no  be  its  size  at  that  moment. 

As  in  the  previous  section,  for  each  p  G  IR^, 
box{p)  denotes  the  smallest  box  centered  at  p  that 
contains  at  least  {2D  -(-  2)-^  points  of  5  \  {p}.  The 
set  of  aU  points  of  5  \  {p}  that  are  in  the  interior 
of  this  box  is  denoted  by  iV(p).  If  p  G  5o,  then 
boxo{p)  denotes  the  smallest  box  centered  at  p  that 
contains  at  least  {2D  +  2)^  points  of  Sq  \  {p}. 

The  transformed  closest  pair  data  structure: 

1.  The  set  S  is  partitioned  into  sets  A  and  B  such 
that  A  C  {p  G  S  :  p  G  So  A  box{p)  C  6oa:o(p)}- 

2.  The  distances  of  the  multiset  {d{p,  q)  '■  p  G 
A,q  G  A’(p)}  are  stored  in  a  heap.  With  each  dis¬ 
tance,  we  store  the  corresponding  pair  of  points. 

3.  The  boxes  of  the  set  {boxo{p)  :  p  G  ^o}  are 
stored  in  a  list,  called  the  box  list.  With  each  el¬ 
ement  boxo{p)  in  this  list,  we  store  a  bit  having 
value  true  if  and  only  if  p  6  A.  Moreover,  if  p  G  A, 
we  store  with  boxo{p)  the  box  6oa;(p). 

4.  The  boxes  of  the  set  {boxo{p)  :  p  G  5o}  are 
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stored  in  the  static  data  structure  of  Theorem  5. 
This  structure  is  called  the  box  tree.  With  each 
box  in  this  structure,  we  store  a  pointer  to  its  oc¬ 
currence  in  the  box  list. 

5.  The  points  of  S  are  stored  in  a  dictionary.  With 
each  point  p,  we  store  a  bit  indicating  whether  p 
belongs  to  A  or  5.  If  p  G  A,  then  we  store  with  p 
5.a.  a  pointer  to  the  occurrence  of  boxo{p)  in  the 
box  list,  and 

5. b.  a  list  containing  the  elements  of  N{p).  For 
convenience,  we  also  call  this  list  N{p).  With  each 
point  q  in  N{p),  we  store  a  pointer  to  the  occur¬ 
rence  of  d{p,  g)  in  the  heap. 

6.  The  set  B  is  stored  in  the  dynamic  data  struc¬ 
ture  DS,  called  the  B- structure. 

Lemma  4  Let  6  be  the  minimal  distance  stored  in 
the  heap  and  let  8'  be  the  distance  of  a  closest  pair 
in  B.  Then,  min(^,  ^')  is  the  distance  of  a  closest 
pair  in  the  set  S. 

Proof:  Let  {P,Q)  be  a  closest  pair  in  S.  We  dis¬ 
tinguish  two  cases. 

Case  1:  At  least  one  of  P  and  Q  is  contained  in 
A.  Assume  that  P  G  A.  Since  box{P)  contains  at 
least  1  -1-  {2D  +  2)^  points  of  5,  Q  is  contained  in 
the  interior  of  this  box.  Hence,  Q  G  N{P)  and  the 
disteince  d{P,Q)  is  stored  in  the  heap.  Therefore, 
8  =  d{P,Q).  Clearly,  d{P,Q)  <  8'.  This  proves 
that  (i(P,  Q)  =  min(^,  ^'). 

Case  2:  Both  P  and  Q  are  contained  in  B.  Then 
8'  =  d{P,Q)  and  d{P,Q)  <8.M 
Initialization:  At  the  moment  of  initialization, 
S  =  So  =  A  and  B  -  Using  Vaidya’s  al¬ 
gorithm  [14],  compute  for  each  point  p  in  5  its 
1  -f  {2D  A  2)®  ioo-neighbors.  The  point  among 
these  neighbors  having  maximal  Poo-distance  to  p 
determines  6oa:(p)  =  boxo{p).  The  neighbors  (ex¬ 
cept  p  itself)  that  are  contained  in  the  interior  of 
this  box  form  the  hst  N{p).  It  is  clear  how  the  rest 
of  the  data  structure  can  be  built. 

The  insertion  algorithm:  Let  p  G  be  the 
point  to  be  inserted. 

1 .  Insert  p  into  the  dictionary  and  store  with  p  a 
bit  saying  that  p  belongs  to  B.  Then,  insert  p  into 
the  P-structure. 

2.  Using  the  box  tree,  find  aU  boxes  that  contain 
p.  For  each  reported  element  boxo{q),  follow  the 
pointer  to  its  occmrrence  in  the  box  list.  If  the  bit  of 


boxo{q)  has  value  true,  then  check  ifp  is  contained 
in  the  interior  of  box{q).  If  so,  do  the  following: 

2. a.  Search  in  the  dictionary  for  q.  Insert  p  into 
N{q),  insert  d{q,p)  into  the  heap  and  store  with  p 
a  pointer  to  d{q,p). 

2.b.  If  N{q)  has  size  less  than  {2D  4-  2)^,  then 
the  insertion  algorithm  is  completed.  Otherwise, 
let  ri, . . . ,  rj  be  all  points  of  N {q)  that  are  at  maxi¬ 
mal  Poo-distance  from  q.  For  each  1  <  i  <  /,  delete 
n  from  N{q)  and  delete  d{q,  ri)  from  the  heap.  Fi¬ 
nally,  replace  box{q) — which  is  stored  with  boxo{q) 
in  the  box  list — by  the  box  centered  at  q  having  ri 
on  its  boundary,  being  the  new  box{q). 

Note  that  since  AC{pG5':pG5oA  box{p)  C 
boxo{p)},  all  boxes  box{q),  g  G  A,  that  contain  p 
are  foimd  in  Step  2. 

The  deletion  algorithm:  Let  p  G  5  be  the  point 
to  be  deleted. 

1.  Search  for  p  in  the  dictionary.  If  p  £  B, 
then  delete  p  from  this  dictionary  and  from  the  B- 
structure.  Otherwise,  if  p  G  A,  follow  the  pointer 
to  boxo{p)  in  the  box  list  and  set  its  bit  to  false. 
Moreover,  for  each  point  g  in  N{p),  delete  the  dis¬ 
tance  d{p,q)  from  the  heap.  Then,  delete  p  from 
the  dictionary. 

2.  Using  the  box  tree,  find  all  boxes  that  contain 
p.  For  each  reported  element  boxo{q),  follow  the 
pointer  to  its  occurrence  in  the  box  list.  If  the  bit 
of  boxo{q)  has  value  true,  then  check  if  p  is  con¬ 
tained  in  box{q).  If  so,  do  the  following:  Set  the 
bit  of  boxo{q)  to  false.  Search  in  the  dictionary  for 
g.  For  each  point  r  in  N{q),  delete  d{q,  r)  from  the 
heap.  Then,  delete  the  pointer  from  g  to  boxo{q), 
delete  the  hst  N{q),  and  store  with  g  a  bit  say¬ 
ing  that  it  belongs  to  B.  Finally,  insert  g  into  the 
B-structure. 

Rebuild:  Recall  that  no  is  the  size  of  5  at  the 
moment  we  initiahze  the  structure.  After  /(no) 
updates  have  been  performed,  we  discard  the  entire 
structure  and  initialize  a  new  data  structure  for  the 
current  S. 

Theorem  7  Let  DS  be  any  data  structure  for  the 
dynamic  closest  pair  problem.  Let  S{n)  and  U{n) 
denote  the  size  and  update  time  of  DS,  respectively. 
Let  1  <  /(^)  <  n/2  be  a  non-decreasing  integer 
function.  We  can  transform  DS  into  another  data 
structure  for  the  dynamic  closest  pair  problem  hav¬ 
ing  size  0{n  4-  5(/(n))),  and  an  amortized  update 
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time  ofO{{logn)^  ^  +  U{f{n))  +  (nlogn)//(n)). 

If  we  apply  this  theorem  twice  (for  the  case  D  >  3), 
respectively  k  times  (for  the  case  D  =  2),  with  ap¬ 
propriate  choices  for  the  function  f{n),  then  we  get 
the  results  given  in  the  last  three  lines  of  Table  1. 
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Competitive  Searching  in  a  Generalized  Street 

Amitava  Datta*  Christian  Icking** 


Abstract 

We  consider  the  problem  of  a  robot  which  heis  to  find 
a  path  in  an  unknown  simple  polygon  from  one  point 
s  to  another  point  t,  based  only  on  what  it  has  seen  so 
far.  A  Street  is  a  polygon  for  which  the  two  boundary 
chains  from  s  to  t  are  mutually  weakly  visible,  and  the 
set  of  streets  was  the  only  class  of  polygons  for  which  a 
competitive  search  algorithm  weis  known. 

We  define  a  new,  strictly  larger  class  of  polygons, 
called  generalized  streets  or  ^-streets  which  are  charac¬ 
terized  by  the  property  that  every  point  on  the  bound¬ 
ary  of  a  ^-street  is  visible  from  a  point  on  a  horizon¬ 
tal  line  segment  connecting  the  two  boundary  chains 
from  s  to  t.  We  present  an  on-line  strategy  for  a  robot 
placed  at  s  to  find  t  in  an  unknown  rectilinear  ^-street; 
the  length  of  the  path  created  is  at  most  9  times  the 
length  of  the  shortest  path  in  the  Li  metric.  This  is 
optimal  since  we  show  that  no  strategy  can  achieve  a 
smaller  competitive  factor  for  all  rectilinear  ^-streets. 
Compared  to  the  L2-shortest  path,  the  strategy  is  9.06- 
competitive  which  leaves  only  a  very  small  gap  to  the 
lower  bound  of  9. 

1  Introduction 

Path  planning  is  one  of  the  fundamental  problems  in 
robotics.  In  contrast  to  path  planning  with  complete 
information,  i.  e.  if  the  whole  scene  is  known  in  ad¬ 
vance,  it  is  interesting  for  many  real  life  situations  to 
consider  the  problem  of  finding  a  path  on-line.  Here, 
the  geometry  of  the  scene  is  unknown  and  the  robot 
has  to  make  decisions  depending  on  the  information  it 
gathers  through  (e.  g.  visual  or  tactile)  sensors. 
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Lumelsky  and  Stepanov  [8]  study  this  problem  when 
only  tactile  sensors  are  used.  Papadimitriou  and  Yan- 
nakakis  [9]  consider  several  variants  of  the  problem  as  a 
two-person  game  and  gave  bounds  on  the  length  of  the 
generated  path  in  terms  of  the  length  of  the  shortest 
path.  Blum  et  al  [2]  present  deterministic  and  ran¬ 
domized  on-line  algorithms  for  several  versions  of  this 
problem  when  the  obstacles  are  rectangles  or  convex 
polygons. 

Following  the  concept  of  competitive  algorithms  intro¬ 
duced  by  Sleator  and  Tarjan  [10]  for  general  problems 
in  computer  science,  a  strategy  for  searching  on-line  is 
called  k-competitive  for  a  constant  k,  if  it  can  be  guaran¬ 
teed  that  the  generated  path  is  no  longer  than  k  times 
the  shortest  path.  The  constant  k  is  then  called  the 
competitive  factor. 

Klein  [6]  describes  the  problem  of  designing  an  on¬ 
line  strategy  for  a  robot  when  the  robot  is  constrained 
to  move  inside  a  simple  polygon  from  a  vertex  s  to  a 
vertex  t.  He  gives  an  on-line  strategy  for  a  class  of 
polygons  called  streets.  A  street  is  a  polygon  such  that 
the  clockwise  polygonal  chain  L  from  s  to  i  and  the 
anticlockwise  polygonal  chain  R  from  s  to  t  are  mutually 
weakly  visible.  An  alternative  criterion  for  recognizing 
a  street  weis  also  given  in  [5].  Das  et  al  [3]  give  an 
algorithm  to  compute,  for  a  given  polygon,  all  pairs  of 
points  s  and  t  such  that  the  polygon  is  a  street  with 
respect  to  these  points. 

Klein  proves  a  lower  bound  of  y/2  (>  1.41)  on  the 
competitive  factor  for  searching  in  a  street,  and  his 
strategy  achieves  a  competitive  factor  of  (<  5.72). 
Recently,  Kleinberg  [7]  has  improved  this  ratio  to  2^2 
(<  2.83).  He  also  mentions  that  his  strategy  is  \/2- 
competitive  for  rectilinear  streets  which  is  optimal.  In¬ 
terestingly,  this  means  in  fact  that  one  can  find  on-line 
an  Li -shortest  path  for  rectilinear  streets. 

It  hcis  been  posed  as  a  challenging  open  problem 
whether  more  general  cleisses  of  polygons  admit  com¬ 
petitive  searching.  In  this  paper,  we  introduce  one  such 
class. 

The  rest  of  the  paper  is  organized  as  follows.  In  Sec¬ 
tion  2,  we  introduce  some  definitions  and  properties  re¬ 
lated  to  ^-streets.  We  give  the  strategy  for  rectilinear 
^-streets  and  analyze  its  complexity  in  Section  3.  Some 
open  questions  are  mentioned  in  Section  4. 


Figure  1:  A  ^-street. 


2  Definitions  and  properties 

We  consider  a  simple  polygon  P  in  the  plane  with  a  start 
vertex,  s,  and  a  target  vertex,  t.  We  use  the  Cartesian 
coordinate  system.  The  notion  of  horizontal  (resp.  ver¬ 
tical)  indicates  a  direction  parallel  to  the  ar-axis  (resp. 
y-axis). 

We  assume  for  simplicity  that  no  three  vertices  are 
aligned.  Our  results  can  be  easily  adapted  to  handle  the 
general  case  when  three  vertices  are  aligned  or  s  and  t 
are  not  vertices.  The  clockwise  (resp.  anticlockwise) 
chain  from  s  to  t  is  called  the  left  chain  or  L  (resp. 
right  chain  or  R).  These  two  chains  induce  a  natural 
ordering  of  vertices  of  the  polygon  from  s  to  t. 

Definition  1  [6]  A  polygon  is  called  a  street  if  the  two 
chains  L  and  R  are  mutually  weakly  visible. 

Definition  2  A  horizontal  line  segment  inside  the 
polygon  P  with  both  endpoints  on  the  boundary  of  P 
is  called  a  chord.  The  extension  of  a  horizontal  edge 
e  across  the  interior  of  P  such  that  the  two  end  points 
hit  the  polygon  boundary  is  called  the  chord  of  e,  analo¬ 
gously  we  define  the  chord  of  a  vertex.  If  a  chord  touches 
both  the  L  and  R  chains  it  is  called  an  LR-chord. 

Definition  3  A  simple  polygon  in  the  plane  is  called 
a  generalized  street  or  Q -street  if  for  every  boundary 


point  p  ^  LU  R,  there  exists  an  iiZ-chord  c  such  that 
p  is  visible  from  a  point  on  c. 

Figure  1  shows  an  example  of  a  ^-street.  Some  LR- 
chords,  drawn  as  dashed  lines,  indicate  visibilities  along 
the  dotted  rays. 

Lemma  4  The  class  of  Q-streets  contains  all  streets 
and  this  containment  is  proper. 

Proof.  Consider  a  street  and  a  point  p  in  its  left  chain 
L,  see  Figure  2.  We  want  to  show  that  p  is  visible  from 
a  point  on  an  Li?-chord.  Within  the  street,  p  must  be 
visible  from  some  point  q  £  R.  We  shoot  a  horizontal 
ray  from  p  into  the  interior  of  the  street.  If  it  hits  R 
then  p  lies  on  an  LiZ-chord  and  we  are  done. 

Otherwise  we  move  a  point  r  from  p  to  y  on  the  line 
segment  pq  from  which  we  continue  to  shoot  horizontal 
rays  in  both  directions,  see  Figure  2.  We  stop  at  the 
first  position  where  a  ray  touches  the  right  chain  R, 
which  must  eventually  happen  since  q  £  R.  This  ray 
determines  an  TiZ-chord  from  which  p  is  visible. 

This  proves  that  every  street  is  a  ^-street.  In  Fig¬ 
ure  1,  we  have  seen  an  example  of  a  ^-street  which  is 
not  a  street.  □ 

The  following  lemma  states  a  simple  property  of  LR- 
chords. 
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Figure  2:  A  street  is  always  a  ^-street  (Lemma  4). 


Lemma  5  Every  path  in  P  from  s  to  t  intersects  all 
LR-chords  of  the  polygon. 

Proof.  Consider  an  LiJ-chord  c  and  suppose  c  touches 
the  chains  L  and  R  at  the  points  p  and  q  respectively. 
The  line  segment  pq  divides  the  polygon  into  two  parts. 
The  point  t  cannot  be  in  the  part  which  also  contains 
s,  otherwise  p  and  q  would  be  on  the  same  chain  L  or 
R.  This  implies  that  s  and  t  are  in  different  parts  and 
any  path  from  s  to  t  must  intersect  □ 

3  A  strategy  for  rectilinear  ^-streets 

From  now  on,  we  concentrate  on  rectilinear  ^-streets; 
all  edges  are  either  horizontal  or  vertical,  see  Figure  3 
for  an  example.  It  is  clear  that  the  class  of  rectilinear 
^-streets  includes  all  rectilinear  streets  and  is  strictly 
larger. 

Definition  6 

•  A  horizontal  edge  of  the  polygon  is  called  a  cut  edge 
if  both  of  its  vertices  are  reflex.  The  chord  of  a  cut 
edge  is  called  a  cut  chord. 

•  A  horizontal  edge  is  called  a  step  edge  if  one  of  its 
vertices  is  convex  and  the  other  is  reflex.  The  chord 
of  a  step  edge  is  called  a  step  chord. 

•  A  cut  chord  divides  the  polygon  into  three  parts,  a 
step  chord  divides  it  into  two  parts.  We  call  these 
parts  the  regions  of  the  chord. 

We  refer  to  a  cut  chord  or  step  chord  as  a  critical 
chord,  see  Figure  4  for  examples.  It  is  easy  to  see  that 


iflf  the  two  points  s  and  t  are  in  two  different  regions  of 
a  chord  then  it  is  an  LiZ-chord. 

Definition  7  A  reflex  vertex  /  is  called  a  landmark 
with  respect  to  an  LR-chord  c  if 

•  /  is  visible  from  some  point  of  c,  and 

•  both,  the  vertical  edge  incident  on  I,  and  c,  are  part 
of  the  same  region  of  the  chord  of  1. 

See  Figure  4  for  an  example  of  a  landmark.  A  land¬ 
mark  is  always  an  endpoint  of  a  cut  chord  or  a  step 
chord.  For  the  robot,  to  see  a  landmark  will  be  an  evi¬ 
dence  that  the  target  is  hidden  behind  the  chord  of  that 
landmark. 

Lemma  8  Let  I  be  a  landmark  with  respect  to  an  LR- 
chord  and  let  c  be  the  chord  of  the  horizontal  (cut  or 
step)  edge  ofl.  Then  the  following  is  true. 

1.  The  points  s  andi  are  sr  different  regions  ofc,  i.  e. 
c  is  an  LR-chord. 

2.  All  landmarks  (if  any)  with  respect  to  c  are  in  the 
regions  of  s  and  t. 

3.  Either  the  point  t  or  some  landmark  in  the  region 
oft  is  visible  from  some  point  on  c. 

Proof.  We  prove  the  lemma  for  the  case  of  c  being  a 
cut  chord.  The  proof  for  a  step  chord  is  similar. 

Suppose,  c  is  a  cut  chord  which  passes  through  the 
cut  edge  e.  Suppose,  the  vertex  v  is  adjacent  to  e  and 
w  is  a  landmark  with  respect  to  an  LiZ-chord  ci.  First, 
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Figure  3:  A  rectilinear  ^-street. 


assume  that  s  and  t  are  in  the  same  region  of  c.  If  s 
and  t  are  in  the  region  which  does  not  contain  ci,  then 
Cl  is  not  an  liJ-chord.  If  they  are  in  the  same  region 
which  contains  ci,  then  there  is  at  least  one  point  on 
the  edge  e  which  is  not  visible  from  any  Lii-chord  and 
the  polygon  is  not  a  ^-street,  a  contradiction.  Hence, 
c  is  an  Li2-chord  and  the  two  points  s  and  t  are  in  two 
different  regions  of  the  chord  c. 

For  proving  the  second  statement,  assume  that  there 
is  a  landmark  I2  on  a  horizontal  edge  62  in  a  region  R2 
other  than  the  ones  which  contain  s  and  t.  Note  that 
no  horizontal  line  segment  (except  the  chord  c)  whose 
end  points  touch  the  polygon  in  the  region  R2  can  be 
an  Li2-chord.  This  is  because  the  complete  polygonal 
chain  which  constitutes  this  region  belongs  either  to  the 
L  or  to  the  R  chain.  So,  there  is  at  least  one  point  on 
62  which  is  not  visible  from  any  ii?-chord.  This  means 
the  polygon  is  not  a  ^-street,  a  contradiction. 

For  the  third  statement,  consider  the  two  regions 
which  do  not  contain  the  point  s.  We  have  already 
proved  that  t  must  be  in  one  of  these  two  regions.  So, 
t  can  be  hidden  from  all  the  points  of  the  chord  c  only 
through  the  presence  of  a  landmark.  □ 

Lemma  9  Ifc  is  the  chord  of  the  starting  point  s  then 
the  following  holds. 


•  The  point  t  or  some  landmark  in  the  region  of  t  is 
visible  from  some  point  on  c. 

•  The  only  visible  landmarks  are  in  the  region  ofT. 

Proof.  Similar  to  that  of  Lemma  8.  □ 

Now,  we  turn  to  a  description  of  our  algorithm.  We 
assume  that  at  every  position  in  its  path,  the  robot 
can  get  the  visibility  map  of  the  polygon  through  its 
sensors.  The  idea  for  our  strategy  is  inspired  by  the 
strategy,  called  doubling,  for  searching  a  point  on  a  line 
by  Baeza- Yates  et  al  [1].  There,  the  robot  goes  back  and 
forth  on  the  line,  at  each  step  doubling  the  distance  to 
the  start  point,  until  the  target  is  reached. 

Lemma  10  [1]  The  doubling  strategy  for  searching  a 
point  on  a  line  has  a  competitive  factor  of  9,  and  this 
is  optimal. 

For  the  problem  of  finding  a  path  in  a  ^-street,  we 
apply  a  slight  variant  of  the  doubling  strategy.  Stand¬ 
ing  on  a  critical  Li?-chord,  our  robot  performs  doubling 
on  this  chord  until  the  target  becomes  visible  or  a  land¬ 
mark  is  in  sight.  If  at  any  time  during  this  doubling  a 
vertical  wall  is  hit,  the  robot  walks  straight  in  the  other 
direction.  Also,  it  may  happen  that  after  reaching  a  cut 
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Figure  4:  A  step  edge  e,,  a  cut  edge  eg,  and  the  respective  chords. 
Vertex  /  is  a  landmark  with  respect  to  the  chord  of  e^. 


or  step  chord,  the  robot  immediately  sees  the  next  land¬ 
mark  and  it  does  not  execute  doubling  on  this  chord. 
We  do  not  differentiate  between  all  these  variants  of  the 
doubling  algorithm  in  our  strategy  and  call  all  of  them 
doubling. 

The  doubling  strategy  in  [1]  is  formulated  to  search 
for  the  target  only  on  an  integer  grid.  It  is  not  com¬ 
petitive  if  the  distance  to  the  target  may  be  any  real 
number,  because  the  first  step  may  be  to  the  wrong 
direction.  Nevertheless,  we  can  use  doubling  in  our  en¬ 
vironment  of  real  coordinates  if  we  chose  as  length  of 
the  first  step  for  each  doubling  the  distance  of  the  ac¬ 
tual  position  of  the  robot  to  the  nearest  projection  of 
all  visible  vertices  on  the  actual  chord. 

Our  algorithm,  called  doubleX,  is  presented  (in 
pseudo-code)  in  Table  1.  The  robot  walks  on  a  rec¬ 
tilinear  path  and  tries  to  keep  close  to  an  Li-shortest 
path.  Initially,  when  the  robot  is  at  the  point  s,  it  either 
sees  a  landmark  or  the  goal  by  executing  the  doubling 
algorithm  on  the  chord  of  s.  This  is  possible  due  to 
Lemma  9.  So,  initially  the  robot  can  choose  a  region 
depending  on  this  landmark.  At  every  successive  step, 
the  robot  moves  from  one  Li?-chord  defined  by  a  land¬ 
mark  to  another  LiZ-chord  defined  by  the  next  land¬ 
mark.  These  landmarks  can  be  correctly  chosen  due  to 
Lemma  8.  If  there  is  more  than  one  visible  landmark 
then  the  robot  is  free  to  chose  one  of  them.  In  this 
process,  ultimately  the  goal  is  reached. 

We  need  the  following  lemma  before  proving  the  fea^ 
sibility  of  step  10  of  the  algorithm. 

Lemma  11  If  a  landmark  I  is  visible  from  the  present 
position,  the  robot  can  reach  the  Li-nearest  point  on  the 
chord  of  I  by  an  xy-monotone,  i.e.  Li-shortest,  path. 


Proof.  We  assume  w.l.o.g.  that  the  landmark  /  is  on 
the  right  chain  R  and  to  the  right  and  above  the  present 
position  p  of  the  robot,  see  Figure  5.  The  other  cases 
are  similar. 

The  robot  goes  vertically  upwards  from  its  present 
position  until  it  reaches  the  chord  of  the  landmark  I 
(then  we  are  done),  or  I  becomes  invisible.  The  visibility 
ray  to  I  can  only  be  obstructed  by  a  reflex  vertex  of  L. 
At  this  point,  the  robot  starts  going  to  the  right  until  it 
is  vertically  below  a  point  of  the  chord  oil,  or  I  becomes 
invisible  again  (whatever  comes  first). 

This  process  is  repeated  until  the  chord  of  /  is  reached, 
see  the  thick  line  in  Figure  5.  Clearly,  the  resulting  path 
is  a;J^monotone. 

The  reached  point  on  the  chord  of  /  either  is  vertically 
above  p,  or  has  an  z-coordinate  equal  to  the  mciximum 
x-coordinate  of  the  subchain  of  L  between  the  chord 
of  p  and  the  chord  of  /.  This  shows  that  we  reach  the 
chord  of  /  at  the  Li-nearest  point  from  p.  □ 

It  is  easy  to  see  that  if  t  is  visible  from  the  present 
position  of  the  robot,  it  can  reach  f  in  a  similar  way,  so 
that  step  15  of  the  algorithm  is  feasible. 

Lemma  12  Starting  from  the  point  s,  the  robot  cor¬ 
rectly  reaches  the  point  t  by  executing  procedure  dou¬ 
bleX. 

Proof.  Initially,  there  are  two  possibilities  at  the  point 
s.  If  the  robot  sees  the  point  t,  it  can  go  to  t  by  a 
method  similar  to  that  in  Lemma  1 1 .  Otherwise,  it  finds 
a  landmark  according  to  Lemma  9.  From  Lemma  5, 
the  path  should  cross  the  chord  which  peisses  through 
this  landmark.  So,  initially,  the  robot  chooses  a  correct 
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1  Procedure  doubleX 

2  s  :  the  starting  point. 

3  t  :  the  target  point. 

4  m  :  a  chord. 

5  begin 

6  Let  m  be  the  chord  of  s. 

7  Execute  doubling  on  m  until  a  landmark  /  or  the  target  t  is  visible. 

8  while  {t  is  not  visible)  do 

9  Let  m  be  the  chord  of  /. 

10  Go  to  the  Lj-nearest  point  of  m  on  an  xy-monotone  path. 

11  Determine  the  region  R  of  m  where  we  come  from. 

12  Execute  doubling  on  m 

13  until  a  landmark  /  or  the  target  t  is  visible  in  a  region  other  than  R. 

14  end  while 

15  Go  to  t  on  an  xy-monotone  path. 

16  end 


Table  1:  The  strategy  for  rectilinear  ^-streets. 


vertical  direction.  In  every  subsequent  step,  the  robot 
reaches  a  critical  chord.  While  executing  the  doubling 
algorithm  at  each  such  chord,  it  is  always  ensured  from 
Lemma  8  that  either  the  next  landmark  or  the  point  t 
is  found.  As  soon  as  the  point  t  is  visible,  the  robot  can 
go  to  t  by  a  xy-monotone  path  as  mentioned  before.  □ 

We  now  estimate  the  length  of  the  path  generated  by 
the  robot. 

Definition  13  The  reduced  path  is  a  rectilinear  path 
from  s  to  t  which  only  includes  from  the  robot’s  path 

•  the  xy-monotone  motion  from  one  critical  chord  to 
the  next,  and 

•  the  horizontal  step  from  the  point  where  a  critical 
chord  is  first  reached  to  the  point  where  it  is  left. 

In  other  words,  for  the  reduced  path,  we  do  not  con¬ 
sider  the  extra  movements  generated  due  to  the  execu¬ 
tion  of  the  doubling  algorithm. 

Lemma  14  The  reduced  path  is  an  Li-shortest  path 
from  s  tot. 

Proof.  Along  its  way  from  s  to  t,  the  robot  stops  at  a 
sequence  of  critical  chords  cq  , . . . ,  Cj , . . . ,  c* ,  where  cq  is 
the  chord  of  s,  and  cj,  is  the  last  chord  the  robot  crosses 
before  reaching  t. 

By  induction,  assume  that  the  path  is  a  shortest  path 
upto  the  chord  Cf.  Clearly,  this  is  true  for  i  =  0.  Now, 


we  have  to  prove  that  the  path  up  to  c,-+i  is  shortest. 
If  i  =  0  or  if  the  reduced  path  from  Cj-i  to  c,+i  is  xy- 
monotone,  this  follows  from  the  induction  hypothesis. 

Otherwise,  we  have  two  possible  cases.  If  the  reduced 
path  changes  its  vertical  direction  at  c<  (“up”  from  Cj_i 
to  Cj  and  “down”  from  c,’  to  Cj+i  or  vice  versa)  then  c,- 
is  a  cut  chord,  and  every  shortest  path  from  s  to  t  goes 
along  the  edge  of  cj . 

If,  on  the  other  hand,  the  horizontal  direction 
changes,  then  the  path  from  c,-_i  to  c,-  must  touch  a 
vertical  edge  at  an  extreme  position  such  that  every 
shortest  path  also  touches  that  edge,  since  otherwise 
the  robot’s  path,  from  c,_i,  would  not  reach  the  Lj- 
nearest  point  on  cj. 

From  this  we  conclude  that  the  reduced  path  is  short¬ 
est  up  to  t. 

Theorem  15  Procedure  doubleX  achieves  a  competi¬ 
tive  factor  of  9  in  the  Li  metric,  and  this  is  optimal. 

Proof.  From  Lemma  14,  the  reduced  path  of  the  robot 
is  a  shortest  path.  Compared  to  a  reduced  path,  the 
only  extra  path  segments  the  robot  traverses  are  the 
segments  generated  due  to  doubling  on  a  step  or  cut 
chord.  Suppose,  for  such  a  chord  c,-,  the  robot  reaches 
and  leaves  c,-  at  the  points  pi  and  p2  respectively.  From 
Lemma  10,  the  robot  traverses  at  most  9  times  the 
length  of  the  segment  piP2  due  to  doubling.  Hence, 
the  claim  follows. 
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Figure  5:  An  sy-monctone  step  from  one  iij-chord  to  another. 


On  the  other  hand,  no  better  competitive  factor  can 
be  achieved.  Figure  6  shows  how  we  can  produce  a  Q- 
street  with  many  caves  at  integer  positions  where  t  can 
be  in  any  of  the  caves,  such  that  searching  for  t  in  such 
a  ^-street  is  essentially  equivalent  to  searching  a  point 
on  a  line,  for  which  [1]  proves  a  lower  bound  of  9.  □ 

The  lower  bound  of  9  also  applies  if  we  compare  the 
robot’s  path  to  the  i/2-shortest  path.  But  what  is  the 
competitive  factor  of  our  strategy  in  this  case?  Any 
Ti-shortest  path  is  at  most  times  longer  than  the 
corresponding  1,2-shortest  path.  From  this  one  can  con¬ 
clude  that  our  strategy  doubleX  achieves  a  competitive 
factor  of  9V2  (<  12.73)  in  the  L2  metric.  But  with  a 
closer  look  to  the  details  of  our  strategy,  one  can  prove 
a  much  better  bound. 

Theorem  16  Procedure  doubleX  achieves  a  compeii- 
tive  ratio  o/9.06  in  the  L2  metric. 

Proof.  We  consider  two  reflex  vertices  pi  and  p2  of 
the  robots  path  such  that  the  reduced  path  from  pi  to 
P2  is  ary-monotone  but  the  reduced  path  from  pi  to  the 
critical  chord  after  p2  (if  it  exists)  is  not  ary-monotone, 
as  well  as  the  reduced  path  from  the  preceding  chord  of 
Pi  top2- 

Let  ari  be  the  x-distance  between  pi  and  p2  and  yi 
the  {/-distance  (such  that  the  reduced  path  from  pi  to 
P2  has  length  x  -f  y). 

The  length  of  the  L2-shortest  path  from  pi  to  p2  is  at 
least  y/xl  +  y\.  From  the  proof  of  Theorem  15  we  know 
that  the  length  of  the  robot’s  path  is  at  most  9xi  -f  yi. 
We  may  assume  that  yi  ^  0  and  substitute  is  =  xi/yi. 


The  competitive  factor  must  be  less  than  the  maximum 
value  of 

9xi  -b  yi  _  9in  -H 
\/x\  H-  y\  +  1 

for  all  possible  values  of  w.  A  simple  analysis  shows 
that  the  maximum  is  reached  at  u;  =  9  at  which 

v/82  <  9.06 

is  the  maximum  value.  The  claim  of  the  theorem  follows 
by  extending  the  analysis  to  every  pair  of  such  chords 
and  from  the  fact  that  the  L2"shortest  path  must  also 
visit  the  points  pi  and  p2-  ^ 


4  Conclusion 

For  the  rectilinear  case,  we  have  extended  the  class  of 
polygons  for  which  competitive  strategies  are  known  for 
a  robot,  starting  at  a  vertex  s,  to  find  a  path  to  a  vertex 
t  with  the  help  of  the  visibility  map.  Our  strategy  dou¬ 
bleX  achieves  an  optimal  competitive  factor  of  9  in  the 
Li  metric,  and  a  factor  of  9.06  in  the  L2'rnetric,  which 
is  very  close  to  optimal. 

It  is  an  interesting  open  question  whether  there  is,  in 
the  rectilinear  case,  a  still  larger  class  of  polygons  which 
can  be  searched  competitively,  and  in  the  general  case,  if 
the  class  of  ^-streets  or  another  class  of  polygons  which 
strictly  includes  streets  permits  a  competitive  search. 
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t  can  be  in  any  of  the  caves. 


JT_JLji_ji_JLJT_h.JLJL 


s 


Figure  6:  Searching  in  a  ^-street  is  at  least  as  difficult  as  searching  a  point  on  a  line. 
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The  Tight  Lower  Bound  for  the  Steiner  Ratio  in  Minkowski  Planes 

Biao  Gao*  Ding-Zhu  Du*  Ronald  L.  Graham^ 


Abstract 

A  minimum  Steiner  tree  for  a  given  set  X  of  points 
is  a  network  interconnecting  the  points  of  X  having 
minimum  possible  total  length.  The  Steiner  ratio  for 
a  metric  space  is  the  largest  lower  bound  for  the  ra¬ 
tio  of  lengths  between  a  minimum  Steiner  tree  and  a 
minimum  spanning  tree  on  the  same  set  of  points  in 
the  metric  space.  In  this  note,  we  show  that  for  any 
Minkowski  plane,  the  Steiner  ratio  is  at  least  2/3.  This 
settles  a  conjecture  of  D.  Cieslik,  and  also  Du  et  ah. 

I  Introduction 

Given  a  compact,  convex,  centrally  symmetric  domain 
D  in  the  Euclidean  plane  one  can  define  a  norm 

II  •  \\d  ■  R  by  setting  ||x||£)  =  A  where  x  =  Xu 

and  u  €  dD,  the  boundary  of  D.  We  can  then  define 
a  metric  do  on  by  taking 

dD{x,y)  =  ||s  -  y||u. 

Thus,  dD  =  {x  I  ||x||i3  =  1}.  The  resulting  met¬ 
ric  space  M  =  M{D)  =  {E"^,dD)  is  often  called  a 
Minkowski  or  normed  plane  with  unit  disc  D.  We 
will  usually  suppress  the  explicit  dependence  of  vari¬ 
ous  quantities  on  D.  For  a  finite  subset  X  C  E'^,  a, 
minimum  spanning  tree  S  =  5(A')  consists  of  a  col¬ 
lection  of  segments  AB  with  A,  B  £  X ,  which  spans 
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all  the  points  of  X,  and  such  that  the  sum  of  all  the 
lengths  ||A5||£(  is  a  minimum.  We  denote  this  mini¬ 
mum  sum  by  Lm{X).  Further,  we  define 

Ls{X)  =  \ni^LM{Y) 

where  Y  ranges  over  all  finite  subsets  of  E"^  containing 
X .  It  is  not  hard  to  show  that  there  always  exists  X'  D 
X  with  lAT'l  <  2|A:|  -  2  having  Ls(X)  =  Lm(X'). 
When  equality  holds  we  say  that  the  Steiner  tree  T(X) 
(=  S(X'))  is  a  full  Steiner  tree  for  X.  The  minimum 
spanning  tree  5(y)  will  be  called  a  minimum  Steiner 
tree  T{X)  for  X.  The  points  of  Y \Ar  are  usually  called 
Steiner  points  of  T(A’);  the  points  of  X  are  known  as 
regular  points  of  T{X). 

Minimum  Steiner  trees  have  been  the  subject  of 
extensive  investigations  during  the  past  25  years  or 
so  (see[4,  11,  16,  9]).  Most  of  this  research  has  dealt 
with  the  Euclidean  metric,  with  much  of  the  remain¬ 
ing  work  dealt  with  the  L\  metric,  or  more  generally, 
the  usual  Lp  metric  or  norm  (see  [6,  3]).  It  has  been 
shown,  for  example,  that  the  determination  of  Ls{X) 
in  general  is  an  NP-complete  problem,  both  for  the 
Euclidean  as  well  as  the  Li  case  (cf.  [10],  [9]). 

In  this  note,  we  study  the  Steiner  ratio  p{D)  for 
M{D),  defined  by 


.  Ls{X) 


Thus,  p{D)  is  a  measure  of  how  much  the  total  length 
of  a  minimum  spanning  tree  can  be  decreased  by  allow¬ 
ing  additional  (Steiner)  points.  It  is  known  that  for  Ly 
metric  (so  that  D  is  the  square  with  vertices  (±1,0), 
(0,±1)),  p{D)  =  2/3  [13]  and  for  the  Euclidean  (or 
Lf)  metric,  p{D)  =  \/3/2  [7].  More  recently,  Cieslik 
[3]  and  Du,  Gao,  Graham,  Liu  and  Wan  [6]  indepen¬ 
dently  conjectured  that  for  any  normed  plane, 


2/3  <  p{D)  <  v^/2. 


Cieslik  [3]  showed  that  for  any  normed  plane, 

0.612  <  p{D)  <  0.9036 

while  Du  et  al.  [6]  proved  that  for  any  normed  plane, 
0.623  <  p{D)  <  0.8686 

We  will  prove  here  that  for  any  normed  plane, 
p{D)  >  2/3. 

Since  the  Li  metric  has  p  =  .2/3  then  this  inequality 
is  therefore  best  possible. 

For  prior  results  on  minimum  Steiner  trees  in 
normed  planes,  the  reader  should  consult  [2],  [8],  [1], 

[17]  and  [19].  This  note  is  organized  in  the  following 
way.  In  Section  2,  fundamental  properties  of  mini¬ 
mum  Steiner  trees  are  presented.  In  Section  3,  the 
main  result  is  proved. 

2  Preliminaries 

A  minimum  Steiner  tree  is  full  if  every  regular  point 
is  a  leaf  (i.e.,  has  degree  one).  The  following  lemma 
states  an  important  property  of  full  minimum  Steiner 
trees,  which  can  be  found  in  [6] 

Lemma  1  Suppose  that  dD  is  differentiable  and 
strictly  convex.  Then  every  full  Steiner  minimum  tree 
consists  of  three  sets  of  parallel  segments. 

A  tree  is  called  a  3-regular  tree  if  every  vertex 
which  is  not  a  leaf  has  degree  three.  A  consequence  of 
Lemma  1  is  that  for  strictly  convex  and  differentiable 
norms,  every  minimum  Steiner  tree  is  a  3-regular  tree. 

Another  consequence  of  Lemma  1  is  the  following 
result.  A  proof  can  be  found  in  [5]. 

Lemma  2  For  strictly  convex  and  differentiable 
norms,  every  full  minimum  Steiner  tree  on  more  than 
three  points  must  have  at  least  one  of  the  local  struc¬ 
tures  shown  in  Figure  1. 

Consider  a  full  minimum  Steiner  tree  T  in  a  plane 
with  a  strictly  convex  and  differentiable  norm.  Two 
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(a)  (b) 


Figure  1:  Local  structures  in  full  minimum  Steiner 
trees 


regular  points  are  called  adjacent  if  one  can  be  reached 
from  the  other  by  always  moving  in  a  clockwise  direc¬ 
tion  or  always  moving  in  a  counterclockwise  direction. 
Clearly,  each  regular  point  has  two  other  adjacent  reg¬ 
ular  points. 

We  can  form  a  polygon  G,  called  the  characteristic 
polygon  of  T,  by  joining  each  pair  of  adjacent  regular 
points  with  a  straight  line  segment.  Any  spanning 
tree  lying  inside  G  is  called  an  inner  spanning  tree.  A 
minimum  inner  spanning  tree  is  one  having  the  least 
possible  total  length.  A  point  set  P  is  called  critical 
if  there  is  a  minimum  Steiner  tree  T  for  P  such  that, 
the  union  of  the  minimum  inner  spanning  trees  (with 
respect  to  T)  for  P  divides  the  characteristic  polygon 
G  =  G{T)  into  equilateral  triangles.  The  vertices  of 
these  equilateral  triangles  (which  we  will  call  lattice 
points)  lie  on  a  triangular  lattice  in  the  normed  plane. 

Since  similar  sets  have  the  same  ratios  of  minimum 


Steiner  tree  and  minimum  spanning  tree  lengths,  we 
need  only  consider  critical  sets  having  equilateral  tri¬ 
angles  with  unit  edge  length.  Clearly,  for  any  critical 
set,  a  minimum  inner  spanning  tree  is  in  fact  a  min¬ 
imum  spanning  tree;  its  length  is  just  n  —  1  where  n 
is  the  number  of  its  (regular)  vertices.  Note  that  any 
two  adjacent  regular  points  have  mutual  distance  1. 


Define 


,n^  ■  ^s{P) 

Pn{D)  mm  -- — — . 

|P|=n  Lm(^) 


If  pn-1  >  Pn,  then  n  is  called  &  jump  value.  In  [7],  Du 


and  Hwang  prove  the  following. 


Lemma  3  In  a  plane  with  a  strictly  convex  and  dif¬ 
ferentiable  norm,  if  n  is  a  jump  value  then  p„  is 
achieved  by  some  critical  set. 


Remark:  The  proof  of  Du  and  Hwang  for  Gilbert- 
Pollak  conjecture  used  a  contradition  argument.  In 


their  argument,  n  is  assumed  to  be  the  smallest  nat¬ 
ural  numder  such  that  a  counterexample  of  n  points 
exists  for  the  Gilbert-Pollak  conjecture.  For  this  n, 
they  proved  that  p  is  achieved  by  some  critical  set 
and  then  showed  that  the  Gilbert-Pollak  conjecture 
holds  for  every  critical  set.  Actually,  if  assume  that  n 
is  a  jump  value,  then  the  argument  of  Du  and  Hwang 
still  holds.  Thus,  we  have  the  above  lemma. 

3  The  Main  Result 

Theorem  1  For  any  convex  and  centrally  symmetric 
D, 

PiD)>l. 

Moreover,  if  pk{D)  —  2/3  for  some  k,  then  k  —  4.  and 
dD  is  a  parallelogram. 

Proof.  To  begin  with,  we  first  assume  that  the  bound¬ 
ary  dD  of  unit  disc  D  is  strictly  convex  and  differen¬ 
tiable.  Thus,  we  can  apply  the  results  of  the  preceding 
section. 

Assume  that  the  theorem  is  false.  Let  n  denote  the 
least  value  so  that  Pn{D)  <  2/3.  Thus,  n  is  a  jump 
value.  By  Lemma  3  there  exists  a  critical  set  P  of  size 
n  such  that 

Ls{P)  2 

Lm{P)  3' 

that  is, 

Ls{P)<\LM{P)  =  \{n-l).  (1) 

Let  T  be  a  minimum  Steiner  tree  on  P  which  witnesses 
the  criticality  of  P.  We  first  establish  some  properties 
of  T. 

Lemma  4  T  is  a  full  Steiner  tree  and  every  edge  of 
T  has  length  less  than  2/3. 

Proof.  If  T  is  not  a  full  Steiner  tree,  then  we  can 
decompose  it  into  two  edge-disjoint  subtrees  Ti  and 
T2  which  are  Steiner  trees  on  point  sets  Pi  and  P2, 
respectively,  where  Pi  U  P2  =  P  and  each  Pj  has  size 
less  than  n.  Thus,  by  the  minilality  of  n, 

Ls{P)  =  Ls{Pi)  +  Ls{P2)>\lm{Pi)  +  \lm{P2) 

>  -^Lm{P), 


contradicting  (1). 

If  T  has  some  edge  e  of  length  at  least  2/3,  then 
by  removing  e,  we  are  left  with  two  vertex-disjoint 
subtrees  Tj  and  T^.  Clearly,  Ti  and  T2  are  Steiner 
trees  on  disjoint  subsets  Pi  and  P2,  respectively,  where 
Pi  U  P2  =  P.  It  follows  that 

Ls{P)  >  Ls{Pi)PLs{P2)Pi{e) 

>  |(Lm(Fi)  +  Lm(P2)  +  1) 

again  contradicting  (1),  where  in  general  we  will  let 
^{T)  denote  the  total  length  (under  D)  of  any  graph 
T  (such  as  an  edge,  path  or  tree).  □ 

Lemma  5  Suppose  Ti  is  a  3-regular  subtree  of  T 
which  has  f  leaves.  Then 

^(Ti)<|(/-1). 

Proof.  Assume  that 

^m)>|(/-i) 

for  some  subtree  Ti  and  suppose  that  Ti  has  r  leaves 
which  are  regular  points.  Then  the  removal  of  Ti  re¬ 
sults  in  f  —  r  subtrees.  Suppose  that  they  interconnect 
sets  of  ni,  n2,  •  •  •,  n/-r  regular  points,  respectively. 
Then  ui  -f  712  -f  ■  •  •  -h  n/-r  —  n  —  r  and 

Ls{P) 

—  i(”l  ~  1)  T  1(^2  —  1)  -h  ■  •  ■  -t-  ^(uf-r  “  1)  +  ^(Ti  ) 

>  i('^-/)  +  i(/- 1) 

=  l(n-l), 

which  contradicts  (1).  □ 

Let  us  call  a  path  A5i52  ■  •  ■  SiB  joining  two  adja¬ 
cent  regular  points  A  and  B  in  T  monotone  if  it  is 
either  a  clockwise  or  counterclockwise  path  from  A  to 
B.  We  will  say  that  Si  can  be  legally  moved  to  A  if 
7  >  3  and  the  subpath  5i5253  can  be  removed  from  T 
(disconnecting  it  into  three  subtrees)  and  replaced  liy 
a  parallel  translate  with  S[  located  at  point 

A  so  that  5(5353  intersects  S3  ■  ■  -  SiB.  Thus,  the  two 
subtrees  containing  A  and  B,  respectively,  are  recon¬ 
nected  by  5(5353. 
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Lemma  6  Let  AS1S2  ■  ■  SiB  be  a  monotone  path  in 
T  connecting  two  regular  points  A  and  B.  Suppose 
that  Si  cannot  be  legally  moved  to  A.  Draw  a  line 
through  B,  parallel  to  ASi,  and  intersecting  the  sub¬ 
path  5i525'3  at  B' .  Then 

£iASiS2S3)  +  £{8283)  -  e{BB'S2)  >  1.  (2) 

Proof.  Since  Si  cannot  be  legally  moved  to  A,  we  have 
£{BB')  <  £{ASi).  If  B'  is  on  the  segment  S1S2  (see 
Figure  2(a)),  then 

{£{ASi)  -  £{BB'))  +  e{SiB’)  >  £{AB)  =  1, 
that  is, 

£{ASiS2)-£{BB'S2)  >  1. 

Thus,  (2)  holds. 


Figure  2; 

On  the  other  hand,  if  B'  is  on  the  segment  S2S3 
(see  Figure  2(b)),  then 

i£iASi)  -  £iBB'))  +  £{SiS2B')  >  £iAB)  =  1, 
that  is, 

£iASiS2B')  +  £{S2B')  -  £{BB'S2)  >  1. 

Thus,  (2)  also  holds  in  this  case,  and  the  lemma  is 
proved.  □ 

It  is  easy  to  see  that  (2)  still  holds  if  £{ASi)  = 
£{BB'). 

Lemma  7  Suppose  Si  is  a  Steiner  point  in  T  adja¬ 
cent  to  two  regular  points  A  and  B.  Then  Si  can  be 
legally  moved  to  exactly  one  of  A  or  B. 

Proof  Let  52  be  the  Steiner  point  adjacent  to  5i  and 
let  S3  and  S4  be  the  two  vertices  adjacent  to  82- 


Suppose  that  Si  can  be  legally  moved  to  both  A  and 
B.  Then  from  these  two  movements,  we  can  obtain  a 
tree  of  total  length  at  most  £{T)  -\-£{SiS2),  which  can 
be  decomposed  at  A  and  B  (see  Figure  3).  Thus, 


Figure  3: 


f(T)  +  f(5i52)  >  H(„_2)  +  f(T5iS). 

By  Lemma  5, 

£{ASiB)-\-£{SiS2)  <  1. 

Therefore, 

£iT)  >  lin-2)-^£{ASiB)-£iSiS2) 

>  ^{n-4)  +  2£{ASiB) 

contradicting  (1). 

Suppose  now  that  Si  cannot  be  legally  moved  to 
either  A  or  B.  Let  C  be  the  regular  point  adjacent  to 

A,  other  than  B,  and  D  the  regular  point  adjacent  to 

B,  other  than  A.  By  Lemma  6, 

^(A5i5253)  +  ^(5253)  -  £{CC'S2)  >1.  (3) 
£iBSiS2S4)  +  £{8284)  -  £{DD' S2)  >1,  (4) 

where  C'  and  D'  are  two  points  defined  in  the  lemma. 
Let  T'  be  the  3-regular  subtree  interconnecting  A,  B, 
S3  and  54.  Adding  (3)  and  (4),  we  obtain 

2£{r)  -  £{ASiB)  -  £{CC'S2D'D)  >  2, 

that  is, 

£{T')  >  1  +  ^{£[ASiB)a£{CC'S2D’D))  >  2, 
contradicting  Lemma  5.  □ 
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We  now  complete  the  proof  of  the  first  part  in  the 
theorem.  By  Lemma  2,  there  are  two  possible  local 
structures  we  need  to  consider.  We  first  consider  the 
local  structure  shown  in  Figure  1(b).  Then  there  exists 
a  Steiner  point  S3  adjacent  to  two  Steiner  points  Si 
and  S2  each  of  which  is  adjacent  to  two  regular  points, 
say  A  and  B  are  adjacent  to  Si,  and  C  and  D  are  ad¬ 
jacent  to  52.  Let  54  be  a  vertex  adjacent  to  S3.  (See 
Figure  4.)  We  first  observe  that  if  £{BSi)  —  £{CS2) 


Let  T'  be  the  3- regular  subtree  interconnecting  A,  B, 
S4  and  52.  By  (5)  and  (6), 

£(T')  =  £{ASiB)  +  £{SiS3S2)  +  ^(5354)  >  2, 
contradicting  Lemma  5. 

Next,  we  consider  the  local  structure  shown  in  Fig¬ 
ure  1(a),  i.e.,  there  exists  a  Steiner  point  S2  adjacent 
to  a  Steiner  point  5i  and  a  regular  point  C  such  that 
5i  is  adjacent  to  two  regular  points  A  and  B.  Let  53 


Figure  4: 

then,  whether  or  not  5i  can  be  legally  moved  to  A,  we 
obtain  a  contradiction  by  using  the  argument  given  in 
the  proof  of  Lemma  7.  Thus,  without  loss  of  general¬ 
ity,  we  can  assume  that  £{BSi)  >  £{CS2),  i.e.,  5i  can¬ 
not  be  legally  moved  to  B.  Then  by  Lemma  6,  5i  can 
be  legally  moved  to  A  (see  Figure  4).  This  movement 
results  in  a  tree  of  length  at  most  ^(T)  4-^(5354),  which 
can  be  decomposed  at  A  into  the  subtree  ASiB  and 
a  subtree  interconnecting  n  —  1  regular  points  other 
than  B.  Thus, 

^(r)  +  ^(5354)  >  ^(n-2)+£(A5iS)  >  |(n-l)4-i. 

Since  £{T)  <  |(n  —  1),  we  have 

^(5354)  >  ^.  (5) 

Moreover,  by  Lemma  4,  £{BSi)  <  |  and  £{DS2)  <  |. 
It  follows  that 

£{CS2)  =  £{CS2D)  -  £{DS2)  > 

Note  that  by  Lemma  6, 

^(B5i5352)-^(C52)  >  1. 

Thus, 

^(5i5352)>  1  +  ^(C52)-^(B5i)>|  (6) 


Figure  5: 

be  the  vertex  adjacent  to  S2,  other  than  C  and  5i. 
We  claim  that 

£{S2S3)<e{BSi).  (7) 

In  fact,  if  £(S2S3)  >  £(BSi),  then  considering  the  3- 
regular  subtree  T',  interconnecting  A,  B,  C  and  S3, 
we  would  have 

£{T')  >  e{BSiS2C)  -f  £{ASiB)  >  2, 

contradicting  Lemma  5.  Now,  let  E  be  the  adjacent 
regular  point  of  A  other  than  B  and  let  yl5i  •  •  •  Sk  E 
be  the  monotone  path  connecting  A  and  E.  From 
the  definition  of  a  critical  set,  it  is  easy  to  see  that 
£(AE)  =  1.  Let  B,  Ei,  E2,  ■■■,  E5  be  all  the  lattice 
points  with  distance  exactly  one  to  A  (see  Figure  5). 
Then  E  £  {Ei,---,E5}.  Since  £{AC)  <  £{AB)  + 
1{BC)  =  2,  C  is  identical  to  either  Ei  or  a  lattice 
point  which  forms  an  equilateral  triangle  with  B  and 
El  (see  Figure  5). 

Suppose  that  E'  is  a  point  on  the  path  S1S2S3  such 
that  EE'  is  parallel  to  j45i.  If  E  is  at  Ei,  then  E' 
must  be  on  S2S3  and  £{S2E')  =  £{BSi).  It  follows 
that 

£iS2S3)>£iS2E')^£{BSi), 
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contradicting  (7).  The  similar  argument  can  be  ap¬ 
plied  to  the  case  that  E  is  at  E2. 


If  E  is  at  E3,  then  it  is  easy  to  see  k  >  4.  Let 
if"  be  a  point  on  S2S3S4  such  that  EE"  is  parallel  to 
SiS2-  Extend  BSi  to  F  so  that  EF  is  parallel  to  yl5i. 
Since  £{BE)  =  2£{BA),  we  have  £{EF)  =  2£{ASi) 
and  £{SiF)  =  £(BSi).  Let  F'  be  a. point  on  the  path 
S2S3S4  so  that  F F'  is  parallel  to  SiS2-  If  F'  is  on  the 
segment  S2S3,  then 

£{S2S3)>£iSiF)  =  £{BSi), 

contradicting  (7).  If  F'  is  on  the  segment  53S4,  then 

iiSsSi)  >  £iEF)  =  2£{ASi) 

>  2{£{AB)-£{ASi))>2{l-'^)  =  ^, 

contradicting  Lemma  4. 


If  E  is  at  E4  or  E5 ,  then  the  extension  of  Si  A  must 
intersect  the  monotone  path  ASi  ■  ■  -  SkE.  This  im¬ 
plies  that  any  line  between  and  S3S4  and  parallel 
to  them  must  intersect  the  path  ■■  -  SkE.  Draw 
the  parallelograms  S1S2S3H  and  ASiHG  and  extend 
HG  until  it  intersects  the  path  ASi  ■  ■  -  SkE,  say  at  F. 
{AG  cannot  intersect  the  path  TSi  ■■  -  SkE  since  oth¬ 
erwise,  removing  S2S3  and  adding  AG  would  result  in 
a  tree  of  length  at  most  £{T)  which  does  not  satisfy 
the  condition  in  Lemma  1  at  the  intersection  of  AG 
and  the  path  yl5i  •  •  -  SkE).  Then  FHS3S4  is  also  a 
parallelogram.  It  is  easy  to  see  that 

£{GF)  <£{8334)  -£iASi). 

Let  T'  be  the  3-regular  subtree  interconnect¬ 
ing  A,  B,  C  and  S3.  Consider  the  tree 
{T\T')UAGF  which  interconnects  n—2  regular  points. 
Then, 

£{T)  -  £{T')  +  £{AGF)  >  ^(n  -  3). 

o 


Figure  7: 


Moreover, 

£{T')  -  £{AGF) 

>  £{T')  -  (^(5253)  +  ^(5354)  -  ^(.45i)) 

>  £(TSi5)-bf(T5i52C)-£(5354) 


Therefore, 

m  >  \{n  -  3)  -f  i{T')  -  £{AGF)  >  ^(n  -  1), 

contradicting  (1).  This  completes  the  proof  of  the  first 
part  of  the  theorem  for  strictly  convex  and  differen¬ 
tiable  dD. 

When  dD  is  not  strictly  convex  or  not  differen¬ 
tiable,  we  can  use  a  sequence  of  strictly  convex  and 
differentiable  ones  to  approach  it  from  its  interior.  For 
each  norm  in  the  sequence  and  for  any  point  set  P,  we 
know  that 

Ls{P)>\lm{P).  (8) 

Since  Ls{P)  and  Lm{P)  are  continuous  functions  with 
respect  to  the  norm  for  fixed  P,  then  letting  the  se¬ 
quence  go  to  its  limit,  we  see  that  (8)  holds  for  the 
(arbitrary)  limiting  norm.  This  completes  the  proof 
of  the  first  part  of  the  theorem. 

Next,  we  show  the  second  part  of  the  theorem.  Be¬ 
fore  doing  so,  we  establish  three  lemmas. 

Lemma  8  Let  A1A2  ■  ■  ■  An  be  a  path  and  let  OBij 
be  the  unit  vector  starting  from  the  origin  O  along 
directions  AiAj.  If  £{AiA2  ■  ■  ■  An)  =  £{AiAn),  then 
the  straight-line  segment  Bi2-Sn-i,n  part  of  dD. 

Proof.  We  prove  that  all  Bij ,  i  <  j,  are  on  the  same 
straight  line.  The  lemma  is  a  consequence  of  this  fact. 

First,  consider  n  =  3.  Draw  the  parallelo¬ 

gram  A1A2A3B.  Without  lo.ss  of  generality,  assume 
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£{AiA2)  >  £{AiB)  =  £{A2Az).  Let  C  be  a  point 
on  A1A2  such  that  £(AiC)  =  £(AiB).  Let  E  be  the 
intersection  point  of  BC  and  A\Az-  Draw  line  A2H 
parallel  to  BC  and  intersecting  A1A3  at  H  (see  Figure 
8).  Then 

£(A^E)=£{HA3) 

and 

Thus, 

£iAiA3)  =  £(AiH)  +  £(HA3) 

Since 

£{AiA3)  =  ^(711^2^13)  =  ^(^2^3)(1  + 
we  have 

£{AiE)  =  ^(^2^3)  =  i{AiB)  =  £{AiC). 

This  means  that  quadrilateral  A\CEB  is  similar  to 
quadrilateral  OBnBizBzz-  Therefore,  B12,  B23  and 
Bi3  are  collinear.  In  addition,  B13  lies  between  B\2 
and  B23- 


Next,  consider  the  case  n  =  4.  Note  that 
£(^1712^3^4)  =  £(^1^4)  implies  that  £(Aiyl2j44)  = 
£(yiiT4)  because 

£(j4i ^42.43714)  >  £(4.i4.2)  +£(4.2^4)  >  £(4.1^4). 

From  the  case  n  —  3,  S14  is  in  the  segment  [5i2,  S24]- 
Similarly,  5i4  is  in  the  segment  [-813,534],  B13  is  in 
the  segment  [512,823],  and  824  is  in  the  segment 
[823,834]  (see  Figure  9).  Note  that  all  8,y  ’s  are  on 
do,  a  boundary  of  a  convex  region.  Moreover,  for  any 


=23 


Figure  9: 


i,  j  and  k,  Bij,  Bjk,  and  Bik  are  either  all  distinct  or 
all  identical.  It  follows  that  all  Bij  for  1  <  i  <  j  <  4 
are  collinear. 

Now  consider  n  >  4.  Note  that  £(4i42  •  •  -  An)  = 
£(4i4„)  implies  that  for  4  <  j  <  n,  £(4i42434j)  = 
£(4i4j)  and  ioi  3  <  j  <  k  <  n,  £{AiA2AjAk)  = 
£(4i4i;).  Therefore,  for  4  <  j  <  n, 

812,  823,  813,  5ij,  821  and  Bzj  are  collinear 

and  for  3  <  j  <  ^  <  n, 

Bi2,Bij,B2j,Bik,B2k  and  Bjk  are  collinear. 

It  follows  that  all  Bij  for  1  <  i  <  j  <  n  are  collinear. 

o 

Lemma  9  P4{D)  =  2/3  iffdD  is  a  parallelogram.. 
Proof.  Note  that  for  any  D, 

P3{D)  >  3/4. 

Suppose  Pa{D)  =  2/3.  Thus,  4  is  a  jump  value.  Con¬ 
sider  F  =  {(4,  5,  (7,8)  I  Lm{A,B,C,E)  :=  1).  Since 
Lm{A,  B,C,  E)  is  continuous  with  respect  to  4,  8, 
C  and  8,  then  8  is  a  compact  set  in  8-dimensional 
space.  Clearly, 

P4(8)=  inf  L5(4,8,C’,8). 
(A,B,C,E)^F 

Since  Ls{A,  8,  C,  E)  is  also  continuous  with  respect  to 
4,  8,  C,  and  8,  there  exists  a  point  set  {4,  8,  C,  8} 
such  that 

2/3  =  P4(8)  =  LsiA,  8,  C,  E)ILm{A,  8,  C,  8).  (9) 

Note  that  the  minimum  Steiner  tree  T  for  this  point 
set  must  be  full  because  4  is  a  jump  value.  Suppose 
that  4,  8,  C  and  8  are  arranged  in  the  order  as  shown 
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in  Figure  10.  Let  pxY  denote  the  path  from  X  to  Y 
in  T.  We  claim  that  the 


^{PAB)=^{AB), 

(10) 

i{pcB)  =  i{CB), 

(11) 

i{pcE)  =  £{CE), 

(12) 

£{PAE)  =  i{AE). 

(13) 

In  fact,  if  one  of  them  does  not  hold,  then 

1{AB)  +  1{CB)  +  1{CE)  +  1{AE) 

<  (-{pab)  +  ^{pcb)  +  ^{pce)  +  ^{pae)  =  2^(r). 

So, 

ALm{A,B,C,  E) 

<  Z{i{AB)  +  i{CB)^l{CE)Are{AE)) 

<  U{T)  =  &Ls{A,B,C,E), 

contradicting  (9).  By  Lemma  8  and  (10)-(13),  dD 
must  be  a  parallelogram.  □ 

Lemma  10  Let  d{dD,  dD')  denote  the  maximum  Eu¬ 
clidean  distance  between  the  two  intersections  of  a  ray 
from  the  origin  with  dD  and  dD' .  Then  for  any  6  >  Q 
and  k,  there  exists  e  >  0  such  that  d{dD,dD')  <  e 
implies  \pk{D)  -  Pk{D')\  <  6. 


Proof  Consider  any  set  of  k  points  as  a  point  in 
2A-dimensional  space.  Let  fi  be  the  point  set  in  2k- 
dimensional  space  consisting  of ’’points”  each  of  which 
is  a  set  of  k  points  in  the  plane  with  a  Euclidean  mini¬ 
mum  spanning  tree  of  length  one.  Then  12  is  a  compact 
set.  In  addition,  it  is  easy  to  see  that  for  any  D, 


Now,  suppose  to  the  contrary  that  dD  is  not  a  par¬ 
allelogram  and  Pk{D)  =  2/3  for  some  fixed  value  of 
k.  By  Lemma  9,  Pa{D)  >  2/3.  Thus,  there  ex¬ 
ists  k' ,  4  <  k'  <  k,  such  that  pk'-i{D)  >  2/3  and 
pk'{D)  =  2/3.  Let  P  be  a  set  of  k'  points  such  that 
Ls{P)/ Lm{P)  =  2/3.  Then  every  minimum  Steiner 
tree  for  P  is  full.  By  Lemma  10,  we  can  choose  a 
sequence  of  norms  D'  with  strictly  convex  and  dif¬ 
ferentiable  boundary  such  that  pk'-\{D')  <  pk'{D'). 
So,  the  minimum  Steiner  tree  T{D')  for  P  under  each 
norm  D'  is  still  full.  By  Lemma  1,  every  T{D')  is 
3-regular  and  satisfies  the  condition  that  all  edges  of 
T(D')  lie  in  three  directions.  Since  the  number  of 
3-regular  trees  with  k'  leaves  is  finite,  there  is  a  subse¬ 
quence  of  {T{D')}  which  converges  to  a  3-regular  tree 
and  satisfies  the  same  condition.  It  is  easy  to  see  that 
this  tree  must  be  a  rninimum  Steiner  tree  for  P  under 
the  norm  ||  ■  Ho.  By  the  argument  used  in  the  proof 
of  Du  and  Hwang  [7],  it  follows  that  P  is  a  critical 
set.  Now,  by  using  the  argument  in  the  proof  of  the 
first  part  of  the  theorem,  taking  special  care  of  the 
cases  in  which  equality  holds  in  various  inequalities, 
we  eventually  obtain  a  contradiction.  This  completes 
the  proof  of  the  second  part  of  the  claim  and  the  proof 
is  complete.  □ 

4  Discussion 

We  conjecture  that  for  any  norm  ||  •  ||£),  there  exists  k 
such  that  Pk{D)  =  p{D).  A  consequence  of  this  con¬ 
jecture  is  that  p{D)  =  2/3  iff  dD  is  a  parallelogram. 

The  proof  techniques  used  in  this  paper  are  different 
from  those  in  Hwang  [13]  for  proving  2/3  as  the  Steiner 
ratio  of  the  rectilinear  plane.  Graham  and  Hwang  [12] 
conjectured  that  m-dimensional  rectilinear  space  has 
the  Steiner  ratio  m/{2m  -  1).  Although  the  methods 
in  [13]  do  not  seem  to  be  applicable  to  proving  this 
conjecture,  perhaps  the  ideas  we  use  here  will  be  of 
some  help.  We  hope  to  consider  this  in  the  near  future. 
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Bounding  the  Number  of  Geometric  Permutations 
Induced  by  /c-Transversals 


Jacob  E.  Goodman* 
Richard  PoUackJ 
Rephael  Wenger^ 


Abstract 

We  prove  that  a  (fc-l)-separated  family  of  n  com¬ 
pact  convex  sets  in  can  be  met  by  A:-trans- 

versals  in  at  most  ^  ^ 

or,  for  fixed  k  and  d,  different 

order  types.  This  is  the  first  non-trivial  bound 
for  1  <  <  d  -  1 . 

Introduction 

Let  A  be  a  family  of  n  compact  convex  sets  in 
R'^.  A  line  transversal  of  the  family  A  is  a  line 
that  intersects  every  member  of  A.  K  the  sets 
in  A  are  pairwise  disjoint,  then  a  directed  line 
transversal  induces  a  hnear  ordering  on  the  sets, 
determined  by  the  order  in  which  the  fine  inter- 
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sects  them.  An  undirected  fine  transversal  in¬ 
duces  a  pair  of  orderings  on  A  corresponding  to 
the  two  orientations  of  the  Une.  Such  a  pair  of 
orderings  induced  by  a  fine  transversal  is  called 
a  geometric  permutation  of  the  set  A. 

A  generahzation  of  the  concept  of  fine  trans¬ 
versal  is  a  k -transversal,  a  flat,  or  affine  subspace, 
of  dimension  k  that  intersects  every  member  of 
A.  The  family  A  is  k- separated  [6]  if  no  subset  of 
A  of  size  greater  than  +  1  has  a  A:-transversal. 
Equivalently,  A  is  Ar-separated  if  every  family  of 
j  sets  of  A  can  be  strictly  sepajated  from  ev¬ 
ery  disjoint  family  of  k  +  2  -  j  sets  of  A  by  a 
hyperplane. 

If  A  is  (A:-l)-separated,  then  each  oriented  k- 
transversal  f  induces  an  “ordering”  on  A  as  fol¬ 
lows.  For  each  convex  set  a  e  A,  choose  a  point 
Pa  G  /  n  a.  The  order  type  of  the  set  of  points 

A  =  {Pa  I  a  G  A}  is  the  family  of  orientations  of 
all  the  ordered  (A:-|-l)-tuples  from  Va-  Because 
A  is  (A:  -  l)-separated,  the  order  type  of  Va  is 
independent  of  the  choice  of  pa.  This  order  type 
represents  the  order  in  which  /  meets  the  mem¬ 
bers  of  A.  An  imoriented  A:-transversal  induces  a 
pair  of  order  types  on  A  corresponding  to  the  two 
orientations  of  the  A;-transversal.  Such  a  pair  of 
orderings  is  also  known  as  a  geometric  permuta¬ 
tion  of  the  set  A.  A  more  complete  discussion  of 
A:-transversals  and  geometric  permutations  can 
be  found  in  the  survey  paper  [6]. 

What  is  the  maximum  number  of  geomet¬ 
ric  permutations  induced  by  A:-transversals  for 
a  (A:  -  l)-separated  family  of  n  compact  convex 
sets  in  For  fine  transversals  in  the  plane, 
the  answer,  due  to  Edelsbrurmer  and  Sharir  (see 
[4]),  is  2n  -  2.  In  higher  dimensions,  much  less 
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is  known.  Clearly  the  number  of  geometric  per¬ 
mutations  is  at  most  the  number  of  order  types 
of  n  points  in  R^,  which  is 

K 

by  results  of  [5].  Katchalski,  Lewis,  and  Liu  give 
a  lower  bormd  of  for  line  transversals 

in  R^  [7],  while  Wenger  gives  an  upper  bound 
of  for  the  same  problem  [9].  For  k- 

transversals  in  R'^  only  the  trivial  lower  bormd 
of  which  follows  immediately  from  the 

resrdts  of  [7]  by  “Hfting”,  is  known.  CappeU  et 
al.  give  a  bormd  of  for  the  number  of 

geometric  permutations  induced  by  hyperplane 
transversals  of  families  of  compact  convex  sets 
[3].  This  paper  gives  the  first  polynomial  (in 
n)  bound,  for  arbitraxy  k  and 

d  (Theorem  2). 

The  main  new  idea  that  we  use  to  obtain  this 
polynomial  bormd  is  the  following  result,  which 
may  be  of  independent  interest. 

The  orientation  of  a  labeled  set  5,  of  d  +  1 
points  in  R'^  is  determined  by  any  set  of  —  2 
vectors  vp  such  that  for  each  non-empty  proper 
subset  P  C  S,  Vp  is  the  unit  vector  normal  to 
some  hyperplane  separating  P  from  its  comple¬ 
ment  S  \  P  and  pointing  toward  P.  This  fact  is 
not  as  transparent  as  it  appears  and  constitutes 
Theorem  1. 

We  would  hke  to  express  our  gratitude  to 
Marie-Prrmgoise  Roy  for  bringing  to  our  atten¬ 
tion  the  embedding  of  the  real  Grassmaimicm 
Gk^d  in  R'^  which  we  use  in  the  proof  of  The¬ 
orem  2  in  place  of  the  more  familiar  embedding 
in  R(t)  using  Pliicker  coordinates.  This  embed¬ 
ding  has  the  effect  of  simplifying  the  presentation 
of  the  argmnent  and  of  making  the  bormd  singly 
rather  than  doubly  exponential  in  k. 

Determining  Orientation  from 
Separation 

Let  P  =  {ail, . . . ,  aid+i}  be  a  labeled  set  of  d  -|- 1 
points  in  general  position  in  R*^.  The  orientation 
of  P  is  the  sign  of  the  (d  -f  1)  x  (d  -|- 1)  determi¬ 
nant  whose  ith  row  consists  of  1  followed  by  the 


coordinates  of  Xi.  For  every  partition  of  P  into 
an  ordered  pair  (P',P")  of  non-empty  subsets, 
choose  some  oriented  hyperplane  separating  P' 
from  P"  with  normal  vector  vpi  pointing  toward 
P'.  Let  H  be  the  set  of  these  2'^'*"^  —  2  separat¬ 
ing  hyperplanes  and  let  N  be  the  corresponding 
set  of  unit  normal  vectors.  The  orientation  of  an 
ordered  d-tuple  of  vectors  in  N  is  the  sign  of  the 
determinant  of  the  d  vectors  taken  in  order  as 
the  rows  of  a  d  x  d  matrix.  The  order  type  of 
N  is  the  correspondence  between  these  d-tuples 
and  their  orientation.  There  are  many  choices 
for  H,  giving  rise  to  sets  N  with  many  different 
order  types.  Neverthess,  we  shall  show  that  the 
order  type  of  N  determines  the  orientation  of  P. 

First,  notice  that  dropping  just  two  normal 
vectors  from  N  may  make  it  impossible  to  deter¬ 
mine  the  orientation  of  P  from  the  order  type  on 
N.  Choose  any  partition  of  {1, . . . ,  d}  into  non¬ 
empty  subsets  I'  and  I",  and  let  /  be  a  (d— 1)- 
flat  in  R'^.  It  is  possible  to  place  d  -|-  1  points 
P  =  xj, . . . ,  Xrf+i  in  /  so  that  any  partition  other 
than  the  one  corresponding  to  R'  =  {xj  |  i  £  I'} 
and  R"  =  {xi\i  6  I"}  can  be  realized  by  split¬ 
ting  the  point  set  with  a  (d-2)-fiat  in  f.  Con¬ 
struct  all  such  sphtting  (d-2)-flats  and  extend 
them  to  oriented  hyperplanes  in  R*^  to  generate 
a  set  H  consisting  of  -  4  separating  hyper¬ 
planes.  Moving  one  of  the  d  -|-  1  points  above  or 
below  /  gives  us  two  (d-f-l)-tuples  of  points,  P 
and  Q,  in  general  position.  The  set  H  separates 
each  of  P  and  Q  in  the  same  2'^'^^  —  4  distinct 
ways,  yet  P  and  Q  have  opposite  order  types. 
The  two  missing  hyperplanes  separating  points 
corresponding  to  R'  and  R"  provide  crucial  in¬ 
formation  about  the  orientations  of  P  and  Q. 

Assume  now  that  if  is  a  set  of  2^^+^  -  2  ori¬ 
ented  hyperplanes  separating  a  (d-fl)-tuple  P  as 
described  above,  and  let  N  be  the  set  of  unit  nor¬ 
mal  vectors  of  H .  Associate  the  zth  member  of 
P  with  the  ith  vertex  of  the  standard  d- simplex 
in  R*^,  whose  vertices  in  canonical  order 
are  (0,0,. ..,0),  (1,0,. ..,0),  (0, 1,  0, . . . ,  0),. . . , 
(0,...,0,1).  Notice  that  the  orientation  of  the 
vertices  in  this  order  is  positive.  Using  N  we  con¬ 
struct  a  map  (j>pt  from  the  boundary  of  the 
standard  simplex,  to  the  unit  sphere  C  R"^. 

Let  K  be  the  boundary  complex  of  A*^,  and  L 
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the  baxy centric  subdivision  of  K.  Each  (closed) 
face  K.  in  if  is  associated  with  a  vertex  in  L. 

K,  contains  a  set  D'  of  vertices  of  K  and  misses 
another  set  D" .  Let  P'  and  P"  be  the  subsets 
of  P  associated  with  D'  and  P",  respectively. 
Define  to  be  the  unit  normal  vector  of 

the  oriented  hyperplane  separating  P'  and  P" 
with  normal  pointing  toward  P' .  Extend  this 
map  in  each  face  of  the  barycentric  subdivision 
by  mapping  each  point  x  =  Sa/c  =  1, 

Uk  >  0,  to  the  vector  X)  0'k.<I>n{wk.)- 

Every  (ci-l)-simplex  A  of  P  contains  a  unique 
vertex  v'  of  and  lies  on  a  facet  of  op¬ 
posite  another  vertex  v"  6  A^.  Let  h  be  some 
hyperplane  perpendicular  to  the  line  through  the 
points  p',p''  €  P  corresponding  to  v',v"  (resp.) 
and  separating  p'  from  p".  For  each  vertex  w  of 
A,  the  vector  points  away  from  the  half¬ 

space  bounded  by  h  containing  p"  and  toward 
the  half- space  containing  p' .  Thus  all  the  vec¬ 
tors  must  lie  in  an  open  half-space  and  so 

X)  is  never  0.  Normahze 

to  map  a:  to  a  point  <Pn{^)  on 

The  resulting  mapping,  ^  — »•  ,  has 

a  weU-defined  degree,  which  is  (intuitively)  the 
algebraic  niimber  of  times  <j)i^  wraps  aroimd 
S<i-i  Pqj.  ijasic  facts  about  this  degree,  see  for 
example  [8]. 

Lemma  1  Let  P  be  a  {d+l)-tuple  of  points  in 
general  position  in  let  H  be  some  set  of  sep¬ 
arating  hyperplanes  for  P,  and  let  N  be  the  set 
of  normals  of  H .  The  map  das  degree  +1  if 
the  orientation  of  P  is  positive  and  degree  -1  if 
the  orientation  of  P  is  negative. 

Proof:  If  P  has  negative  orientation  then  we 
may  reflect  it  and  the  hyperplanes  in  H,  say  by 
reversing  the  sign  of  the  first  coordinate.  Reflec¬ 
tion  reverses  both  the  orientation  of  P  and  the 
sign  of  the  degree  of  the  mapping  ^at-  Thus  we 
may  assmne  that  P  has  positive  orientation. 

For  any  (d-|- l)-tuple  Q  of  points  in  general 
position  in  R^  we  define  the  following  c2inoni- 
cal  separating  set.  Let  q*  be  the  bary center  of 
the  simplex  whose  set  of  vertices  is  Q.  For  ev¬ 
ery  partition  of  Q  into  sets  Q'  and  Q"  there  is  a 
unique  hyperplane  containing  Q'  which  is  paral¬ 
lel  to  some  hyperplane  containing  Q".  Translate 


this  hyperplane  to  pass  through  q*  and  orient  it 
with  normal  pointing  toward  Q'.  Let  H*{Q)  and 
N*{Q)  be  this  canonical  set  of  hyperplanes  and 
their  normal  vectors,  respectively. 

We  first  show  that  the  degree  of  equals  the 
degree  of  (/ia^.(P).  The  set  H  of  separating  hyper¬ 
planes  can  be  continuously  transformed  to  the 
canonical  separating  set  H*{P).  Let  Ht  denote 
the  intermediate  sets  of  separating  hyperplanes 
where  0  <  t  <  1,  Hq  —  H ,  and  Hi  =  H*{P).  Let 
Nt  be  the  set  of  normals  of  Ht  ■ 

Since  the  normals  in  Nt  vary  continuously  with 
the  separating  hyperplanes,  for  each  vertex  of 
the  barycentric  subdivision  the  image 
must  vary  continuously  as  a  function  of  t.  Hence 
for  any  point  x  =  Yl  convex  combi¬ 

nation  Y(^Kd>Nt{'^K.)  ^so  varies  continously  as  a 
function  of  t,  and  the  same  holds  for  its  nor¬ 
malization  (x).  Hence  the  degree  of  ^ATf  is 
a  constant  that  does  not  change  with  t,  and,  in 
particular,  the  degree  of  ^at  =  <I>N(,  equals  the 
degree  of  (j)N*{P)  -  <l>Ni  ■ 

Let  Vd  be  the  set  of  vertices  of  A^  in  canonical 
order.  It  is  easy  to  check  that  the  map  <f>N*{Vd) 
is  a  homeomorphism  of  dA‘^  to  the  unit  sphere 
and  has  degree  -|-1.  We  show  that  the  degree  of 
^w*(P)  equals  the  degree  of  <i>N*(Vi)- 

Since  P  and  Vj  have  the  same  positive  ori¬ 
entation,  we  can  continuously  move  the  points 
in  P  to  the  corresponding  points  in  Vd,  keeping 
them  in  general  position;  this  can  be  shown,  for 
example,  by  an  easy  inductive  argument.  Let 
Pt  denote  the  intermediate  sets  of  points  where 
a  <  t  <  1,  Pq  =  P,  and  Pi  =  Vd-  The  sets 
H*{Pt)  and  N*{Pt)  are  also  continuous  func¬ 
tions  of  t.  As  argued  above,  this  imphes  that 
d^N*(Pt)  ^  continuous  function  of  t.  Thus  the 
degree  of  ^Ar*(P)  =  ^N*{Po)  equals  the  degree  of 
<f>N*(Vd)  —  4>n*(Pi)i  which  is  -t-1.  Since  the  degree 
of  4>n  equals  the  degree  of  <^Ar*(P)  >  orientation 
of  P  matches  the  degree  of  □ 

Lemma  1  estabhshes  that  the  orientation  of  P 
depends  upon  the  degree  of  We  now  show 
that  the  degree  of  4>n  depends  only  upon  the 
order  type  of  N . 

Lemma  2  The  order  type  of  N  determines  the 
degree  of  d>N- 
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Proof:  As  before,  let  K  be  the  boundary  com¬ 
plex  of  the  standard  d-simplex,  and  L  the 
baxycentric  subdivision  of  K.  Let  0  be  the  set 
of  all  unit  vectors  that  axe  hneax  combinations 
of  d  —  1  or  fewer  vectors  of  N.  Choose  any  point 
y'  on  the  unit  sphere  that  is  not  contained  in  0. 
Since  the  degree  of  is  -fl  or  —1,  the  point 
y'  =  <j>N{x')  for  some  point  x'  on  dA‘^.  Point  x' 
hes  in  the  interior  of  some  (d— l)-face  A  of  L.  Let 
Wo,  wi, . . . ,  wj-i  be  the  vertices  of  A. 

For  any  d  vectors  ui , . . . ,  let  det(ui , . . . , 

represent  the  determinant  of  the  matrix  with  the 
vectors  Vi  (in  that  order)  as  rows.  Choose  an 
€  >  0  such  that 

e  <  1  |det(t;i,U2,...,Ud)| 
d|det(t;(,u^,...,t;j)l 

for  all  Vi,v^  €  N  where  det(ui, . .  .,Vd)  7^  0  and 
det(uj, . . . ,  Uj)  7^  0.  Let  =  l/(l-be-|--  •  ^), 

X  =  n{wo  +  ewi-\ - and  y  =  <I>n{x)- 

We  claim  that  the  order  type  of  iV  U  {y}  is  de¬ 
termined  by  the  order  type  of  N.  Let  Vi , . . . ,  Vd-i 
be  a  set  of  d  —  1  vectors  in  N.  The  orienta¬ 
tion  of  the  d-tuple  (y,  vi, . . . ,  Vd-i)  is  the  sign  of 
det(y,  vi,...,  Vd-i).  If  the  vectors  Vi  axe  linearly 
dependent,  then  the  orientation  of  the  d-tuple 
(y,  ui, . . . ,  Ud_i)  is  zero.  Assume  the  Vi  are  lin¬ 
early  independent.  By  definition, 

4>n{x) 

=  v{(I>n{wo)  -i  e(l)N{'Wi)  H - h  ^N{wd-i)), 

where  1/  is  chosen  so  that  ||^jv'(®)||  =  1-  Thus 

det(y,ui, . . .,Ud-i) 
d-l 

=  1/  ^  det(^7v(TWi),ui, . .  .,Ud-i). 
1=0 

By  our  choice  of  a  suitably  small  e,  the  sign 
of  this  sum  is  determined  by  the  sign  of  the 
first  non-zero  term,  e*  det((^^(wi),  ui, . . . ,  Ud-i), 
in  this  sum.  The  vectors  (l>i^{wi),  vi, . . Vd-i  are 
all  in  iV  and  so  the  order  type  of  N  determines 
the  sign  of  d.et{(f>j^{wi),vi, . . .  ,Vd-i)-  Thus  the 
orientation  of  every  d-tuple  in  iV  Li  {y}  is  com¬ 
pletely  determined  by  the  orientation  of  the  d- 
tuples  in  N. 

Notice  that  y  ^  0.  For  otherwise,  we  would 
have  det(y,t;i,...,Ud_i)  =  0  for  some  choice 


of  vi, . .  .,Vd-i.  But  then,  by  the  choice  of  e, 
it  would  foUow  that  det(<^iv(tyi), . . . ,  I’d-i)  =  0 
for  each  i,  which  would  imply  that  ^iv(w'o),  •  •  -  , 
<i>N{wd-\)  are  hnearly  dependent,  contrary  to  the 
choice  of  the  simplex  A. 

Partition  the  (d-l)-simpHces  of  L  whose  image 
contains  y  into  those  whose  orientation  is  pre¬ 
served  and  those  whose  orientation  is  reversed 
by  ^jv.  The  degree  of  </>jv  is  then  the  number  of 
the  former  minus  the  number  of  the  latter  (see 
[8],  for  example).  The  simphces  whose  image 
contains  y  axe  determined  by  the  order  type  of 
N  U  {y},  as  is  their  orientation.  Thus  the  degree 
of  is  determined  by  the  order  type  of  N .  □ 

We  have  therefore  proved: 

Theorem  1  The  orientation  of  a  {d+l)-tuple  P 
of  points  in  general  position  in  R'^  is  determined 
by  the  order  type  of  the  normal  vectors  to  any 
set  of  oriented  hyperplanes  separating  each  non¬ 
empty  subset  of  P  from  its  complement. 

Bounding  the  Number  of  Geo¬ 
metric  Permutations 

Let  A  be  a  (/fc-l)-sepaxated  family  of  n  compact 
convex  sets  in  R'^.  For  every  subset  A'  of  size 
jfe  -i  1,  choose  a  family  of  2^'^^  —  2  oriented 
separating  hyperplanes  for  A! ,  and  let  Nj^i  be 
the  set  of  unit  normals  to  the  members  of  H_\i. 
Let  and  be  the  sets  of  all  the  chosen 
hyperplanes  and  normal  vectors,  respectively. 

For  each  oriented  A:-transversal  /,  let  /fl  A  and 
represent  the  sets  obtained  by  intersecting 
/  with  the  sets  in  A  and  with  the  hyperplanes  in 
respectively.  Notice  that  /  meets  each  sep¬ 
arating  hyperplane  transversaUy.  Let  Mf  be  the 
d  X  d  matrix  which  in  the  standard  basis  repre¬ 
sents  orthogonal  projection  onto  /o,  the  oriented 
fc-dimensional  subspace  of  R'^  parallel  to  f.  No¬ 
tice  that  Mf  is  symmetric,  idempotent  and  has 
trace  k.  In  fact,  such  matrices  axe  in  one-to- 
one  correspondence  with  the  fc-dimensional  sub¬ 
spaces  of  R'^,  which  make  up  the  real  Grassman- 
nian  Gk,d.  This  Grassmannian  is  an  algebraic 
variety  in  R'^  of  dimension  k{d—  k),  defined  by 
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polynomials  of  degree  at  most  2  (see  [2]  for  more 
details).  Let  f{V)  be  the  orthogonal  projection 
of  any  set  V  of  vectors  in  onto  /q.  Represent¬ 
ing  V  as  the  columns  of  a  d  x  |F|  matrix,  we  see 
that  f{V)  is  then  represented  by  the  columns  of 
MfV.  Notice,  too,  that  /(N^)  is  a  set  of  normals 
to  /  n  Ha- 

By  Theorem  1,  two  oriented  ^-transversals,  f 
and  f',  generate  the  same  order  type  on  A  if  for 
every  (fc -f  l)-subset  A'  the  orientation  of  every 
fe-tuple  in  and  of  the  corresponding  k- 

tuple  in  f'{N^i)  is  the  same.  Let  V  he  a  k- 
tuple  of  N^i.  The  orientation  of  f{V)  is  zero 
if  and  only  if  Mf{V)  has  rank  <  ib,  and  this 
will  be  true  if  and  only  if  the  maximal  minors 
of  Mf[V)  are  aU  zero.  Let  gy  be  the  sum  of 
the  squares  of  these  maximal  minors,  gy  is  a 
polynomial  of  degree  2d  in  the  indeterminates 
txii, . .  .Udd,  the  coordinates  of  a  generic  point  of 
R'^^ .  By  continrdty,  two  fc-transversals  /  and  /' 
meet  A'  in  the  same  order  type  if  the  subspaces 
corresponding  to  Mf  and  Mf  can  be  connected 
in  Gk4  without  meeting  the  zero  set  of  gy  for 
any  A:-element  subset  V  of  N^i. 

Each  A:-subset  V  of  N^i  determines  such  a 
polynomial  gy  of  degree  2d  in  R[uii, . . .  ,Udd]- 
There  are  such  polynomials  for  each  of 

the  subsets  A'  of  A  of  size  k  +  1,  giving  a 
total  of 


polynomials.  In  [1]  it  is  shown  that  the  number 
of  coimected  components  of  all  non-empty  sign 
conditions  (and  hence  in  our  case  of  the  number 
of  coimected  components  on  G^^d  of  the  comple¬ 
ment  of  the  zero  set  of  these  s  polynomials)  of 
5  polynomials  in  R[Xi, . .  .,Xm],  each  of  degree 
at  most  d  on  a  variety  of  dimension  m!  defined 
by  polynomials  of  degree  at  most  d,  is  at  most 
0{d)^.  Applying  this  bound  to  our  polyno¬ 
mials  on  the  variety  Gk,d,  we  have  proved: 


Theorem  2  A  {k—l)-separated  family  ofn  com¬ 
pact  convex  sets  in  R'^  has  at  most 


0{d)^ 


( 2^+1 


k{d-k) 


geometric  permutations. 


Conclusion 

In  this  paper  we  have  generalized  arguments 
given  in  [9]  for  boimding  the  number  of  geo¬ 
metric  permutations  of  line  transversals  in  R'^. 
If  we  pay  attention  only  to  the  dependence  on 
n,  the  bound  given  here  matches 

the  one  in  [9]  when  k  —  1.  Better  bounds  are 
known  for  the  special  cases  of  fine  transversals 
in  the  plane  (0(n))  and  of  hyperplane  transver¬ 
sals  in  R'^  {0{n^~^))  [3,  4,  9].  We  suspect  that 
the  number  of  geometric  permutations  for  ib-fiats 
is  at  most  0(n^^^“^)). 
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Abstract 

We  show  that  any  graph  of  n  vertices  that  can  be 
drawn  in  the  plane  with  no  ^  +  1  pairwise  crossing 
edges  has  at  most  ctnlog^*“^n  edges.  This  gives  a 
partial  answer  to  a  dual  version  of  a  well-known  prob¬ 
lem  of  Avital-Hanani,  Erdos,  Kupitz,  Perles,  and  oth¬ 
ers.  We  also  construct  two  point  sets  {pi,  ■  ■ .  ,Pn}, 
{?!!  ■  •  •  1 9n}  in  the  plane  such  that  any  piecewise  lin¬ 
ear  one-to-one  mapping  /  :  with  f{pi)  =  qi 

(1  <  i  <  n)  is  composed  of  at  least  f2(n^)  linear  pieces. 
It  follows  from  a  recent  result  of  Souvaine  and  Wenger 
that  this  bound  is  asymptotically  tight.  Both  proofs 
are  based  on  a  relation  between  the  crossing  number 
and  the  bisection  width  of  a  graph. 


1  Introduction 

A  geometric  graph  is  a  graph  drawn  in  the  plane 
by  (possibly  crossing)  straight-line  segments  i.e.,  it  is 
defined  as  a  pair  of  {V{G),  E{G)),  where  V(G)  is  a  set 
of  points  in  the  plane  in  general  position  and  E{G)  is 
a  set  of  closed  segments  whose  endpoints  belong  to 
V{G). 

The  following  question  was  raised  by  Avital  and 
Hanani  [AH],  Erdos,  Kupitz  [K]  and  Perles:  What  is 
the  maximum  number  of  edges  that  a  geometric  graph 
of  n  vertices  can  have  without  containing  Ar-fl  pairwise 
disjoint  edges?  It  was  proved  in  [PT]  that  for  any  fixed 
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k  the  answer  is  linear  in  n.  (The  cases  when  ifc  <  3  had 
been  settled  earlier  by  Hopf  and  Pannwitz  [HE],  Erdos 
[E],  Alon  and  Erdos  [AE],  O’Donnel  and  Perles  [OP], 
and  Goddard,  Katchalski  and  Kleitman  [GKK].) 

In  this  paper  we  shall  discuss  the  dual  counterpart 
of  the  above  problem.  We  say  that  two  edges  of  G 
cross  each  other  if  they  have  an  interior  point  in  com¬ 
mon.  Let  ek{n)  denote  the  maximum  number  of  edges 
that  a  geometric  graph  of  n  vertices  can  have  without 
containing  I;  -f  1  pairwise  crossing  edges.  If  G  has  no 
crossing  edges,  then  it  is  a  planar  graph.  Thus,  it 
follows  from  Euler’s  polyhedral  formula  that 

ei(n)  =  3n  -  6  for  all  n  >  3. 

It  was  shown  in  [P]  that  e2(n)  <  13n^'^^  and  that,  for 
any  fixed  k, 

eUn)  = 

However,  we  suspect  that  ek{n)  =  0{n)  holds  for  ev¬ 
ery  fixed  k  as  n  tends  to  infinity.  We  know  that  this 
is  true  for  convex  geometric  graphs  [CP]  (a  geomet¬ 
ric  graph  is  convex  if  the  nodes  of  the  graph  are  on 
the  convex  hall  of  the  drawing).  Our  next  theorem 
brings  us  fairly  close  to  this  bound  for  arbitrary  geo¬ 
metric  graphs.  (We  remark  that  our  proof  works  just 
as  well  for  drawings  where  the  edges  are  arbitrary  Jor¬ 
dan  curves.) 

Theorem  1.1  Lei  G  be  a  geometric  graph  of  n  ver¬ 
tices,  containing  no  k  \  pairwise  crossing  edges. 
Then  the  number  of  edges  of  G  satisfies 

\EiG)\<Cknlog^’‘-^n, 

with  a  suitable  constant  Ck  depending  only  on  k. 

The  proof  is  based  on  a  general  result  relating  the 
crossing  number  of  a  graph  to  its  bisection  width  (see 
Theorem  2.1).  A  nice  feature  of  our  approach  is  that 
we  do  not  use  the  assumption  that  the  edges  of  G  are 
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line  segments.  Theorem  1.1  remains  valid  for  graphs 
whose  edges  are  represented  by  arbitrary  Jordan  arcs 
with  the  property  that  no  arc  passes  through  any  ver¬ 
tex  other  than  its  endpoints. 

The  same  ideas  can  be  used  to  settle  the  follow¬ 
ing  problem.  Let  Ti  and  T2  be  triangles  in  the  plane, 
and  let  {pi , . . . , Pn}  and  {gi , . . . ,  be  two  n-element 
point  sets  lying  in  the  interior  of  Ti  and  T2,  respec¬ 
tively.  A  hotneomorphism  f  from  Ti  onto  T2  is  a  con¬ 
tinuous  one-to-one  mapping  with  continuous  inverse. 
/  is  called  piecewise  linear  if  there  exists  a  triangula¬ 
tion  of  Ti  such  that  /  is  linear  on  each  of  its  triangles. 
The  size  of  /  is  defined  as  the  minimum  number  of 
triangles  in  such  a  triangulation.  Recently,  Souvaine 
and  Wenger  [SW]  have  shown  that  one  can  always  find 
a  piecewise  linear  homeomorphism  /  :  Ti  — >  T2  with 
f{pi)  =  (1  <  *  <  n)  such  that  the  size  of  /  is 

O(n^).  Our  next  result  shows  that  this  bound  cannot 
be  improved. 

Theorem  1.2  There  exist  a  triangle  T  and  two  point 
sets  {pi, . .  .,p„}, 

{911  •  •  M?"}  C  int  T  such  that  the  size  of  any  piece- 
wise  linear  homeomorphism  f  :T  —*T  which  maps  pi 
to  qi  (1  <  J  <  n)  is  at  least  cn^  (for  a  suitable  constant 
c  >  0). 

For  some  closely  related  problems  consult  [S]  and 
[ASS]. 


2  Crossing  number  and  bisection 
width 

Let  G  be  a  graph  of  n  vertices  with  no  loops  and 
no  multiple  edges.  For  any  partition  of  the  vertex  set 
V(G)  into  two  disjoint  parts  Vi  and  V2,  let  E{Vi,  V2) 
denote  the  set  of  edges  with  one  endpoint  in  Vi  and 
the  other  endpoint  in  V2.  Define  the  bisection  width 
of  G  as 

6(G)  =  min  \E{VuV2)\, 

|Vi,|V'3|>n/3' 

where  the  minimum  is  taken  over  all  partitions 
V(G)  =  ViUV2  with  I  Vi  I,  1^2 1  >  n/3. 

Consider  now  a  drawing  of  G  in  the  plane  such  that 
the  vertices  of  G  are  represented  by  distinct  points  and 
the  edges  are  represented  by  Jordan  arcs  connecting 
them  with  the  property  that  no  arc  passes  through  a 
vertex  different  from  its  endpoints.  The  crossing  num¬ 
ber  c(G)  of  G  is  defined  as  the  minimum  number  of 
crossing  pairs  of  arcs  in  such  a  drawing  of  G,  where 
two  arcs  are  said  to  be  crossing  if  they  have  an  in¬ 
terior  point  in  common.  It  is  easy  to  show  that  the 


minimum  number  of  crossings  can  always  be  realized 
by  a  drawing  satisfying  the  following  conditions: 

(1)  no  two  arcs  meet  in  more  than  one  point  (in¬ 
cluding  their  endpoints); 

(2)  no  three  arcs  share  a  common  interior  point. 

We  need  the  following  result  which  is  an  easy  con¬ 
sequence  of  a  weighted  version  of  the  Lipton-Tarjan 
separator  theorem  for  planar  graphs  [LTj. 

Theorem  2.1  Let  G  be  a  graph  with  n  vertices  of  de¬ 
gree  di, . . . ,  d„.  Then 

b\G)  <  (1.58)2  ^i6c(G)  +  > 


where  b{G)  and  c(G)  denote  the  bisection  width  and 
the  crossing  number  ofG,  respectively. 

Proof:  Let  if  be  a  plane  graph  on  the  vertex  set 
V{H)  =  {?;i, . . . ,  ujv}  such  that  each  vertex  has  a  non¬ 
negative  weight  w(vi)  and  w{vi)  =  1.  Let  d(vi) 
denote  the  degree  of  Uj  in  if.  It  was  shown  by  Gazit 
and  Miller  [GM]  that,  by  the  removal  of  at  most 

/  N 

1.58 

edges,  if  can  be  separated  into  two  disjoint  subgraphs 
ffi  and  II2  such  that 

viev(i/i)  vieviHj) 

(See  also  [M]  and  [DDS].) 

Consider  now  a  drawing  of  G  with  c(G)  crossing 
pairs  of  arcs  satisfying  conditions  (1)  and  (2).  Intro¬ 
ducing  a  new  vertex  at  each  crossing,  we  obtain  a  plane 
graph  if  with  iV  =  n  -f  c(G)  vertices.  Assign  weight  0 
to  each  new  vertex  and  weights  of  1/n  to  all  other  ver¬ 
tices.  The  above  result  implies  that,  by  the  deletion 
of  at  most 

/  n  \  1/2 

1.58  n6c(G)  +  ^dn 

edges,  if  can  be  separated  into  two  parts  ifi  and  ff2 
such  that  both  of  the  sets  Vi  =  V{Hi)  fl  V^(G)  and 
V{H2)  n  V{G)  have  at  least  n/3  elements.  Hence, 

6(G)  <  \E{VuV2)\  <  1.58  ^16c(G)  + 


and  the  result  follows.  □ 

In  the  special  case  when  every  vertex  of  G  is  of 
degree  at  most  4,  Theorem  2.1  was  established  by 
Leighton  [L]  and  it  proved  to  be  an  important  tool 
in  VLSI  design  (see  [U]). 


199 


3  Geometric  graphs 

The  aim  of  this  section  is  to  prove  the  following 
generalization  of  Theorem  1.1  for  curvilinear  graphs. 

Theorem  3.1  Let  G  be  a  graph  with  n  >  2  vertices, 
and  let  k  >  1.  If  G  has  a  drawing  with  Jordan  arcs 
such  that  no  arc  passes  through  any  vertex  other  than 
its  edpoints  and  there  are  no  k  \  pairwise  crossing 
arcs,  then 


|£(G)|<3n(101og2n)2*^-2. 

Proof:  By  double  induction  on  k  and  n.  The  as¬ 
sertion  is  true  for  ^  =  1  and  for  all  n.  It  is  also  true 
for  any  >  1  and  n  <  6  ■  10^*^“^,  because  for  these 
values  the  above  upper  bound  exceeds  (j) . 

Assume  now  that  we  have  already  proved  the  theo¬ 
rem  for  some  k  and  all  n,  and  we  want  to  prove  it  for 
^  -f  1.  Let  n  >  6  ■  10^*^,  and  suppose  that  the  theorem 
holds  for  I;  -t- 1  and  for  all  graphs  having  fewer  than  n 
vertices. 

Let  G  be  a  graph  of  n  vertices,  which  has  a  drawing 
with  no  1: 4-2  pairwise  crossing  edges.  Let  us  fix  such  a 
drawing  with  c(G)  crossings  and,  for  the  sake  of  sim¬ 
plicity,  denote  it  also  by  G  =  (V{G),E(G)).  For  any 
arc  e  £  E(G),  let  Ge  denote  the  graph  consisting  of 
all  arcs  that  cross  e.  Clearly,  Ge  hris  no  fc-|- 1  pairwise 
crossing  arcs.  Thus,  by  the  induction  hypothesis. 


than  n  vertices,  we  can  apply  the  induction  hypothesis 
to  obtain 

\E{G)\  =  \EiGi)\  +  \EiG2)\  +  b{G) 

<  3ni(10  log2  +  3n2(10 logs  ”2)^*  +  b{G), 

where  n,-  =  \Vi\  (i  =  1,2).  Combining  the  last  two 
inequalities  we  get 

\EiG)\-9V^il0log,n)'‘-^^/\EiG)\ 

<  3|(101og,|)»+3|^(101og,|)“ 

<  3n(101og2n)^*(l  - 

If  the  left  hand  side  of  this  inequality  is  negative,  then 
£  3n(101og2  and  we  are  done.  Otherwise, 

f{x)  =  X  —  9\/n(10  logs 

is  a  monotone  increasing  function  of  x  when  x  > 
|jE'(G)|.  An  easy  calculation  shows  that 


/(3n(10  logs  >  3n(10  logs  - 


Hence, 


/(|F;(G)|)</(3n(101ogsn)2^), 


c(G)  =  1  \E{G.)\ 

e€E(G) 

-  ^  m  3n(101og2Ti)^*“^ 

eeE{G) 

<  ^|£'(G')|n(101og2n)^^“^ 

Since  ^3?=!  —  2|£'(G)|n  holds  for  every  graph  G 

with  degrees  di, . .  .,d„,  Theorem  2.1  implies  that 

6(G)  <  1.58  ^16c(G)  +  ^  d?  j 

<  9^n|F;(G)|(101og2n)'^-i. 

Consider  a  partition  of  K(G)  into  two  parts  Vi  and 
Vs,  each  containing  at  least  n/3  vertices,  such  that  the 
number  of  edges  connecting  them  is  6(G).  Let  Gi  and 
Gs  denote  the  subgraphs  of  G  induced  by  Vi  and  Vs, 
respectively.  Since  neither  of  Gi  or  Gs  contains  k-\-2 
pairwise  crossing  edges  and  each  of  them  has  fewer 


which  in  turn  implies  that 

|^(G)|<3n(101ogsr^)2^ 

as  required.  □ 

4  Avoiding  snakes 

In  [ASS],  Aronov,  Seidel  and  Souvaine  con¬ 
structed  two  polygonal  regions  P  and  Q  with  vertices 
{Pi,  •  •  •  ,Prj}  and  {gi, . . . ,  gn}  in  clockwise  order  such 
that  the  size  of  any  piecewise  linear  homeomorphism 
/  :  P  — »■  Q  with  /(pi)  =  gi  (1  <  j  <  n)  is  at  least 
cn^  (for  an  absolute  constant  c  >  0).  Their  ingenious 
construction  heavily  relies  on  some  special  geometric 
features  of  “snakelike”  polygons. 

Our  theorem  1.2  (stated  in  the  introduction)  pro¬ 
vides  the  same  lower  bound  for  a  modified  version  of 
this  problem  due  to  J.E.  Goodman.  The  proof  given 
below  is  purely  combinatorial,  and  avoids  the  use  of 
“snakes.” 

Proof  of  Theorem  1.2:  Let  Ti  and  Ts  be  two 
triangles  containing  two  convex  n-gons  P  and  Q  in 
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their  interiors,  respectively.  Let  Pt(i)i  ••■ )?»(«)  de¬ 
note  the  vertices  of  P  in  clockwise  order,  where  tt  is 
a  permutation  of  {1, . . . ,  n}  to  be  specified  later.  Fur¬ 
thermore,  let  gi,...,9n  denote  the  vertices  of  Q  in 
clockwise  order.  Let  f  ■  Ti  — >  T2  be  a  piecewise  linear 
homeomorphism  with  /(p.)  =  Qi  (1  <  *  <  n),  and  fix 
a  triangulation  Ti  ofTi  with  [Til  =  size(/)  triangles 
such  that  /  is  linear  on  each  of  them.  By  subdividing 
some  members  of  7i  if  necessary,  we  obtain  a  new  tri¬ 
angulation  T/  of  Ti  such  that  each  pi  is  a  vertex  of 
and  |T/|  <  lTiH-3u. 

Obviously,  /  will  map  T/  into  an  isomorphic  tri¬ 
angulation  T2  of  T2.  The  image  of  each  segment 
P)riPT(t+i)  is  a  polygonal  path  connecting  9^(0  and 
9t(.+i),  (1  <  *  <  ”)•  The  collection  of  these  paths 
together  with  the  segments  qiqi+i  is  a  drawing  of  the 
graph  G  =  Gt  defined  by: 

(*) 

E{G)  =  {qtqi+i  1  1  <  *  <  o  {qic(i)qr{i+i)  1 

1  <  i  <  n}. 

Suppose  that  this  drawing  has  c  crossing  pairs  of 
arcs.  Notice  that  each  crossing  must  occur  between  a 
path  q^(i)qr(i+i)  and  a  segment  qjqj+i-  By  the  con¬ 
vexity  of  Q,  any  line  can  intersect  at  most  two  seg¬ 
ments  qjqj+i.  B:ence  the  total  number  of  subsegments 
of  the  concatenation  of  the  polygons  /(Pir(i)PT(i+i)). 
1  <  i  <  n,  is  at  least  c/2.  On  the  other  hand,  by 
the  convexity  of  P,  each  triangle  belonging  to  T{  in¬ 
tersects  at  most  two  sides  of  the  form  P,r(j)Px(i+i)- 
Thus,  \T(\  >  cjA,  which  yields  that 

size(/)  =  |Til  >  lT/|-3u>  ^  -  3n, 


where  c{G)  stands  for  the  crossing  number  of  G.  Ap¬ 
plying  Theorem  2.1,  we  obtain  that 


c(G)> 


h\G) 

40 


Therefore, 


size(/)  > 


h\G) 

160 


—  3n  — 


1 

4’ 


To  complete  the  proof  of  Theorem  1.2,  it  is  suffi¬ 
cient  to  show  that  for  a  suitable  permutation  tt  the 
bisection  width  of  the  graph  G  =  Gx  defined  by  (+) 
is  at  least  constant  times  n.  We  use  a  counting  argu¬ 
ment  (cf.  [AS].  The  family  of  graphs  Gx  has  size  n!. 
We  bound  from  above  the  number  of  those  members 
of  this  family  whose  bisection  width  is  at  most  k .  We 
will  see  that  for  k  <  n/20  this  number  is  less  than  n\. 


Let  6(Gx)  <k.  Let  ( Vi ,  V2)  be  a  partition  of  V (Gx) 
with  l^il,  11^1  >  n/3  and  E{Vi,V2)  <  k.  Define 

Ei(VuV2)  =  {qtqi+i  \  I  <  i  <  n}  H  E{VuV2), 

E2{Vi,V2)  =  I  1  <*■<«}  n  E{VuV2). 

Since  \Ei{Vi,V2)\  <  k,  the  partition  (14, 14)  should 
be  of  a  special  form.  If  we  delete  all  elements  of 
£^1(14 .  V2)  from  the  path  gi . . .  ,  it  splits  into  at  most 

ib  -h  1  paths  (or  points)  lying  alternately  in  14  and  in 
V2.  This  yields  a  2(ib  +  1)(’J)  upper  bound  on  the 
number  of  partitions  in  question. 

The  order  in  which  the  elements  of  14  (i  =  1, 2)  oc¬ 
cur  in  the  sequence  9x(i)  ■  •  -Qirin)  can  be  represented 
by  a  function  cr,-  :  {1, . . . ,  [Vil}  — >  14  (i  =  1,2).  For  a 
fixed  partition  (14i  14))  there  are  at  most  114H  choices 
for  (Ti  and  |14|!  choices  for  <T2.  If  and  (X2  are  also 
fixed,  then  the  number  of  possible  permutations  is 
bounded  again  by  2(1:  + 1)  (^) .  Thus  the  total  number 
of  permutations  ir  for  which  i>(Gx)  <  k  cannot  exceed 


< 


< 


|V^i1!|V2|!2(F  +  1)(”) 


(V'l.V'a) 


?  "'UAj  + 


which  is  less  than  n!  if  <  n/20,  and  n  is  sufficiently 
large.  □ 
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Cutting  Dense  Point  Sets  in  Half 

Herbert  Edelsbrunnert  Pavel  Valtr*  and  Emo  Welzl^ 


Abstract 

A  halving  hyperplane  of  a  set  S  of  n  points  in  con¬ 
tains  d  ajfinely  independent  points  of  S  so  that  equally 
many  of  the  points  off  the  hyperplane  lie  in  each  of  the 
two  half-spaces.  We  prove  bounds  on  the  number  of 
halving  hyperplanes  under  the  condition  that  the  ratio 
of  largest  over  smallest  distance  between  any  two  points 
is  at  most  5  some  constant.  Such  a  set  S  is  called 

dense. 

In  d  =  2  dimensions,  the  number  of  halving  lines  for 
a  dense  set  can  be  as  much  as  e(nlogn),  and  it  cannot 
exceed  0(n®/^/log*  n).  The  upper  bound  improves  over 
the  current  best  bound  0/ log*  n)  which  holds 
more  generally  without  any  density  assumption.  In  d  = 
3  dimensions  we  show  that  0(n'^''^)  is  an  upper  bound 
on  the  number  of  halving  planes  for  a  dense  set.  The 
proof  is  based  on  a  metric  argument  that  can  be  extended 
to  d  >  4i  dimensions,  where  it  leads  to  0{n^~  as  an 
upper  bound  for  the  number  of  halving  hyperplanes. 
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1  Introduction 

Dense  point  sets  are  sets  with  bounded  ratio  of  largest 
over  smallest  distance  between  points.  We  present  ex¬ 
tremal  properties  of  dense  point  sets  for  the  number 
of  halving  hyperplanes  and  for  the  stabbing  number  of 
geometric  d-uniform  hypergraphs.  In  this  section,  we 
introduce  and  discuss  notions  and  results. 

Dense  Sets.  Let  6  >  0.  A  set  S  of  n  points  in  is 
6 -dense  if  the  ratio  of  the  maximum  over  the  minimum 
distance  between  two  points  in  S  is  at  most  We 

use  dense  short  for  ^-dense  if  ^  is  a  constant.  Note 
that  values  of  6  that  admit  arbitrarily  large  6-dense 
sets  cannot  be  arbitrarily  small.  For  example  in 

S  >  So  =  Y  2-1/3/71  fs  1.05  is  necessary  and  sufficient. 
Arbitrarily  large  60-dense  sets  can  be  constructed  as  the 
intersection  of  a  triangle  grid  with  a  disk.  For  our  pur¬ 
poses,  it  is  possible  to  suppose  the  minimum  distance  is 
1,  so  S  is  (5-dense  if  the  diameter  of  S  is  at  most 

Various  combinatorial  extremal  problems  previously 
considered  for  arbitrary  point  sets  have  recently  been 
studied  for  dense  sets.  These  include  convex  and  empty 
convex  subsets  [2,  20,  21],  separation  discrepancy  [1], 
mutual  avoidance  and  crossing  families  [21].  The  com¬ 
plexity  of  the  convex  hull  problem  for  (5-dense  sets  was 
determined  in  [21]. 

The  investigation  of  dense  point  sets  is  motivated 
by  the  common  discrepancy  between  the  complexity 
of  algorithms  in  the  worst  case  and  in  practical  cases. 
The  complexity  of  a  geometric  algorithm  typically  de¬ 
pends  on  certain  combinatorial  parameters  associated 
with  the  geometric  data,  and  it  is  often  the  case  that 
these  parameters  obtain  their  extrema  only  for  bizarre 
and  rare  data  sets.  In  particular,  extremal  point  sets 
constructed  in  the  literature  often  exhibit  large  distance 
ratios,  sometimes  exponential  in  the  number  of  points. 
This  is  related  to  the  fact  that  there  are  combinatorial 
types  of  point  configurations  that  require  the  distance 
ratio  be  at  least  double-exponential  in  the  number  of 
points,  see  [14,  15].  Such  sets  are  unlikely  to  occur  in 
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practical  applications.  As  a  consequence,  researchers 
spend  a  substantial  amount  of  time  and  effort  battling 
difficulties  that  are  too  rare  to  be  relevant  in  practical 
applications.  The  notion  of  density  can  be  seen  as  an 
attempt  to  make  the  theoretical  analysis  of  algorithms 
more  relevant  to  practice. 

There  are  two  related  concepts.  Random  sets  of 
points  uniformly  distributed  in  the  unit  square,  and  sets 
of  points  with  small  integer  coordinates  (grid  points). 
Dense  point  sets  behave  differently  from  such  sets,  at 
least  in  some  respect.  For  example,  a  random  set  of 
n  points  in  the  unit  square  is  likely  to  have  a  pair  at 
distance  0(i),  and  is  therefore  not  dense.  Similarly, 
a  dense  set  of  n  points  in  the  plane  can  include  the 
vertices  of  a  convex  m-gon,  m  =  D(T/n),  which  is  im¬ 
possible  for  any  set  of  points  with  integer  coordinates 
between  0  and  some  constant  times  ^/7^. 

It  is  perhaps  worthwhile  to  mention  that  dense 
point  sets  have  been  considered  in  computer  graphics 
[6],  and  that  they  commonly  appear  in  nature  (eye- 
photoreceptor  distributions,  minimum  distance  con¬ 
straints  for  molecules,  atoms  of  proteins,  etc.). 

Halving  hyperplanes.  A  notion  that  is  closely  re¬ 
lated  to  and  somewhat  more  general  than  a  halving  hy¬ 
perplane  is  that  of  a  k-set.  A  subset  T  C  5  of  a  finite  set 
S  C  is  a  k-set  if  cardT  =  k  and  there  is  a  half-space 
so  that  T  is  the  intersection  of  S  with  this  half-space. 
The  number  of  k-sets,  for  some  fixed  k,  arises  in  the 
analysis  of  geometric  algorithms,  see  e.g.  [7,  8,  12].  The 
number  of  k-sets  obtains  its  maximum  when  the  points 
are  in  general  position,  which  we  henceforth  assume. 

The  problem  of  bounding  the  maximum  number  of  k- 
sets  possible  for  a  set  of  n  points  turns  out  to  be  notori¬ 
ously  difficult,  even  in  d  =  2  dimensions.  The  largest  k 
for  which  matching  upper  and  lower  bounds  are  known 
is  3,  see  [18].  The  most  difficult  case  seems  to  be  when  k 
is  roughly  j,  which  is  the  reason  why  this  case  receives 
special  attention  as  the  problem  of  bounding  the  num¬ 
ber  of  halving  lines.  The  best  upper  bounds  for  general 
point  sets  in  is  /  log*  n),  see  [17],  and  there 

are  constructions  of  sets  with  D(nlogn)  halving  lines, 
[11,  13].  Our  results  for  dense  sets  in  consist  of  an 
adaptation  of  the  lower  bound  in  [11]  and  of  the  upper 
bound  0(n®/^/  log*  n). 

In  three  and  higher  dimensions,  the  problem  seems 
even  harder.  The  first  non-trivial  upper  bound  for  d  = 

3  dimensions  has  been  given  in  [5].  This  bound  has 
been  improved  in  [4,  10],  and  the  current  best  bound  is 
0(n®/3),  as  derived  in  [9].  In  d  >  4  dimensions  there 
are  minute  improvements  over  the  trivial  0(n‘^)  bound 
based  on  algebraic  topology  arguments  about  the  non- 
embeddability  of  certain  complexes,  see  [22].  We  have 
as  an  upper  bound  for  dense  sets  in  all  d  >  3 


dimensions. 

Outline.  Section  2  shows  the  fi(nlogn)  lower  bound 
on  the  number  of  halving  lines  is  asymptotically  un¬ 
affected  by  the  density  assumption.  Section  3  gives  a 
proof  of  the  0(71?/ log*  n)  upper  bound  on  the  number 
of  halving  lines  for  a  dense  set  of  n  points  in  M^.  The 
crux  of  the  argument  is  a  reduction  to  k-seis  for  about 
y/n  subsets  of  size  at  most  about  y/n  each.  Section  4 
considers  point  sets  in  M^.  The  approach  to  proving 
an  upper  bound  rests  on  an  area  argument  combined 
with  an  observation  about  centroids  of  subsets  of  the 
set.  This  is  extended  to  d  >  4  dimensions  in  section 
5.  Alternatively,  the  metric  argument  can  be  combined 
with  an  upper  bound  on  the  stabbing  number  of  geo¬ 
metric  d-uniform  hypergraphs  known  as  Lovasz’  lemma 
[16].  This  is  discussed  in  section  6. 

2  Lower  bound  in  the  plane 

The  constructions  in  [11,  13]  establishing  the  D(nlogTi) 
lower  bound  on  the  number  of  halving  lines  are  not 
dense:  with  minimum  distance  1  they  require  the  max¬ 
imum  distance  be  at  least  some  constant  times  n.  This 
section  shows  that  asymptotically  the  same  lower  bound 
can  also  be  obtained  for  dense  sets.  We  begin  by  modi¬ 
fying  the  construction  in  [11]  so  that  the  Kj-coordinates 
of  the  points  are  contiguous  integers. 

Lemma.  For  any  positive  even  integer  m  there  are 
points  pi,p2,  •••, Pm  in  so  that 

(i)  for  all  i  the  a;i-coordinate  of  p;  is  i,  and 

(ii)  there  are  at  least  y  logg  y  halving  lines. 

Proof.  Assume  first  that  m  =  2-Z'^ ,  k  >  0,  and  denote 
the  number  of  halving  lines  of  a  set  5  by  h{S).  We 
construct  sets  Sk,  card  (5*,)  =  2-3*',  with  slightly  higher 
number  of  halving  lines  than  claimed,  namely  h{Sk)  = 
(A:  +  l).3^-  f  loggS^. 

For  =  0  we  have  m  =  2,  So  =  {(1, 0),  (2,  0)},  and 
h{So)  =  1.  We  continue  by  induction.  Sk+i  is  the  union 
of  three  sets,  To,Ti,T2,  each  an  affinely  transformed 
copy  of  Sk  as  described  below.  By  horizontal  transla¬ 
tion,  along  the  ^i-axis,  make  sure  the  points  in  Ti  have 
a;i-coordinates  from  im  -b  1  through  (f  -f-  l)m.  Next, 
shrink  the  a;2-coordinates  of  all  points  in  Ti  by  a  suffi¬ 
ciently  large  factor.  Then,  move  the  points  vertically, 
along  the  a:2-axis,  so  that  the  shrunk  2;2-coordinates  be¬ 
come  vertical  displacements  with  respect  to  a  line,  one 
for  each  T.  The  three  lines  meet  in  point  ( ,  0) 
and  have  slopes  0,  1,  and  -1.  The  line  for  To  is  X2  =  0, 
the  one  for  Ti  is  X2  =  xi  —  and  the  one  for  T2 

is  X2  =  —xi  +  .  So  in  fact,  the  only  difference 
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between  Sk  and  To  is  in  the  a;2-coordinates  which  are 
scaled  down  for  To;  the  points  in  Ti  and  T2  also  expe¬ 
rience  horizontal  translation  and  vertical  displacement 
other  than  scaling.  The  shrinking  factor  for  each  set  Ti 
is  chosen  sufficiently  large  so  that  any  line  through  two 
points  of  Ti  separates  TJ+i  and  T;+2,  where  indices  are 
taken  modulo  3. 

Any  halving  line  of  Ti  is  also  a  halving  line  of  .  In 
addition,  5*,+!  has  —  halving  lines  connecting  a  point 
of  Ti  with  a  point  of  Tj+i  each,  for  i  =  0, 1, 2.  It  follows 
that 

h{Sk+i)  =  /i(To)  +  /i(Ti)  +  /i(T2)  +  3m/2 
=  3/i(5it)  + 3*^+1 
=  (/k-b2)3*^+i. 

To  extend  this  construction  to  arbitrary  positive  even 
m  observe  that  all  halving  lines  of  Sk+i  have  bounded 
slope.  We  can  thus  add  equally  many  points  above 
and  below  all  these  lines  and  retain  them  as  halving 
lines.  To  satisfy  the  integer  coordinate  requirement,  we 
add  the  points  with  contiguous  integer  a:i-coordinates, 
starting  at  2  •  3*  -b  1 .  After  adding  j  points  above  and 
j  points  below  all  halving  lines  of  Sk,  we  get  a  set  of 
m  =  2  •  3*  -t-  2j  points,  which  we  denote  by  Pm  for  later 
reference.  We  may  assume  j  <  2  ■  so  Pm  has  fewer 
than  three  times  as  many  points  than  Sk ,  which  implies 
the  claimed  lower  bound  for  h{Pm).  0 

The  integer  sj-coordinates  can  be  used  to  control  the 
density  of  transformed  copies  of  Pm  ■  This  is  done  in  the 
lower  bound  construction  below. 

Thm.  1  For  any  even  n  >  2  there  is  a  2-dense  set  of 
n  points  in  with  at  least  ^  logg  n  -  n  halving 
lines. 

Proof.  We  may  assume  that  n  >  3*^^,  for  otherwise  the 
claim  is  void.  Define  m  =  2-  [^J  and  M  —  2-  -|-9; 

so  m  is  even  and  M  is  odd.  Let  Y  be  the  annulus  of 
points  on  and  between  the  two  circles  with  radii  m  and 
2m,  both  centered  at  the  origin.  For  1  <  i  <  M ,  let  Ri 
be  the  half-line  with  angle  ■  27r  starting  at  the  origin. 
The  intersection  ii;  H  Y  is  a  line  segment  of  length  m. 
For  £  >  0  and  1  <  i  <  M  find  an  affine  transformation 
Qi  of  Pm  so  that 

(i)  Qi  c  y, 

(ii)  the  distance  of  any  p  G.  Qi  from  Ri  is  at  most  e, 
and 

(iii)  the  distance  between  any  two  points  in  Qi  is  at 
least  1. 

For  example,  shrink  the  Z2-coordinates  of  the  points  in 
Pm  to  within  [—£,£],  rotate  the  set  by  ^  ■  27r  about 


the  origin,  and  translate  along  Ri  until  all  points  lie  in 
Y.  Define  S  —  IjfiiQi-  For  sufficiently  small  e  >  0, 
every  halving  line  of  Qi  is  also  a  halving  line  of  S. 

Finally,  we  adjust  the  number  of  points  from  card  S  — 
M  m  —  {2\y/^  -b  9)(2[^J)  to  n.  Notice  that 


n  <  n  -b 


^  -  18  <  card  S  <n  +  <  n  -b  12m. 


We  can  delete  points  from  12  sets  Qi  until  a  set  S'  C 
S  with  card  5'  =  n  remains.  By  choosing  half  these 
points  from  Qj,  1  <  J  <  6,  and  the  other  half  from  Qj, 
<  j  <  ,  the  halving  lines  of  the  remaining 

sets  Qi  are  still  halving  lines  of  S'.  Therefore, 

h{S')  >  {M  -  12) -hiPm) 

^  (or  ‘i\  \/n-4 

>  (202  -  3) - jy—  log3  - - - 

n 

>  logs  n-n. 

The  diameter  of  S'  is  less  than  4m  and  the  number 
of  points  is  n  >  4m^.  The  density  of  S'  is  therefore 
S<^  =  2.  0 


Remark.  It  is  possible  to  improve  the  density  factor 
by  setting  m  «  \J\/Z/{A'k)  ■  0z  0.3702  and  M  s» 

^  •  27rm  Rs  Y^47r/-v/3  •  02  w  2.6902  and  by  placing  the 
m  points  of  Qi  alternately  at  distances  roughly  m  -b 
m  -b  1  -b  5,  •  •  •,  2m  —  1  -b  5  and  at  distances  roughly 
m  -b  |,  m  -b  1  -b  |,  •  •  •,  2m  —  1  -b  I  from  the  origin. 
The  diameter  is  smaller  than  4m,  so  the  density  is  at 

most  2  Rs  1.49.  The  number  of  halving  lines  is 

at  least  ^  logg  n  —  n. 

For  any  fixed  6  >  \j2y/Z/K  «  1.05,  we  can  construct 
a  5-dense  set  of  even  size  22  with  at  least  cn  log  n  —  n 
halving  lines,  where  c  =  c(5)  >  0  is  independent  of  n, 
as  follows.  Put  mi  =  4(502  and  m2  =  -  e)0i, 

where  e  =  e{6)  >  0  is  sufficiently  small.  Let  Y  be 
the  annulus  of  points  on  and  between  the  two  circles 
with  radii  mi  and  m2,  both  centered  at  the  origin.  We 
find  a  set  S  of  Am^irni  —  m2)  points  in  the  annulus  Y 
similarly  as  in  the  proof  of  Theorem  1  so  that  5  has  at 
least  cn  log  22  —  n  halving  lines.  Let  T  be  a  set  of  points 
of  a  triangle  grid  with  minimum  distance  1  which  is 
symmetric  around  the  origin  and  does  not  contain  the 
origin.  Let  T'  be  the  intersection  of  T  with  the  disk  of 
radius  m2  —  1  centered  at  the  origin.  We  may  certainly 
assume  that  each  of  the  cn  log  n  —  n  halving  lines  of  S 
cuts  T'  into  two  equal-sized  parts.  Then,  5  U  T'  has 
at  least  cn  log  n  —  n  halving  lines.  If  £  >  0  was  chosen 
sufficiently  small,  S  UT'  has  at  least  n  points.  Finally, 
we  delete  (card  (S'  U  T')  —  n)/2  pairs  of  points  of  T', 
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each  pair  symmetric  around  the  origin,  so  that  exactly 
n  points  remain  in  S  Li  T' .  This  gives  us  a  i-dense  set 
of  size  n  with  at  least  cn  log  n  —  n  halving  lines. 

3  Upper  bound  in  the  plane 

Let  /fc(n)  be  the  maximum  number  of  k-sets  in  a  set 
of  n  points  in  the  plane,  and  let  f{n)  =  max{/fc(n)  : 

0  <  k  <  n}.  As  mentioned  in  the  introduction,  /(n) 
0(nt/log*  n)  is  the  best  known  upper  bound  on  f{n). 
We  show  that  (5-dense  sets  admit  smaller  bounds.  In 
particular,  the  result  implies  that  0(nJ/log*n)  is  an 
upper  bound  for  dense  sets. 

Thm.  2  The  number  of  halving  lines  of  a  6-dense  set 
of  n  points  in  is  at  most  0{6y/n  ■  /(35i/ri)). 

Proof.  Let  5  be  a  ^-dense  set  of  n  points  in  M^.  Con¬ 
sider  a  general  direction  7  €  [0,7r)  and  a  line  £  =  £{j) 
with  direction  7  so  that  equally  many  points  of  S  lie  on 
both  side  of  £.  The  length  of  ^  C  conv  S  is  at  most  the 
diameter  of  5,  which  is  at  most  S^/n.  Let  R  =  Rij)  be 
the  smallest  rectangle  that  contains  the  ^-neighborhood 
of  £  n  conv  5;  its  length  is  at  most  6y£n  -I-  I,  its  width 
is  I,  so  its  area  is  at  most  (5-y/n  -f-  1.  For  each  point 
peSOR  consider  the  disk  Dp  ^  {x  \  \xp\  <  i},  where 
\xp\  is  the  Euclidean  distance  between  x  and  p.  The 
area  of  R  covered  by  Dp  is  at  least  Since  all  such 

disks  are  disjoint,  this  implies  that 

card  (5  n  if)  <  <  —  ■  ^\/n  -I-  3. 

8 

We  say  a  line  h  meets  R  shortside  if  h  meets  the  two 
short  sides,  which  have  length  1  each.  A  halving  line 
of  S  necessarily  meets  £  within  conv  S  and  thus  within 
R.  It  follows  that  if  /i  is  a  halving  line  then  it  meets 
R  shortside  if  its  direction  differs  from  7  by  at  most 
X,  where  -|-  ^tani;  =  5,  or  equivalently,  x  = 

arctan  Since  6^/n  certainly  exceeds  1,  we  have 

1  ^  1  ^  1  ^  tt 

2SVE+1  -  25V^+l  -  3-6^^  10-  6^' 

Choose  m  =  [5  •  6y/n]  and  assume  that  all  directions 

=  0  — TT  —TT . 2^7r  are  general  in  the  sense  that 

no  two  points  of  S  lie  on  a  line  with  such  a  direction. 
Two  contiguous  directions  differ  by  less  than  2x,  so  each 
halving  line  meets  at  least  one  of  the  rectangles  R{j) 
shortside. 

To  finish  the  argument  observe  that  each  rectangle, 
R,  is  met  shortside  by  at  most  f{^6^/n  +  3)  halving 
lines,  because  they  are  all  split  S  Cl  R  the  same  way.  It 
follows  that  the  total  number  of  halving  lines  is  at  most 

m  ■  /(  — -f  3)  =  0(6\/n  •  /(3(5\/n)), 

TT 


as  claimed.  ^ 

Remark.  Let  fk{n,6)  be  the  maximum  number  of  k- 
sets  in  a  5-dense  set  of  n  points  in  the  plane,  and  let 
/(n,5)  =  max{/it(n,  5)  :  0  <  fc  <  n}.  Thus,  Theorem  2 
yields 

^(n,5)  =  0(5^^^•/(35^/)^)), 

where  F  =  |  -  1 .  An  analogous  argument  gives  also 

/(n,5)  =0(5^/)^■/(35^^)). 

Remark.  Let  us  redefine  R  —  R{'y)  in  the  proof  of 
Theorem  2  to  be  the  smallest  rectangle  that  contains 
the  e-neighborhood  of  ^Plconv  5,  where  e  >  0  is  a  small 
positive  constant.  In  this  case,  the  points  of  S  H  R  are 
almost  collinear.  We  call  5  C  of  size  n  8-line-dense  if 
there  is  a  line  q  such  that  the  orthogonal  projection  of  S 
to  5  is  a  5-dense  set  in  the  1-dimensional  interpretation. 

Let  lk{n,6)  be  the  maximum  number  of  k-sets  in  a  5- 
line-dense  set  of  n  points  in  the  plane,  and  let  /(n,  5)  = 
max{/jt(n,5)  :  0  <  ^  <  n}.  Analogous  arguments  as 
in  sections  2  and  3  yield  that  for  any  fixed  5  >  5o  = 

^/2\/3/7r  and  5'  >  1, 

/(n,  5)  =  0(\/n  •  /(ci\/n,  5'))  =  0(/(c2n,  5)), 

where  ci  and  C2  are  two  constants  depending  only  on  5 
and  5'. 

In  particular,  if  the  function  /(n,5)  is  smooth  as  a 
function  of  n  (i.e.,  f{cn,S)  —  0(/(n,5))  for  any  con¬ 
stant  c)  then 

fin,  8)  =  @i^Jn  ■li\/n,8')), 

and  determining  the  asymptotic  behavior  of  the  max¬ 
imum  number  of  halving  lines  is  equally  difficult  for 
dense  sets  as  for  line-dense  sets.  We  believe  that  the 
function  fin,  8)  (and  also  the  function  fin))  is  smooth 
but  have  no  proof  at  this  moment. 

Remark.  Note  that  Thm.  2  can  be  bootstrapped.  So 
if  dense  sets  asymptotically  maximize  the  number  of 
halving  lines,  then  this  maximum  is  0(n  polylog  n). 

4  Upper  bound  in  space 

Let  S'  be  a  dense  set  of  n  points  in  M^.  As  mentioned 
in  the  introduction,  we  derive  an  upper  bound  on  the 
number  of  halving  planes  using  an  area  argument  for 
triangles  spanned  by  points  in  S.  In  particular,  we  need 
a  lower  bound  on  the  total  area  of  a  collection  of  m  such 
triangles.  We  begin  with  two  lemmas.  The  collection 
of  subsets  of  S  with  size  k  is  denoted  by  (j.). 
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Lemma  A.  Let  5  be  a  set  of  n  points  with  minimum 
distance  1  in  Then 


\i  A\  <  A2  <  ■  ■  ■  <  Am  is  the  ordered  sequence  of  the 
M  —  (3)  triangle  areas,  we  can  rewrite  this  as 


E 


|P9l 


<  3  •  n5. 


Proof.  Consider  the  ordered  sequence  of  the  N  =  Q) 
distances:  xi  <  2:2  <  . . .  <  xjq.  The  pigeon-hole  princi¬ 
ple  implies  that  for  each  index  i  there  is  a  point  with  at 
least  ^  points  at  distance  x,-  or  less.  So  we  have  at  least 
^  -h  1  points  inside  a  ball  of  radius  x,- .  Because  of  the 
minimum  distance  assumptions  we  get  at  least  ^  +  1 
pairwise  disjoint  open  balls  of  radius  |  each  inside  a 
ball  of  radius  By  dividing  volumes  we  get 


Hi 

n 


+  1  < 


=  8(X.-  +  <  3"  ■  xl 


It  follows  that  Xi  >  ,  and  therefore 


E 


1 

\pq\ 


N  ,  N 

<3  n2-E 


1  =  1 


□ 


Based  on  the  bound  for  distances  we  can  now  prove 
a  bound  for  areas  of  triangles. 

Lemma  B.  Let  S  be  a  set  of  n  points  with  minimum 
distance  1  and  maximum  distance  D  in  Then 
the  total  area  of  m  >  121>n^  different  triangles  is 
at  least  /n^. 

Proof.  For  two  points  p,  ?  €  5  consider  all  third  points 
r  £  5  so  that  the  area  of  the  triangle  pqr  is  at  most  A 
and  pq  is  its  longest  edge.  The  distance  from  r  to  the 
line  through  p  and  q  is  therefore  at  most  and  r 
lies  between  the  two  planes  through  p  and  q  normal  to 
pq.  The  number  of  points  r  that  can  possibly  lie  inside 
this  cylinder  can  be  bounded  by  a  packing  argument. 
After  growing  by  |  in  every  direction,  the  volume  of  the 
cylinder  is 


0  A  1 

V  ^i\pq\  A  A -r.< 


\P<l\ 


+  47rA  -I- 


■^\Pl\ 

2 


because  |p?|  -|-  1  <  2|p?|.  It  contains  at  most  pair¬ 
wise  disjoint  open  balls  of  radius  |  each.  Using  lemma 
A  we  get  an  upper  bound  on  the  number  of  triangles 
with  area  A  or  less: 

-V  <  144A^n3  -I-  12An^  + 
w  2 

{p,?}6(f) 


i  <  144A?n5  +  l2Ain^  + 


For  i  >  6Dn^  this  implies  that  at  least  one  of  144A?n3 
and  12A,n^  exceeds  \{i  -  The  first  in¬ 

equality  implies 


Ai  > 


14nf  ’ 


which  is  weaker  than  the  second  inequality  provided 
i  >  |n3.  This  is  indeed  the  case  if  i  >  6Dn^,  because 
D  >  in  3,  which  can  be  proved  by  a  straightforward 
packing  argument.  Finally,  the  total  area  of  m  =  2i  > 
12Dn^  triangles  is  at  least 


i  -  Ai> 


m  (f)" 

2  14  •  n« 


1  m2 

40 


□ 

The  lower  bound  on  the  total  area  of  m  triangles  is 
sufficient  to  prove  an  upper  bound  on  the  number  of 
halving  planes  of  a  ^-dense  set  S.  Assume  n  is  odd 
and  define  k  =  For  every  subset  T  C  S  of  size 

k,  let  7t  be  the  centroid,  that  is,  77-  =  ^  P-  i'®*' 
Sk  be  the  set  of  Wold  centroids,  that  is,  Sk  =  {7t  I 
T  C  5,  card  T  =  k].  The  convex  hull  of  Sk  is  a  three- 
dimensional  convex  polytope,  Pk-  Note  that  Pk  is  con¬ 
tained  in  the  convex  hull  of  S,  and  by  density  assump¬ 
tion,  its  projections  to  the  three  coordinate  planes  have 
area  at  most  6'^n^  each.  It  follows  that  the  surface  area 
of  Pk  is  less  than  66^ . 

We  claim  that  if  the  plane,  fi,  through  points  p,q,r  E 
S  is  a  halving  plane  then  a  homothetic  copy  of  the  tri¬ 
angle  pqr  is  a  face  of  Pk-  Let  17  C  5  be  the  subset  of 
points  on  one  side  of  h,  so  card  U  =  k  —  1.  For  each 
^  G  {P)9)F},  there  is  a  plane  that  separates  U  U  {x} 
from  S  —  U  —  {x},  and  therefore  'yuu{x}  is  a  vertex 
of  Pk-  Furthermore,  the  centroids  defined  by  {/  U  {p}, 
U  U  {?},  and  U  U  {r}  span  a  triangular  face  of  Pk-  This 
face  is  a  homothetic  copy  of  pqr,  with  scaling  factor 
p  In  summary,  if  the  plane  through  points  p,q,r  is  a 
halving  plane  then  a  homothetic  copy  of  pqr  is  a  face 
of  Pk,  and  the  area  of  pqr  is  k^  times  the  area  of  the 
corresponding  facet.  Since  the  surface  area  of  Pk  is  less 
than  the  total  area  of  all  triangles  correspond¬ 
ing  to  halving  planes  is  less  than  <  |(5^n®/^. 

Lemma  B  thus  implies  the  following  upper  bound  on 
the  number  of  halving  planes. 

Thm.  3  The  number  of  halving  planes  of  an  (5-dense 
set  of  n  points  in  is  less  than  20  • 
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5  Four  and  higher  dimensions 

The  area  argument  of  section  4  can  be  extended  to  M'*, 
d  >  4.  To  simplify  the  argument,  only  dense  sets  of  n 
points  in  are  considered,  so  the  diameter  is  D  <  6ni 
for  some  constant  6.  Suitable  positive  constants  are 
used  throughout  this  section  and  denoted  as  c,  with  or 
without  sub-  and  superscript.  A  set  T  of  cardT  =  k  + 
1  <  d  +  1  points  in  spans  a  fc-simplex,  ctt  =  conv  T. 
Its  ^;-dimensional  Euclidean  measure  is  denoted  by  |<T7’|. 
We  let  Sjc  (A)  denote  the  number  of  A;-simplices  spanned 
by  the  points  whose  measure  is  A  or  less.  All  logarithms 
are  to  the  base  2. 

Lemma  C.  Let  S  be  a  dense  set  of  n  points  inis'*.  Then 
Si  (A)  is  bounded  from  above  by 


c  •  nA'*  for  k  =  I, 

(2) 

c  ■  n*-+  <i  A'*  log  A -f- for  2  <  A;  <  d. 


Proof.  Let  F  be  an  (i  —  l)-flat,  1  <  i  <  d.  The  (i  — 
l)-dimensional  measure  of  its  intersection  with  conv  5 
is  less  than  It  follows  that  the  d-dimensional 

measure  of  the  cylinder  of  points  at  distance  z  +  ^  of 
less  from  F  is  less  than  ci{z  -f-  With  a 

different  constant,  Cj,  this  is  also  an  upper  bound  on 
the  number  of  points  in  5  at  distance  z  or  less  from  F. 

Consider  an  arbitrary  ordering,  po,pi, . .  -tPk,  of  the 
vertices  of  a  ^-simplex  (Tt,T  C  S.  For  1  <  i  <  ^,  let 
Xi  be  the  distance  of  pi  from  the  (z  —  l)-flat  spanned  by 
Po , . . . , Pt_  1 .  If  zi,  Z2, Zk  is  a.  sequence  of  distances 
so  that  Xi  <  Zi  for  all  i,  then 


The  number  of  A;-simplices  for  which  (1)  holds  is 
bounded  from  above  by 


d-i  +  l  D'-ll 


The  number  of  A;-simplices  with  at  least  one  x,-  at  most 
1  is  less  than  c'  -n^  ■  .  For  the  others  we  can  assume 

all  Zi  exceed  1,  so  the  upper  bound  can  be  simplified  to 


For  a  given  A,  we  can  now  derive  an  upper  bound  on 
Si(A),  the  number  of  F-simplices  (Tt  with  \aT\  <  A. 
For  fc  =  1  set  zi  =  A,  get  |(Tt|  =  xi  <  A  from  (1)  and 
<  C2nA'*  from  (2).  This  completes  the  proof  of 
the  first  inequality  in  lemma  C.  For  k  >  2  we  consider 


all  sequences  Zi  though  Zk  so  that  z,-  >  1,  for  all  i,  and 
n?=i  =  Simplices  are  counted  only  if  all  their  x.  ’s 
exceed  1;  the  others  are  covered  by  the  second  term 
on  the  right  side  of  the  second  inequality  in  lemma  C. 
To  consider  all  sequences  we  use  induction  and  the  fact 
that  W^iZl  Zi  So 


t(A)  <  r  Sk-i{-)-z 

Jz  =  l  ^ 


<i— A;  +  l  j-)k  —  l 


■D'^-^dz. 


For  k  =  2  we  get 


rA  1 

S2(A)  <  c  ■  nA^  ■  D  /  -c 

Jz  =  l  2 


<  C2  •  nA'*  log  A  ■  D, 


and  for  ^;  >  3  we  get 


k  f  ^  1 

sjfe(A)  <  c  •  nA‘*log  A  ■  £>(=)  /  .  ,  , - dz 

Jz=i  log^ 

<  c^'-nA'*  log  A  •£)(=). 


The  upper  bound  on  Sk{A)  can  be  used  to  prove  a 
lower  bound  on  the  total  measure  of  a  collection  of  k- 
simplices,  all  spanned  by  5. 

Lemma  D.  Let  5  be  a  dense  set  of  n  points  in  M"*.  The 
total  F-dimensional  measure  of  m  >  c"  • 
F-simplices  is  at  least 

J  m*+?/(c  •  ni)  for  A-  =  1 

\  1,1  2d-^k(k-l)  1 

[  /(c  •  n  2“*^  log<‘  m)  for  2  <  k  <  d. 

Proof.  Let  Ai  <  A2  <  . . .  <  Am  be  the  ordered  se¬ 
quence  of  the  M  —  A-simplex  measures.  Assume 

c"  >  4c'  so  the  majority  of  any  set  of  m  A-simplices  have 
all  Xi  exceeding  1,  see  the  proof  of  lemma  C.  For  k  =  I, 
we  can  rewrite  the  result  in  lemma  C  as  z  <  c  ■  nAf,  so 

m  -  m  1  j_  1 

E.  1  .1 

Ai  >  — >  C3 - 

.•=1  (c")"  .=1 

For  k  >  2  and  z  >  y  >  2c'n*'+^,  we  get  z  <  2c  • 
n^Af  log  A,-,  where  £  =  1  -f-  {^)/d.  Thus 


A  (z  ■  n 

Ai  >  C4  •  ^ - . 

log"*  (z  ■  n~^) 

For  the  total  measure  we  get 


m  .1 
Id 


Ej  ^  X — ^  ^  ^ 


n  <1  log  m 
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because  ^ 


□ 


Remark.  For  d  =  S,  k  =  2,  and  D  proportional  to 
n3,  the  bounds  in  lemmas  B  and  D  agree  up  to  the 
logarithmic  factor.  The  larger  D  gets  the  further  the 
bound  in  lemma  D  lags  behind  the  one  in  lemma  B. 

For  k  =  d  —  1,  lemma  D  says  that  m  >  c"  ■ 

{d—  l)-simplices  have  total  {d—  l)-dimensional  measure 
at  least 


m 


i+i 


c  •  n 


> 


C5 


log  5 


log'*  n 


_3 _ ^ 

2«i  d2  ^ 


The  exponent  of  n  exceeds  d  —  ^  for  d  >  4.  This  is  worth 
noticing  because  the  same  argument  on  centroids  which 
leads  to  theorem  3  in  section  4  implies  that 


is  an  upper  bound  on  the  total  measure  of  all  (d  —  1)- 
simplices  spanning  halving  hyperplanes.  It  follows  that 
there  cannot  be  m  >  c"  •  such  (d  —  l)-simplices, 

because  this  would  contradict  lemma  D. 


Thm.  4  The  number  of  halving  hyperplanes  of  a  dense 
set  of  n  points  in  d  >  4,  is  less  than  c  • 


6  Stabbing  number 

As  described  by  Santalo  [19],  there  is  amotion-invariant 
measure  on  lines  in  with  the  property  that  the  mea¬ 
sure  of  lines  intersecting  a  triangle  is  twice  its  area. 
This  measure  can  be  used  to  extend  theorem  3  to  the 
stabbing  problem  for  triangles  defined  by  a  (5-dense  set 
S  C  R^.  The  measure  of  lines  intersecting  the  convex 
hull  of  5  is  bounded  by  If  we  take  every  line 

with  the  multiplicity  of  its  stabbing  number  in  a  set  of 
m  triangles  defined  by  S,  then  this  measure  is  twice  the 
total  area  of  these  triangles.  Lemma  B  shows  that  this 
area  is  provided  m  >  c6n^^^.  So  if  the 

set  of  triangles  has  the  property  that  every  line  stabs 
at  most  O(n^)  of  them,  then  m  —  1^),  which  is 

the  same  bound  as  in  theorem  3.  Lovasz’  lemma  guar¬ 
antees  that  this  is  indeed  the  case  for  triangles  that  span 
halving  planes.  Thus,  the  motion-invariant  measure  for 
lines  together  with  Lovasz’  lemma  gives  another  proof 
of  theorem  3  based  on  the  area  bound  in  lemma  B.  The 
same  can  be  said  about  d  >  4  dimensions,  theorem  4, 
and  lemma  D. 
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1  Introduction 

1.1  Overview  of  the  Results 

The  greedy  triangulation  (GT)  of  a  set  S  of  n  points 
in  the  plane  is  the  triangulation  obtained  by  starting 
with  the  empty  set  and  at  each  step  adding  the  short¬ 
est  compatible  edge  between  two  of  the  points,  where 
a  compatible  edge  is  defined  to  be  an  edge  that  crosses 
none  of  the  previously  added  edges.  In  this  paper  we 
present  a  simple,  practical  algorithm  that  computes  the 
greedy  triangulation  in  expected  time  O(nlogn)  and 
space  0(n)  for  points  uniformly  distributed  over  any 
convex  shape.  A  variant  of  this  algorithm  should  also 
be  fast  for  many  other  distributions. 

We  first  describe  a  surprisingly  simple  method  for 
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10th  Computational  Geometry  94-6/94  Stony  Brook,  NY.  USA 
©  1994  ACM  0-89791 -648-4/94/0006..$3.50 


testing  the  compatibility  of  a  candidate  edge  with  edges 
in  a  partially  constructed  greedy  triangulation.  Our 
method  requires  0(1)  time  for  both  the  edge  test  and 
the  update  of  our  data  structure,  0(n)  time  for  initial¬ 
ization,  and  0(n)  space. 

We  next  prove  that  an  edge  cannot  be  greedy  if  a 
small  disk  centered  at  its  midpoint  contains  a  point  from 
5  in  both  half-disks.  Based  on  this,  we  are  able  to  prove 
that  all  edges  in  a  greedy  triangulation  of  uniformly 
distributed  points  over  a  convex  compact  region  C  are 
expected  either  to  be  short  or  to  have  both  endpoints 
near  the  boundary  of  C. 

These  results  allow  us  to  develop  an  algorithm  that 
computes  the  greedy  triangulation  for  uniformly  dis¬ 
tributed  points  in  0(n  log  n)  expected  time.  We  also 
present  another  two-phase  algorithm  that  is  less  tuned 
to  uniform  distributions.  It  runs  in  expected  time 
0(n  log^  n)  time  on  uniformly  distributed  points,  and 
0(n^  log  n)  in  the  worst  case.  These  algorithms  should 
be  compared  to  an  algorithm  by  Levcopolous  and  Lin- 
gas  [20].  For  the  more  restricted  case  of  points  uni¬ 
formly  distributed  in  the  unit  square,  their  algorithm 
runs  in  expected  time  0(n).  Extending  it  to  rectangles 
is  straightforward,  but  extending  it  to  non-rectangular 
convex  shapes  would  require  non-trivial  modications  of 
the  algorithm  and  analysis.  Although  their  algorithm 
is  beautiful  theoretically,  it  has  not  been  implemented 
and  would  be  difficult  to  implement  practically.  Our 
algorithms  not  only  work  efficiently  for  more  general 
compact  convex  regions,  but  the  simplicity  of  our  algo¬ 
rithms  and  smaller  constant  factors  would  make  them 
preferable  for  most  practical-sized  problems. 
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1.2  Background 

Efficiently  computing  the  greedy  triangulation  is  a  prob¬ 
lem  of  long  standing,  going  back  at  least  to  1970  [9].  A 
number  of  the  properties  of  the  GT  have  been  discov¬ 
ered  [18,  22,  26,  27]  and  the  greedy  algorithm  has  been 
used  in  applications  [5,  27]. 

A  straightforward  approach  to  computing  the  GT  is 
to  compute  all  (2)  distances,  sort  them,  and  then  build 
the  GT  an  edge  at  a  time  by  examining  each  pair  in 
order  of  length  and  adding  or  discarding  it  based  on  its 
compatibility  with  the  edges  already  added.  It  is  easy 
to  see  that  this  method  requires  O(n^)  space  and  time 

T{n)  =  0{n^logn  +  f(n)  +  ng{n))  (1) 

where  0(n^  log  n)  is  the  time  required  for  an  optimal 
comparison-based  sort  on  (2)  distances,  f{n)  is  the  time 
required  to  test  new  edges  for  compatibility,  and  g(n) 
is  the  time  required  to  update  the  data  structure  when 
a  new  greedy  edge  is  added  [29].  A  naive  test  would 
compare  each  new  potential  edge  to  each  of  the  existing 
edges  (of  which  there  are  at  most  0(n))  for  an  O(n^) 
time  algorithm.  Gilbert  [10]  presented  a  data  struc¬ 
ture  allowing  an  O(logn)  time  compatibility  test  and 
an  0(n  log  n)  time  update,  thus  improving  the  algo¬ 
rithm’s  overall  time  complexity  to  0{n^  logn),  without 
adversely  affecting  space  complexity.  He  does  this  by 
building  a  segment  tree  for  each  point  in  the  set,  where 
the  endpoints  of  the  “segments”  are  the  polar  angles  be¬ 
tween  the  given  point  and  every  other  point  in  the  set. 
Manacher  and  Zobrist  [27]  have  since  given  an  0{n^) 
expected  time  and  0{n)  space  greedy  triangulation  al¬ 
gorithm  that  makes  use  of  a  probabilistic  method  for 
pretesting  compatibility  of  new  edges.  Note  that  our 
approach  also  uses  this  “generate  and  test”  paradigm, 
and  that  we  gain  improvements  over  previous  results 
by  generating  fewer  edges  and  supplying  more  efficient 
tests. 

An  alternate  approach  to  “generate  and  test”  is  to 
generate  only  compatible  edges.  One  way  to  do  this 
was  discovered  independently  by  Goldman  [11]  and  by 
Lingas  [24].  The  method  uses  the  generalized  or  con¬ 
strained  Delaunay  triangulation  [3,  15,  33].  The  con¬ 
strained  Delaunay  triangulation  is  required  to  include  a 
set  of  edges  E.  The  rest  of  the  edges  in  the  triangulation 
have  the  property  that  the  circumcircle  of  the  vertices 
of  any  triangle  contains  no  point  visible  from  all  three 
vertices. 

This  alternate  approach  computes  the  constrained 
Deluanay  triangulation  of  the  points  with  the  current 
set  of  GT  edges  as  the  set  E.  The  next  edge  to  be 
added  to  the  GT  can  be  found  in  linear  time  from  the 
constrained  Delaunay  triangulation.  The  triangulation 
must  then  be  updated  to  include  the  new  edge  in  E, 
which  takes  0(n  log  n)  time  in  the  worst  case.  This 
gives  an  0(n^  log  n)  time  and  0(n)  space  algorithm, 


thus  improving  the  space  complexity  of  Gilbert’s  algo¬ 
rithm  without  affecting  the  worst  case  time.  Lingas  [24] 
shows  that  his  method  runs  in  O(nlog^'^n)  for  points 
chosen  uniformly  from  the  unit  square. 

Recently  Levcopoulos  and  Lingas,  and  independently 
Wang,  have  shown  how  to  do  the  update  step  in  0(n) 
time,  using  a  modification  of  the  linear-time  algorithm 
for  computing  the  Voronoi  diagram  of  a  convex  polygon 
[1],  leading  to  an  O(n^)  time  and  0(n)  space  algorithm 
in  the  worst  case  [19,  31].  More  recently  Levcopoulos 
and  Lingas  give  a  modification  of  this  algorithm  that 
is  expected  to  take  0(n)  time  for  points  uniformly  dis¬ 
tributed  in  a  square  [20].  These  methods  are  elegant, 
but  are  significantly  more  complicated  than  our  meth¬ 
ods  and  should  be  slower  for  practical-sized  problems. 

One  use  of  the  greedy  triangulation  is  as  an  approx¬ 
imation  to  the  minimum  weight  triangulation  (MWT). 
Given  a  set  S  of  n  points  in  the  plane,  a  Minimum 
Weight  Triangulation  (MWT)  of  5  is  a  triangulation 
that  minimizes  the  total  length  of  all  edges  in  the  tri¬ 
angulation.  The  MWT  arises  in  numerical  analysis 
[22,  25,  29].  In  a  method  suggested  by  Yoeli  [34]  for  nu¬ 
merical  approximation  of  bivariate  data,  the  MWT  pro¬ 
vides  a  good  approximation  of  the  sought-after  function 
surface.  Wang  and  Aggarwal  use  a  minimum-weight 
triangulation  in  their  algorithm  to  reconstruct  surfaces 
from  contours  [32].  Though  it  has  been  shown  how  to 
compute  the  MWT  in  time  O(n^)  for  the  special  case 
of  n- vertex  polygons  [14],  there  are  no  known  efficiently 
computable  algorithms  for  the  MWT  in  the  general  case 
[29].  We  therefore  seek  efficiently  computable  approxi¬ 
mations  to  the  MWT. 

Although  neither  the  GT  nor  the  Delaunay  triangu¬ 
lation  (DT)  yields  the  MWT  [26,  25],  the  GT  appears 
to  be  the  better  of  the  two  at  approximating  it.  In  fact, 
for  convex  polygons  the  GT  approximates  the  MWT  to 
with  a  constant  factor  while  the  DT  can  be  a  factor  of 
f2(n)  larger  [18].  For  general  point  sets,  the  DT  can  be  a 
factor  of  fl(n)  larger  than  the  MWT,  but  the  best  lower 
bound  for  the  GT  is  Q(^/n)  [13,  17].  For  points  lying 
on  a  convex  polygon  or  uniformly  distributed  points  in 
a  square,  both  the  GT  and  the  DT  are  expected  to  be 
within  a  constant  factor  of  the  MWT  [20,  4].  For  these 
reasons  a  large  amount  of  effort  has  gone  into  finding  ef¬ 
ficient  methods  for  computing  the  greedy  triangulation. 

Other  heuristics  for  approximating  the  MWT  have 
also  been  developed.  Plaisted  and  Hong  have  developed 
a  complicated  polynomial-time  heuristic  that  is  guaran¬ 
teed  to  be  within  a  factor  of  O(logn)  of  the  MWT  [28]. 
The  best  heuristics  known  so  far  are  those  of  Lingas 
[23]  and  Heath  and  Pemmaraju  [12].  These  approaches 
make  use  of  the  convex  hull  and  a  spanning  tree  to  create 
a  single  cell,  and  then  use  an  optimal  cell  triangulation 
algorithm  generalized  from  Gilbert’s  dynamic  program¬ 
ming  approach  for  computing  the  MWT  of  points  on 
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a  convex  polygon  [10].  However,  though  both  methods 
appear  empirically  to  produce  triangulations  much  bet¬ 
ter  than  the  GT,  both  of  these  algorithms  require  O(n^) 
time  which  is  impractical  for  large  point  sets.  In  [12], 
the  authors  only  report  on  data  for  sets  of  size  50  and 
smaller.  We  also  note  that  with  the  method  of  Heath 
and  Pemmaraju,  the  greedy  triangulation  still  remains 
of  interest  as  it  is  a  substep  in  their  algorithm. 

1.3  Notation 

Throughout  the  paper,  we  let  d(p,  q)  be  the  distance 
from  point  ptoq  using  the  standard  Euclidean  distance 
metric. 

2  A  New  Edge  Test  for  the  GT 

We  now  present  our  new  method  for  testing  the  com¬ 
patibility  of  edges  in  a  greedy  triangulation.  We  will 
first  present  some  definitions,  and  then  a  new  theorem 
stating  a  property  of  any  pair  of  points  that  does  not 
form  an  edge  in  the  GT.  Following  the  theorem,  we  will 
present  the  edge  test  method  with  a  proof  of  its  correct¬ 
ness  and  a  complexity  analysis  of  the  run  time  required 
by  each  of  the  operations. 

Definition  1  In  a  straight  line  planar  graph  T,  a  clock¬ 
wise  chain  from  pi  (hereafter  written  “CW  chain  from 
Pi  ”)  is  a  sequence  of  points  pi, . . .  such  that  for  1  < 
i  <  k,piPi+i  are  edges  in  T,  and  for  1  <  i  <  k,piPi+i 
is  the  next  edge  around  point  pi  in  a  clockwise  direction 
frompi-ipi. 

Definition  2  Let  pi,.--,Pk  he  a  CW  chain  in  straight 
line  planar  graph  T.  If  {pi,p2)  is  the  first  edge  in  a 
clockwise  direction  from  a  segment  piq  (with  piq  not 
necessarily  an  edge  in  T ),  then  we  say  that  Pi,  ■  ■  ■  ,Pk  Is 
a  CW  chain  with  respect  to  piq. 

We  define  a  counter-clockwise  chain,  or  CCW  chain,  in 
a  similar  fashion. 

The  new  compatibility  test  method  is  based  on  the 
following  lemma  and  theorem. 

Lemma  1  Given  a  set  S  of  n  points,  let  x,  y,  and  z  be 
oriented  so  that  they  form  a  CW  triangle.  Let  T  be  the 
partial  triangulation  in  the  standard  greedy  algorithm  at 
the  time  when  xz  is  tested  for  compatibility  or  at  some 
later  time.  Let  x,  y,  z  be  a  CCW  chain  in  T.  Then 
triangle  xyz  contains  no  points  in  S  in  its  interior  and 
xz  is  either  compatible  with  T  or  already  in  T.  (The 
lemma  is  also  true  if  CW  and  CCW  are  interchanged.) 

Theorem  1  (CW/CCW  Chain  Theorem) 

Given  a  set  S  of  n  points  and  a  non-negative  integer 


k  <  (2) ,  let  T  be  a  partial  greedy  triangulation  of  S 
constructed  using  the  standard  greedy  approach  of  exam¬ 
ining  the  first  k  interpoint  pairs  in  nondecreasing  order 
of  distance  and  adding  to  T  exactly  those  edges  compat¬ 
ible  with  previously  added  edges.  Now  let  (p,  q)  be  the 
{k  -f  1)’*  pair  to  be  tested  with  6  =  d(p,  q).  If  edge  pq  is 
not  compatible  with  T,  then  T  contains  a  CW  or  CCW 
chain  p,vi, ...  ,Vk,x  (or,  respectively,  q,vi, . . .  ,Vk,x) 
with  respect  to  pq  that  has  the  following  five  properties: 

1)  edge  VkX  intersects  segment  pq, 

2)  points  vi,...,Vk  lie  within  the  8  x  26  rectangle 
R  that  is  divided  into  two  squares  by  segment  pq  and 
d{p,  u,)  <  8  (respectively  d{q,  Vi)  <  8)  for  1  <  i  <k, 

3)  ifd{q,vi)  >  8  (respectively  d{p,vi)  >  8)  then  k  = 

1, 

4 )  the  vertices  p,v\, . . .  ,Vk  (all  edges  of  the  chain  be¬ 
fore  VkX )  form  a  convex  chain,  and 

5)  VkX  is  the  closest  edge  to  p  (respectively  q)  of  all 
edges  in  T  intersecting  pq. 

2.1  Method  and  Proof  of  Correctness 

We  now  give  a  fast  greedy  triangulation  edge  test  based 
on  Theorem  1.  To  determine  whether  edge  pq  is  compat¬ 
ible,  the  algorithm  examines  the  CW  and  CCW  chains 
from  p  and  q  with  respect  to  pq.  The  method  actually 
requires  that  we  examine  at  most  only  two  edges  on  each 
of  these  chains.  It  returns  TRUE  if  the  edge  is  compat¬ 
ible  and  FALSE  otherwise.  For  reasons  of  brevity,  we 
give  only  the  first  part  of  this  test:  checking  the  CCW 
from  p.  Steps  2,  3,  and  4  are  symmetric  except  that  in 
Step  4  instead  of  a  goto  Step  5  the  algorithm  would  re¬ 
turn  TRUE.  In  part  le.,  Cq  is  the  circle  of  radius  d{p,  q) 
centered  at  q. 

2.1.1  Edge  Test  for  (p,  q)  in  T 

Step  1:  CCW  Chain  from  p 

la.  Let  ,  V2  be  the  next  two  points  on 

the  CCW  chain  from  p  (with  respect  to  pq). 

lb.  IF  {Lqpvi  >  7r/2)  OR  (no  CCW  chain  exists) 
THEN  goto  Step  2. 

lc.  IF  V1V2  intersects  pq  THEN  return  FALSE. 

ld.  IF  V2  =  g  THEN  return  TRUE 

le.  IF  vi  is  strictly  inside  Cq  AND  V2  ^  q 

THEN  return  FALSE. 

ELSE  goto  Step  2. 


Proof  of  Correctness  We  now  show  that  the  above 
algorithm  correctly  determines  whether  edge  pq  is  com¬ 
patible  with  the  edges  already  added  to  T  (assuming 
the  edges  are  added  in  greedy  fashion.)  We  give  a  proof 
for  Step  1;  the  other  three  steps  are  exactly  analagous. 

The  first  two  cases  are  obvious.  Eirst  assume  that 
vi  is  outside  R  (or  equivalently  that  no  CCW  chain 
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exists).  It  follows  directly  from  Theorem  1  that  if  pq 
is  not  compatible  we  will  find  a  CW  or  CCW  chain  on 
one  of  the  subsequent  steps  and  we  can  immediately 
go  to  Step  2.  Now  assume  that  V1V2  intersects  pq.  By 
definition,  pq  is  not  compatible  with  T  and  we  can  halt. 

Now  assume  that  V2  =  q.  That  is,  the  second  edge  on 
the  CCW  chain  is  viq.  In  this  case  Lemma  1  says  that 
pq  is  compatible,  so  we  can  halt  and  answer  TRUE. 

If  ui  is  strictly  inside  Cg  and  V2  ^  q,  then  we  claim 
that  pq  can  not  be  compatible  with  T.  Though  we  have 
not  yet  found  an  existing  edge  VkX  intersecting  pq,  we 
know  that  one  exists,  because  cissuming  otherwise  would 
lead  to  a  contradiction.  Suppose  that  pq  were  compati¬ 
ble  and  were  added  to  the  triangulation.  Then  q,  p, 
would  be  a  CCW  chain,  so  by  Lemma  1  triangle  qpv\ 
would  contain  no  points  in  its  interior  and  viq  would 
be  compatible.  Because  it  is  shorter  than  pq  it  would 
already  be  in  the  triangulation.  But  then  v\q  would  be 
the  next  edge  CCW  around  ul  from  pvi,  contradicting 
the  assumption  that  V2  ^  q.  Therefore  pq  cannot  be 
compatible. 

There  is  one  remaining  case,  which  is  when  vi  is  inside 
rectangle  R  but  not  strictly  inside  circle  C,  V2  #  q,  and 
V1V2  does  not  intersect  pq.  That  this  CCW  chain  will 
not  lead  to  any  edge  intersecting  pq  follows  directly  from 
Condition  (3)  of  Theorem  1. 

We  have  now  completed  the  proof  of  the  correctness 
of  our  greedy  edge  test  method.  □ 

2.2  Implementation  and  Analysis 

We  now  discuss  the  implementation  of  our  edge  test  and 
analyze  its  efficiency.  We  store  our  greedy  triangulation 
T  as  a  graph  using  adjacency  lists.  Each  adjacency  list 
is  represented  as  a  circular  linked  list  of  edges  ordered 
in  polar  (or  rotational)  order  around  the  point.  To  find 
CW  and  CCW  chains  from  a  new  edge  we  must  deter¬ 
mine  where  in  rotational  order  the  new  edge  would  fit. 
That  would  normally  take  0(n)  time,  or  O(logn)  if  we 
stored  the  circular  list  in  a  binary  tree.  Fortunately, 
we  can  use  properties  of  the  greedy  triangulation  to  do 
better. 

Let  p  be  a  vertex  in  T.  We  consider  the  neighbors  of 
p  (in  T)  in  CCW  order.  Let  x  and  y  be  two  consecutive 
neighbors,  and  let  a  be  the  CCW  angle  swept  from  px 
to  py.  We  call  the  ordered  pair  {x,y)  closed  if  a  <  tt  and 
xy  is  in  T,  and  we  call  (x,  y)  open  otherwise.  (Note  that 
if  (a;,  y)  is  closed,  by  Lemma  1  the  triangle  pxy  must  be 
empty,  so  no  new  greedy  edges  can  connect  to  p  between 
px  and  py.)  We  call  an  edge  an  open  edge  if  it  connects 
p  to  a  point  in  an  open  ordered  pair.  A  closed  interval  is 
a  wedge  around  p  that  is  bounded  by  two  adjacent  open 
edges  whose  endpoints  in  CCW  order  do  not  form  an 
open  pair.  (If  the  edges  around  p  are  viewed  as  spokes 
of  a  wheel,  then  the  open  pairs  will  correspond  to  pairs 
of  spokes  with  no  “rim”  between  them  and  the  closed 


intervals  will  be  maximal  sections  of  the  wheel  with  the 
entire  “rim”  present.)  A  closed  point  is  a  point  incident 
to  at  least  one  edge  but  to  no  open  edges. 

To  the  each  edge  structure  in  the  data  structure  for 
storing  the  edges  as  a  circular  linked  list  we  add  pointers 
to  the  next  CW  and  CCW  open  edge  and  a  flag  to  show 
whether  the  wedge  lying  between  that  edge  and  the  next 
open  edge  CCW  from  it  is  an  open  ordered  pair  or  a 
closed  interval.  These  new  fields  will  only  be  maintained 
for  open  edges,  and  will  provide  a  doubly-linked  circular 
list  of  open  edges  around  each  point. 

Maintaining  this  structure  when  a  new  edge  pq  is 
added  to  T  is  fairly  straightforward.  We  will  discuss 
updates  at  p;  q  is  symmetric.  If  pq  is  the  first  edge  in¬ 
cident  to  p,  the  new  edge  will  point  to  itself  as  an  open 
ordered  pair  spanning  27r.  If  pq  is  added  to  the  middle 
of  an  open  pair  {x,  y)  (note  that  new  edges  can  only  be 
added  between  edges  of  an  open  ordered  pair),  we  must 
do  two  types  of  updates.  First,  (x,  y)  must  be  split  into 
two  pairs  (z;,  q)  and  (g,  y),  which  may  be  open  or  closed. 
To  find  out  which,  we  must  see  if  edges  xq  and  yq  are  in 
T,  which  can  be  done  in  constant  time.  Second,  we  must 
see  if  the  new  edge  closes  off  a  previously  open  pair.  To 
do  that,  we  see  if  the  CCW  edge  from  pq  has  the  same 
endpoint  w  as  the  CW  edge  from  qp  (and  the  symmetric 
case  on  the  other  side).  In  either  case,  we  must  be  able 
to  update  the  data  structure  so  that  a  previously  open 
interval  {x,  y)  is  now  closed.  To  do  this  the  new  closed 
interval  {x,y)  must  be  merged  with  possible  closed  in¬ 
tervals  lying  to  either  side  of  it  to  form  a  single  closed 
interval.  All  of  these  tests  and  updates  can  be  done  in 
constant  time. 

With  this  data  structure,  the  edge  test  is  done  as 
follows.  The  new  edge  pq  is  located  in  the  circular  list  of 
open  edges  at  p  and  q.  If  pg  falls  in  a  closed  interval  for 
either  point,  then  the  edge  is  not  compatible.  Otherwise 
perform  the  test  described  in  the  previous  subsection, 
using  the  open  edges  as  the  first  edges  in  the  CW  and 
CCW  chains.  This  test  will  take  time  proportional  to 
the  number  of  open  edges. 

Fortunately,  the  maximum  number  of  open  edges  ad¬ 
jacent  to  any  point  p  is  10.  This  is  because  any  pair 
{x,y)  with  a  <  7r/3  must  be  closed.  Because  a  is  not 
the  largest  angle  in  triangle  pxy,  we  know  that  xy  is 
not  the  longest  edge  in  the  triangle.  By  Lemma  1,  xy 
is  in  T,  which  means  that  {x,  y)  is  closed.  Therefore  ev¬ 
ery  open  pair  has  a  >  tt/S,  so  so  there  cannot  be  more 
than  six  of  them  around  p.  If  there  are  six,  each  edge 
is  shared  by  two  open  intervals,  so  there  are  only  six 
edges.  If  there  are  five  or  fewer  open  pairs,  then  there 
cannot  be  more  than  10  open  edges. 

To  initialize  the  data  structure  we  simply  create  n 
empty  circular  lists  in  0{n)  time.  Because  there  are 
0(n)  edges  in  the  entire  triangulation  the  total  space 
required  by  all  lists  is  0{n). 
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Thus  we  have  a  data  structure  that  requires  0(1)  time 
for  an  edge  test  or  an  update,  0(n)  time  to  initialize, 
and  0{n)  space.  For  comparison,  we  end  this  subsec¬ 
tion  by  noting  that  the  method  of  Gilbert  [10]  requires 
O(n^logn)  time  for  initialization,  O(nlogn)  time  for 
update,  and  O(n^)  space. 

3  A  Necessary  Condition  for  an 
Edge  to  be  in  the  Greedy  Tri¬ 
angulation 

We  begin  by  stating  a  simple  and  obvious  lemma. 

Lemma  2  (Convex  Quadrilateral  Rule)  Let 
pqr  and  pqs  be  two  empty  triangles  in  a  greedy  trian¬ 
gulation.  If  segment  rs  intersects  segment  pq  (that  is, 
prqs  is  a  convex  quadrilateral),  then  d{p,q)  <  d(r,s). 

Heath  and  Pemmaraju  [12]  describe  this  as  the  prop¬ 
erty  of  “local  optimality.” 

We  now  give  an  important  (though  less  obvious) 
lemma^  that  states  a  necessary  (but  not  sufficient)  con¬ 
dition  for  an  edge  to  be  a  greedy  triangulation  edge. 
This  is  a  variant  of  Lemma  3.1  in  [21]. 

Lemma  3  Letp,q  be  a  pair  of  points  in  a  set  S.  Con¬ 
sider  the  disc  D  of  radius  6  =  d{p,q)/{2\/5)  centered 
at  the  midpoint  of  pq.  Let  pq  divide  the  disc  into  two 
half-disks.  If  both  half-disks  contain  at  least  one  point 
in  S  then  pq  cannot  be  in  the  GT  of  S. 

Proof  Sketch  For  notational  convenience,  orient  the 
plane  so  that  segment  pq  is  horizontal,  with  p  on  the  left 
side.  (For  the  remainder  of  the  proof,  refer  to  Figure  1.) 
Let  a  be  the  point  closest  to  pq  in  the  upper  half-disk  of 
D,  and  let  b  be  the  point  closest  to  pq  in  the  lower  half- 
disk  of  D.  Let  C  be  the  circle  with  pq  as  a,  diameter, 
and  let  a  be  the  length  of  the  shortest  segment  with 
endpoints  on  or  outside  of  C  that  passes  through  D. 
Then  a  =  \/d{p,  q^  —  46^. 

We  will  give  a  proof  by  contradiction  making  use  of 
the  following  observation. 

Observation  1  Let  pi  and  p2  be  any  pair  of  points 
that  are  not  connected  by  an  edge  in  the  GT.  Then  some 
GT  edge  intersecting  segment  pip2  must  have  length 
<  d{pi,p2). 

Assume  pq  is  a  GT  edge.  Then  ab  cannot  be  a  GT 
edge,  and  by  Observation  1  there  must  be  some  GT  edge 
of  length  less  than  d{a,  b)  <  26  “cutting  off”  ab.  We  will 
show  that  no  such  edge  exists  either  above  or  below  pq. 

*an  earlier  false  version  of  this  lemma  with  S  =  d(p,  q)/2  was 
given  in  [6].  We  give  the  corrected  version  here. 


Figure  1:  Edges  intersecting  ab 

Since  there  are  points  on  both  sides  of  edge  pq,  it 
is  not  a  CH  edge,  and  therefore  it  must  be  an  edge 
in  two  triangles  -  an  upper  and  a  lower.  Let  pwiq  be 
the  upper  triangle  and  let  pziq  be  the  lower  triangle. 
If  wi  and  zi  are  both  in  C,  then  d{wi,zi)  <  d{p,q) 
and  thus  by  Lemma  2  pq  could  not  be  a  GT  edge  and 
we  have  a  contradiction.  Without  loss  of  generality,  we 
therefore  assume  that  wi  is  outside  of  C  and  is  closer  to 
q  than  to  p.  Since  edge  pwi  cuts  off  ab,  it  follows  that 
d{p,  wi)  >  a. 

For  notational  convenience,  we  also  label  all  the  GT 
edges  intersecting  segment  ab.  The  GT  edges  above  pq 
that  intersect  ab  we  will  label  ei, . . . ,  Cm  and  the  edges 
below  ab  that  intersect  ab  we  label  /i , . . . ,  /n  •  We  now 
give  a  case  analysis  on  these  edges.  First,  all  of  these 
edges  cannot  have  both  endpoints  outside  of  or  on  C, 
because  they  comprise  all  of  the  edges  crossing  ab,  and 
all  of  them  would  be  longer  than  o:.  But  our  choice  of  6 
causes  a  to  be  greater  than  26,  and  by  Observation  1, 
one  intersecting  GT  edge  must  be  shorter  than  d{a,b). 

The  final  two  observations  lead  to  our  contradiction. 

Observation  2  At  least  one  e,-  and  at  least  one  fj 
must  have  an  endpoint  in  C. 

Observation  3  If  ej,  is  the  first  e,-  to  have  an  endpoint 
in  C,  then  ei, . . .,  ek-i  are  all  of  the  form  pwi  for  1  < 
i  <  k  —  1,  and  e*  is  of  the  form  xwk-i.  Furthermore, 
X  lies  to  the  left  of  D.  Finally,  for  some  1  <  /  <  fc  —  1 
there  is  an  point  wi  that  lies  within  d{p,  q)  of  p.  (See 
Figure  1  again.) 


215 


The  Contradiction  This  structure  is  very  constrain¬ 
ing.  We  have  constrained  x  to  lie  to  the  left  of  D  and 
inside  of  C.  Furthermore,  if  x  were  closer  to  q  than  a, 
then  by  Observation  1  one  of  the  e,-  with  1  <  i  <  ^  —  1 
would  have  to  be  shorter  than  a,  which  cannot  be  the 
ca.se.  (None  of  them  have  endpoints  in  C.)  Therefore  x 
is  at  least  a  from  q.  Finally,  x  must  lie  below  the  ray 
from  q  that  is  tangent  to  D  from  above. 

We  have  also  constrained  point  wj  to  lie  outside  of  C 
near  q,  but  inside  the  circle  of  radius  d(p,  q)  centered  at 
p.  It  must  also  lie  below  the  ray  from  from  p  tangent  to 
D  from  above. 

Observation  2  says  that  some  edge  fj  will  be  the  first 
edge  below  pq  along  ab  with  an  endpoint  y  inside  of 
C.  But  where  can  y  lie?  It  must  lie  above  either  the 
ray  from  p  tangent  to  D  from  below  or  the  ray  from  q 
tangent  to  D  from  below.  If  it  is  in  the  left  half  of  C,  its 
distance  to  x  will  be  less  than  a.  If  it  is  in  the  right  half 
of  C,  its  distance  to  wi  will  be  less  than  a.  In  either  case 
we  have  a  contradiction,  because  all  intervening  edges 
will  be  at  least  a  long.  (All  of  them  have  both  endpoints 
on  or  outside  of  C.)  □ 

3.1  Comments  on  the  Constant 

The  proof  above  requires  a  to  be  larger  than  a  number  of 
different  values.  The  strongest  requirement  arises  in  the 
last  case,  when  we  let  y  lie  at  the  intersection  of  the  ray 
from  p  tangent  to  D  from  below  and  C  and  we  let  wi  lie 
at  the  intersection  of  the  ray  from  p  tangent  to  D  from 
above  and  the  circle  of  radius  d{p,  q)  centered  at  p.  The 
actual  bound  of  d{p,  q)/{‘ly/%)  is  derived  from  a  slightly 
worse  configuration,  which  cannot  be  actually  achieved. 
We  require  that  the  intersection  points  between  the  two 
rays  from  p  tangent  to  D  and  the  circle  of  radius  d{p,  q) 
centered  at  p  be  at  most  a  apart.  This  case  is  less  than 

5  percent  worse  than  the  actual  worst  case,  and  is  less 
complicated  to  calculate  and  express. 

We  do  not  believe  that  our  bound  is  tight.  The 
worst  example  that  we  have  been  able  to  find  is  a  di¬ 
amond  with  pq  as  its  diameter,  with  two  additional 
points  just  outside  of  the  midpoints  of  two  opposite 
edges  of  the  diamond.  This  case  approaches  the  bound 

6  =  d{p ,  q) / {2y/2)  as  closely  as  is  desired. 

4  An  Analysis  of  the  Edges  in  the 
Greedy  Triangulation 

For  two  points  p  and  q  in  the  plane,  and  for  a  real  num¬ 
ber  r  >  0,  let  D{j),  q,  r)  denote  the  closed  disk  of  radins 
r  centered  at  (p+q)/2.  The  line  through  p  and  q  defines 
two  closed  semidisks  of  D{p,  q,r),  denoted  by  D'(p,  q,  r) 
and  D"{p,  q,  r)  (without  specifying  which  half  is  D'  and 
D" ,  respectively). 


Let  us  employ  a  constant  0  <  7  <  1,  fixed  for  the 
whole  section.  Given  a  set  5  of  n  points  in  the  plane,  we 
call  a  pair  {p,  g}  of  points  in  S  plausible,  if  D'(p,  q,  ri)  fl 
5  =  0  or  D''(p,  q,  n)  n  5  =  0,  with  ri  =  7|p  —  g|/2.  The 
previous  section  showed  that  if  7  =  l/VS,  only  plausible 
edges  can  be  in  a  greedy  triangulation.  When  7  =  1 
only  plausible  edges  can  be  Delaunay,  so  the  lemmas  we 
prove  about  distributions  of  edge  lengths  apply  to  both 
greedy  and  Delaunay  triangulations. 

Our  first  goal  is  to  estimate  the  expected  number  of 
plausible  pairs  in  a  set  5  of  n  points,  uniformily  dis¬ 
tributed  in  a  convex  compact  region  C.  We  normalize  C 
to  be  of  area  1  to  simplify  the  notation. 

Suppose  p  and  q  are  points  in  C,  so  that  D{p,q,ri) 
is  contained  in  C  (ri  as  above).  If  we  choose  another 
n  —  2  points  at  random  from  C,  then  the  probability  of 
{p,  g}  being  plausible  is  the  same  as  the  probability  that 
D'{p,  q,  ri)  is  empty  or  D"{p,  g,  ri)  is  empty.  This  gives 
us  the  following: 

Prob({p,  q}is  plausible) 

<  Prob{D'  is  empty)  +  Prob{D''  is  empty) 


We  have  to  cope  with  pairs  of  points  {p,  g}  where 
D{p,q,r\)  extends  beyond  the  boundaries  of  C:  For 
p  £  C,  let  fd{p)  denote  the  distance  of  p  from  the  bound¬ 
ary  of  C,  or,  in  other  words,  the  largest  radius  of  a  disk 
centered  at  p  which  is  still  contained  in  C.  Then  we 
observe  that  for  any  pair  if  points  p  and  g  in  C,  the 
disk  Z)(p,  g,  r2),  r2  =  (/?(p)  +  (3{q))/2,  is  contained  in  C. 
It  follows  that  for  any  pair  {p,  g)  C  C  the  probability 
of  being  plausible  (after  adding  (n-2)  random  points)  is 
bounded  by 

2g-''^’r(n-2)/2 

for  r  =  :=  min{ri,  r2},  with  ri  and  r2  dependent  on 

p  and  g  as  previously  specified.  Now  let  p  and  g  be  two 
random  points  in  a  random  set  of  n  points  in  C.  Then 
the  probability  of  being  plausible  is  bounded  by 

/  [  dqdp 

Jp€C  JqeC 

<  [  [ 

J p^c  J (i^C 

+  [  [  2e-''='’^("-2)/2dgdp  .  (2) 

dpec  Jgec 

In  order  to  estimate  the  terms  in  (2)  we  use  density 
functions  fp{x),  p  £  C,  and  g{x).  fp{x)dx  is  the 

probability  for  a  random  point  in  C  to  have  distance  at 
most  2  from  p;  clearly,  fp{x)  <  2xn  for  all  p  G  C  and 
*  >  0.  f_^g{x)dx  is  the  probability  of  a  random  point 
in  C  to  have  distance  at  most  2  from  the  boundary  of  C; 
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here  we  have  a  bound  of  g{x)  <  t/,  [/  is  the  perimeter 
of  C,  for  all  a:  >  0. 

Now  the  left  term  in  (2)  equals 

[  [  2/p(a;)e-"V<n-2)/8^^^p 

Jpec  Jx=o 

f-OO 

<  /  dx 

Jx=0 

16 

7^(n  —  2) 

The  right  term  in  (2)  equals 

roo  ^oo 

/  9ix)  /  9{y)2e-(^+yy^(^-^y^dydx 

Jx=0  •/y=0 

yoo  yOO 

<  2C/2  /  /  dydx 

J xzzQ 

8U^ 

7r(n  —  2) 

Lemma  4  Let  X  be  the  random  variable  for  ike  number 
of  plausible  pairs  in  a  random  set  S  of  n  >4  points 
uniformly  distributed  in  a  convex  region  of  area  1  and 
perimeter  U.  Then 

E(X)  <  4(n  +  2)(—  +  4-)  =  0(U^n) 

TT  ' 

CLTld 

E{X\ogX)  <  2E{X)\ogn  =  0(f/^nlogn)  . 


Let  us  call  a  pair  {p,  q]  C  C  a  candidate  (for  being  plau¬ 
sible),  if  r  =  Tp,,  <  B,  where  B  :=  4(c  In  n)/(n  -  2) 
for  a  constant  c.  Our  next  goal  is  now  to  show,  that 
a  random  set  does  not  contain  too  many  candidates, 
and-with  high  probability-all  plausible  pairs  are  indeed 
candidates.  We  use  a  similar  method  of  splitting  the 
problem  into  inequalities,  and  arrive  at  the  following 
lemma: 

Lemma  5  Let  Y  be  the  random  variable  for  the  num¬ 
ber  of  candidates  in  a  random  set  S  of  n  >  4  points 
uniformly  distributed  in  a  convex  region  of  area  1  and 
perimeter  U .  Then 

j  c(n  -f  2)  In  n  =  0{U'^cn  log  n)  . 

The  probability  that  there  exists  a  plausible  pair  that  is 
not  a  candidate  is  bounded  by 

„2-c7r/2 


5  Greedy  Triangulation  Algo¬ 
rithms 

These  results  lead  to  the  following  greedy  triangulation 
algorithm: 

5.1  Algorithm  1 

Step  1:  Generate  all  plausible  pairs  with  ri  < 

B.  To  do  this,  we  generate  all  pairs  of  points  separated 
by  a  distance  of  at  most  2B/7.  This  is  the  fixed-radius- 
near-neighbors  problem  [2,  7].  In  this  case  a  bucketing 
algorithm  by  Bentley,  Stanat,  and  Williams  can  solve 
the  problem  in  time  0(n  -j-  m),  where  m  is  the  number 
of  pairs  that  lie  within  2B/7  of  one  another.  As  each 
pair  is  generated,  test  to  see  if  it  is  plausible  using  the 
method  described  below. 

Step  2:  Generate  all  plausible  pairs  with  < 

B.  The  easy  way  to  do  this  is  to  find  all  points  within 
2B  of  the  boundary  of  C  and  to  generate  all  pairs.  How¬ 
ever,  this  can  generate  pairs  with  r2  as  long  as  2B.  By 
sorting  the  points  within  2B  of  the  boundary  of  C  in 
order  of  their  distance  from  the  boundary  one  can  gen¬ 
erate  only  the  pairs  needed  by  matching  each  point  with 
points  further  from  the  boundary  than  itself  only  until 
r2  >  B,  then  going  on  to  the  next  point.  Also,  note 
that  pairs  that  also  have  rj  <  H  can  be  ignored,  be¬ 
cause  they  were  generated  in  Step  1.  Test  each  pair  to 
see  if  it  is  plausible. 

Step  3:  Generate  the  greedy  triangulation  of 
the  plausible  pairs  generated  in  Steps  1  and  2. 
To  do  this,  first  sort  the  pairs  in  increasing  order  of 
distance  between  the  points.  Start  with  an  empty  tri- 
angulation,  and  attempt  to  create  an  edge  between  each 
pair  in  turn,  failing  to  create  the  edge  if  it  fails  the  com¬ 
patibility  test  described  above. 

5.2  Testing  to  See  if  a  Pair  is  Plausible 

We  need  a  fast  test  to  see  if  an  edge  is  plausible.  One 
way  to  do  this  uses  a  grid  of  squares.  As  a  preprocessing 
step,  cover  C  by  a  grid  of  0[n)  squares,  each  with  side 
l/y/n.  For  each  bucket,  create  a  list  of  the  points  in 
S  that  fall  in  that  bucket.  Then  to  test  a  pair  (p,  5), 
compute  D{p,q,r),  with  r  =  min(ri,r2)  as  described 
above.  Go  through  the  squares  that  overlap  D'{p,  q,  r) 
until  you  find  a  point  lying  in  D'{p,q,r)  or  find’tLt 
no  such  point  exists.  Similarly  go  through  the  squares 
that  overlap  D"(j>,q,r')  until  you  find  a  point  lying  in 
this  half-disk  or  find  that  no  such  point  exists.  If  either 
half-disk  is  empty,  the  point  is  plausible.  In  searching 
through  the  grid  squares,  start  with  the  squares  with 
maximum  overlap  with  the  half-disk. 
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For  uniformly  distributed  points  this  test  will  run  in 
0(1)  expected  time.  The  probability  that  a  grid  square 
that  lies  completely  in  C  contains  no  point  from  S  is 
(1  -  1/n)"-  This  is  always  less  than  Therefore  the 
probability  that  a  given  grid  square  contains  a  point 
is  greater  than  1  -  >  0.63.  This  implies  that  a 

search  through  grid  squares  looking  for  a  non-empty 
one  is  expected  to  look  at  fewer  than  2  squares.  When 
r  is  small  enough  that  no  full  grid  squares  overlap  a 
half-disk,  then  only  a  constant  number  of  grid  squares 
overlap  the  half-disk.  When  r  is  larger,  the  number 
of  grid  squares  examined  before  an  point  is  found  is 
expected  be  a  constant. 

Unfortunately,  a  bad  distribution  of  points  could 
cause  this  test  to  look  at  0(n)  squares  for  a  very  long 
edge.  To  prevent  this,  we  stop  testing  after  looking  at 
clnn  squares  for  some  c  that  we  choose.  Thus  the  test 
could  occasionally  decide  that  a  half-disk  is  empty  when 
it  in  fact  is  not.  Thus  a  candidate  could  occasionally  be 
considered  plausible  when  it  is  not.  But  this  will  happen 
with  probability  less  than  (1  -  ^  ^  . 

This  is  about  We  can  therefore  choose  c  so  that 

the  expected  number  of  edges  on  which  the  test  fails  is 
o(l),  and  this  limitation  will  avoid  bad  worst-case  times 
without  hurting  expected  times. 


5.3  Analysis  of  Algorithm  1 

Given  this  test  for  plausible  pairs,  it  is  easy  to  show  that 
Algorithm  1  runs  in  0(n  log  n)  expected  time.  By  using 
the  floor  function  we  can  do  the  preprocessing  for  the 
edge  test  in  0(n)  time.  By  Lemma  5  the  number  of  pairs 
considered  in  Steps  1  and  2  is  0(ti  log  n).  Each  test  for 
plausibility  takes  constant  expected  time,  so  these  steps 
require  C)(nlogn)  time.  By  Lemma  4  the  number  of 
pairs  that  are  generated  in  Steps  1  and  2  that  are  plau¬ 
sible  is  0(n).  Sorting  these  takes  0(n  log n)  time,  and 
the  compatibility  test  for  each  takes  0(1)  time.  There¬ 
fore  Step  3  and  the  entire  algorithm  require  0(n  log  n) 
time  and  0(n)  space.  In  the  worst  case  the  algorithm 
could  take  0(n  log^  n)  time  and  0(n  log  n)  space. 

Unfortunately,  this  algorithm  is  not  guaranteed  to 
generate  the  greedy  triangulation.  It  will  generate  it 
with  probability  ,  and  because  we  can  choose 

c  as  large  as  we  like  we  can  make  this  probability  ar¬ 
bitrarily  close  to  1.  But  the  algorithm  could  fail  to 
produce  a  triangulation,  either  because  neither  diago¬ 
nal  of  a  quadrilateral  was  a  candidate  or  because  the 
edge  compatibility  test  (which  depends  on  the  correct¬ 
ness  of  the  partial  triangulation)  could  fail  if  edges  were 
missing.  Alternately,  it  could  produce  a  triangulation 
that  was  not  the  greedy  one,  and  there  is  no  known  fast 
way  to  verify  the  correctness  of  a  greedy  triangulation. 

Fortunately,  a  minor  modification  of  the  algorithm 
will  eliminate  this  problem. 


5.4  Algorithm  2 


We  turn  Algorithm  1  into  a  two-phase  algorithm.  We 
run  Steps  1  and  3  of  Algorithm  1,  but  skip  Step  2.  What 
we  will  be  left  with  at  the  end  of  this  process  is  a  par¬ 
tial  greedy  triangulation.  All  of  the  short  edges  will  be 
present,  but  no  edge  longer  than  2B/7  will  be  present. 
But  Lemma  5  implies  that  with  very  high  probability, 
all  edges  that  have  either  endpoint  at  least  25/t  from 
the  boundary  of  C  will  be  present.  We  will  call  points 
at  least  2B/7  from  the  boundary  interior  points.  We 
call  a  greedy  triangulation  edge  with  an  interior  point 
as  (at  least)  one  of  its  endpoints  an  interior  edge.  The 
expected  number  of  interior  edges  that  are  missing  is 
bounded  by  the  expected  number  of  plausible  edges 
/-onriirlatpfi  so  IS  at  most  .  As 


long  as  c  is  chosen  to  be  at  least  d/rr  this  is  0(1)  edges. 
This  implies  that  at  most  0(1)  interior  points  will  not 
be  closed  (as  defined  in  the  edge  test  section).  This  is 
because  a  missing  edge  can  cause  at  most  four  points 
to  be  not  closed  (the  four  vertices  of  the  quadrilateral 
with  the  missing  edge  as  diagonal). 


This  observation  leads  to  a  new  Step  2' . 


Step  2':  Generate  plausible  long  pairs  Gener¬ 
ate  all  possible  pairs  of  non-closed  points,  and  reject  all 
implausible  pairs.  Sort  these  pairs,  and  continue  run¬ 
ning  Step  3  of  Algorithm  1  with  these  pairs  as  well. 

Because  all  short  pairs  are  tried  and  then  all  longer 
pairs  that  could  possibly  create  an  edge  are  tried,  the  al¬ 
gorithm  will  correctly  generate  the  greedy  triangulation 
of  any  set  of  points. 

The  edge  test  data  structure  keeps  track  of  the  list 
of  incident  open  edges  for  each  point.  A  point  which  is 
closed  will  have  incident  edges,  but  no  open  edges  in  its 
list. 

Will  this  step  generate  too  many  candidate  pairs? 
The  number  of  interior  points  that  are  not  closed  is 
0(1).  Non-interior  points  lie  within  2B/7  of  the  bound¬ 
ary  of  C.  The  total  number  of  non-interior  points  is  ex¬ 
pected  to  be  at  most  2BUn/~f,  because  2BU/-y  is  an 
upper  bound  on  the  area  close  enough  to  the  boundary 
of  C.  Therefore  the  total  number  of  candidate  edges 
generated  in  Step  2'  is  (2B'^"/7+o(i))  =  0(n log n). 

Lemma  4  says  that  the  total  expected  number  of  plau¬ 
sible  pairs  is  0(n),  so  Step  2'  is  expected  to  gener¬ 
ate  0{n)  plausible  pairs.  Therefore  sorting  and  testing 
these  for  compatability  will  take  O(nlogn)  time. 

This  shows  that  Algorithm  2  will  always  compute  the 
greedy  triangulation  and  is  expected  to  run  in  0(n  log  n) 
time,  using  0(n)  space.  In  the  worst  case  it  could  take 
0(n^  log  n)  time  and  0{n'^)  space. 
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5.5  Algorithm  3  -  An  Algorithm  that 
Depends  Less  on  the  Uniform  Dis¬ 
tribution 

Algorithm  2  depends  heavily  on  the  uniform  distribn- 
tion,  both  to  get  a  fast  expected-time  test  for  plausibil¬ 
ity  and  to  tell  when  to  end  the  first  phase  and  begin  the 
second.  The  following  algorithm  is  a  variant  that  is  less 
sensitive  to  the  exact  distribution.  It  will  dynamically 
decide  when  to  switch  from  one  phase  to  the  next  in 
an  attempt  to  balance  the  amount  of  work  done  in  each 
phase. 

In  the  first  phase  it  generates  possible  edges  in  in¬ 
creasing  order  using  an  algorithm  of  Dickerson,  Drys- 
dale,  and  Sack  [8].  (Algorithms  to  enumerate  the  k 
closest  interpoint  pairs  have  been  invented  by  by  Sa- 
lowe  and  by  Lenhof  and  Smid,  but  because  they  need  to 
know  k  in  advance  they  are  less  appropriate  in  this  con¬ 
text  [30,  16].)  When  the  number  of  pairs  of  not  closed 
points  is  proportional  to  the  number  of  pairs  already  ex¬ 
amined,  it  starts  over,  enumerating  pairs  of  not  closed 
points  in  increasing  order  (similar  to  Step  2'  above). 

How  do  we  know  when  to  switch  over?  We  keep  track 
of  the  number  of  points  which  are  not  closed.  When 
during  an  edge  insertion  a  point  becomes  closed,  we  sub¬ 
tract  1  from  the  number  of  non-closed  points  c.  When 
c^/2  is  greater  than  the  number  of  edges  tested  so  far, 
the  number  of  edges  generated  in  the  second  phase  will 
equal  the  number  in  the  first,  so  we  change  to  the  second 
phase. 

5.6  Analysis  of  Algorithm  3 

Generating  the  next  pair  in  increasing  order  requires 
O(logn)  time,  and  a  compatibility  test  takes  0(1)  time. 
Therefore  the  algorithm  runs  in  time  O(logn)  times  the 
number  of  pairs  generated.  It  requires  space  propor¬ 
tional  to  the  number  of  pairs  generated. 

For  the  uniform  distribution,  we  have  already  seen 
that  all  interior  points  are  expected  to  be  closed  af¬ 
ter  examining  0(n  log  n)  edges,  and  that  at  that  point 
the  number  of  points  which  are  not  closed  is  0(Bn}  = 
0(^/n  logn).  Therefore  the  number  of  pairs  in  the 
second  phase  will  be  O(nlogn).  This  implies  that 
the  algorithm  is  expected  to  take  0(n  log^  n)  time  and 
O(nlogn)  space  for  a  uniform  distribution. 

6  Summary  and  Open  Problems 

We  have  given  a  new  method  for  testing  edge  compati¬ 
bility  in  a  greedy  triangulation.  The  method  is  based  on 
Theorem  1,  the  CW/CCW  chain  theorem,  which  states 
an  interesting  property  of  greedy  triangulations.  Our 
method  requires  only  0(1)  time  for  both  the  compati¬ 
bility  test  and  npdates  operations.  This  is  a  significant 


improvement  over  previous  methods. 

We  then  proved  a  necessary  condition  involving  half¬ 
disks  for  an  edge  to  be  in  the  greedy  triangulation.  This 
lead  to  theorems  on  the  nnmber  of  pairs  of  points  that 
were  plansible  and  that  were  candidates  to  be  plausible 
edges. 

Finally,  we  used  these  characterizations  and  the  com¬ 
patibility  test  to  prove  the  correctness  and  runtime  of 
several  new  algorithms  for  computing  the  greedy  tri¬ 
angulation.  On  uniformly  distributed  points  we  can 
compute  the  greedy  triangulation  in  expected  time 
O(nlogn)  and  space  0(n). 

Some  obvious  questions  arise  from  this  work. 

Problem  1  OPEN  We  can  construct  a  point  set  for 
which  Algorithm  3  would  require  ©(n^  log  n)  time.  How¬ 
ever  this  set  is  highly  structured  and  non-random.  What 
is  the  expected  run  time  for  Algorithm  3  for  random  dis¬ 
tributions  other  than  the  uniform  distribution? 

Problem  2  OPEN  Is  there  a  greedy  triangulation  al¬ 
gorithm  requiring  o(n^)  time  in  the  worst  case?  Note 
that  with  our  edge  test,  a  way  to  enumerate  all  pairs 
of  points  in  increasing  order  by  distance  in  O(n^)  time 
would  lead  to  an  O(n^)  algorithm  for  the  GT,  but  would 
not  lead  to  an  o(n^)  algorithm. 

Problem  3  OPEN  What  is  the  true  worst-case  ra¬ 
tio  for  6  in  Lemma  3?  We  have  bounded  it  between 
dip,q)/(‘2V5)  and  d{p,q)l{2y/2). 
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Linear-size  Nonobtuse  Triangulation  of  Polygons 

Marshall  Bern  *  Scott  Mitchell  1  Jim  Ruppert  ^ 


Abstract 

We  give  an  algorithm  for  triangulating  n-vertex  polyg¬ 
onal  regions  (with  holes)  so  that  no  angle  in  the  final 
triangulation  measures  more  than  7r/2.  The  number  of 
triangles  in  the  triangulation  is  only  0{n),  improving 
a  previous  bound  of  O(n^),  and  the  worst-case  running 
time  is  O(nlog^n).  The  basic  technique  used  in  the 
algorithm,  recursive  subdivision  by  disks,  is  new  and 
may  have  wider  application  in  mesh  generation.  We 
also  report  on  an  implementation  of  our  algorithm. 

1.  Introduction 

The  triangulation  of  a  two-dimensional  polygonal  re¬ 
gion  is  a  fundamental  problem  arising  in  computer 
graphics,  physical  simulation,  and  geographical  infor¬ 
mation  systems.  Most  applications  demand  not  just 
any  triangulation,  but  rather  one  with  triangles  satis¬ 
fying  certain  shape  and  size  criteria  [9].  In  order  to 
satisfy  these  criteria,  one  typically  allows  triangles  to 
use  new  vertices,  called  Steiner  points,  that  are  not 
vertices  of  the  input  polygon.  The  number  of  Steiner 
points  should  not  be  excessive,  however,  as  this  would 
increase  the  running  time  of  computations. 

Throughout  the  application  areas  named  above,  it  is 
generally  true  that  large  angles  (that  is,  angles  close 
to  tt)  are  undesirable.  Babuska  and  Aziz  [2]  justi- 
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fled  this  aversion  for  one  important  application  area  by 
proving  convergence  of  the  finite  element  method  [25] 
as  triangle  sizes  diminish,  so  long  as  the  maximum  an¬ 
gle  is  bounded  away  from  n.  They  also  gave  an  example 
in  which  convergence  fails  when  angles  grow  arbitrar¬ 
ily  flat.  (An  elementary  example  in  which  large  angles 
spoil  convergence  is  Schwarz’s  paradox  [22].) 

Any  bound  smaller  than  tt  implies  convergence  in 
Babuska  and  Aziz’s  model,  but  a  bound  of  7r/2  on  the 
largest  angle  has  special  importance.  First  of  all,  any 
stricter  non-varying  requirement  would  also  bound  the 
smallest  angle  away  from  zero;  for  some  inputs  (such 
as  a  long,  skinny  rectangle)  this  forces  the  triangula- 
tion  to  contain  a  number  of  triangles  dependent  on  the 
geometry — not  just  on  the  combinatorial  complexity — 
of  the  input.  Second,  a  nonobtuse  triangulation  is 
necessarily  a  (constrained)  Delaunay  triangulation  [8]. 
Third,  a  nonobtuse  triangulation  admits  a  perpendicu¬ 
lar  planar  dual,  that  is,  an  embedding  in  which  dual 
edges  cross  at  right  angles.  Such  an  embedding  is  con¬ 
venient  for  the  “finite  volume”  method  [25].  Finally, 
a  nonobtuse  triangulation  has  better  numerical  proper¬ 
ties  [3,  27].  In  particular,  Vavasis  [27]  recently  proved 
that  for  finite  element  problems  with  physical  char¬ 
acteristics  that  vary  enormously  over  the  domain,  a 
nonobtuse  mesh  implies  faster  convergence  of  a  certain 
numerical  method. 

These  properties  have  established  nonobtuse  triangu¬ 
lation  as  a  desirable  goal  in  finite  element  mesh  gener¬ 
ation.  Several  heuristic  methods  have  been  developed 
to  compute  nonobtuse  triangulations  [4,  20].  Baker, 
Grosse,  and  Rafferty  [3]  gave  the  first  provably-correct 
algorithm.  Their  algorithm  also  bounds  the  smallest 
angle  away  from  zero,  and  hence  necessarily  uses  a  num¬ 
ber  of  triangles  dependent  upon  input  geometry.  Melis- 
saratos  and  Souvaine  [16]  gave  another  algorithm  of  this 
type. 

From  the  point  of  view  of  theoretical  computer  sci¬ 
ence,  however,  it  is  important  to  determine  the  inher¬ 
ent  complexity  of  nonobtuse  triangulation,  apart  from 
no-small-angle  triangulation.  Bern  and  Eppstein  [8]  de¬ 
vised  a  nonobtuse  triangulation  algorithm  using  O(n^) 
triangles,  where  n  is  the  number  of  vertices  of  the  input 
domain.  This  result  demonstrates  a  fundamental  com- 


Figure  1.  (a)  Disk  packing.  (b)  Induced  small  polygons. 


(c)  Final  triangulation. 


plexity  separation  between  bounding  large  angles  and 
bounding  small  angles.  Bern,  Dobkin,  and  Eppstein  [7] 
later  improved  this  bound  to  for  convex  poly¬ 

gons. 

In  this  paper,  we  improve  these  bounds  to  linear,  us¬ 
ing  an  entirely  different — and  more  widely  applicable — 
technique.  Aside  from  sharpening  the  theory,  our 
new  algorithm  boasts  other  advantages:  it  parallelizes, 
thereby  placing  nonobtuse  triangulation  in  the  class 
MC]  and  it  does  not  use  axis-parallel  grids,  so  the  out¬ 
put  has  no  preferred  directions.  Our  algorithm  also 
improves  results  of  Bern  et  al.  [7]  on  no-large- angle  tri¬ 
angulation.  The  superseded  results  include  an  algo¬ 
rithm  guaranteeing  a  maximum  angle  of  at  most  57r/6 
that  uses  0{n  log  n)  triangles  for  simple  polygons  and 
0(n^^^)  triangles  for  polygons  with  holes. 

2.  Overview  of  the  Algorithm 

Our  algorithm  consists  of  two  stages.  The  first  stage 
(Section  3)  packs  the  domain  with  non-overlapping 
disks,  tangent  to  each  other  and  to  sides  of  the  domain. 
The  disk  packing  is  such  that  each  region  not  covered 
has  at  most  four  sides  (either  straight  sides  or  arcs),  as 
shown  in  Figure  1(a).  The  algorithm  then  adds  edges 
(radii)  between  centers  of  disks  and  points  of  tangency 
on  their  boundaries,  thereby  dividing  the  domain  into 
small  polygons  as  shown  in  Figure  1(b). 

The  second  stage  (Section  4)  triangulates  the  small 
polygons  using  Steiner  points  located  only  interior  to 
the  polygons  or  on  the  domain  boundary.  Restricting 
the  location  of  Steiner  points  ensures  that  triangulated 
small  polygons  fit  together  so  that  neighboring  trian¬ 


gles  share  entire  sides.  Figure  1(c)  shows  the  resulting 
nonobtuse  triangulation. 

This  algorithm  is  circle-based,  rather  than  grid-based 
like  the  previous  polynomial-size  nonobtuse  triangula¬ 
tion  algorithm  [8].  Analogously,  the  problem  of  no- 
small-angle  triangulation  has  grid-based  [10]  and  circle- 
based  [23]  solutions.  In  retrospect,  circle-based  algo¬ 
rithms  offer  a  more  natural  way  to  bound  angles,  as 
well  as  meshes  more  intrinsic  to  the  input  domain. 
This  conclusion  is  supported  by  two  more  examples. 
In  the  second  stage  of  this  paper’s  algorithm,  certain 
misshapen  small  polygons  cause  technical  difficulties; 
these  are  neatly  solved  by  packing  in  more  disks.  (One 
of  these  additional  disks  is  the  second  from  the  left  along 
the  bottom  side  of  Figure  1(b).)  In  other  recent  work, 
Mitchell  uses  the  “angle  buffering”  property  of  circles 
to  give  a  triangulation,  restricted  to  use  only  interior 
Steiner  points,  with  linear  size  and  largest  angle  nearly 
as  small  as  possible  [18]. 

3.  Disk  Packing 

In  this  section,  we  describe  the  first  stage  of  the  algo¬ 
rithm.  Let  P  denote  the  input:  a  region  of  the  plane 
bounded  by  a  set  of  disjoint  simple  polygons  with  a  to¬ 
tal  of  n  vertices.  An  arc-gon  is  a  simple  polygon  with 
sides  that  are  arcs  of  circles.  The  circles  may  have  var¬ 
ious  radii,  including  infinity  (which  implies  a  straight 
side). 

Throughout  the  disk-packing  stage,  we  make  use  of 
the  generalized  Voronoi  diagram  (GVD),  which  is  de¬ 
fined  by  proximity  to  both  edges  and  vertices.  The  in¬ 
terior  points  of  polygonal  region  P  are  divided  into  cells 


Figure  2.  Adding  disks  at  (a)  convex  and  (b)  concave  cor¬ 
ners  of  polygonal  region  P. 


according  to  the  nearest  vertex  of  P,  or  the  nearest  edge 
(viewing  each  edge  as  an  open  segment).  The  resulting 
partition  consists  of  a  set  of  bisectors,  either  line  seg¬ 
ments  or  parabolic  arcs;  it  is  essentially  the  same  as  the 
medial  axis  [21],  The  GVD  can  be  similarly  defined  for 
arc-gons,  or  more  generally  for  arbitrary  collections  of 
points,  segments,  and  circular  arcs.  The  GVD  of  a  col¬ 
lection  of  n  points,  segments,  and  arcs  can  be  computed 
in  time  O(nlogn)  [13]. 

The  disk-packing  stage  consists  of  three  smaller  steps. 
First,  one  or  two  disks  are  placed  at  each  vertex  of  the 
polygon.  Second,  holes  in  the  polygon  are  connected 
to  the  boundary  by  adding  disks  tangent  to  two  holes, 
or  to  a  hole  and  the  outer  boundary.  Third,  disks  are 
added  to  the  as-yet-uncovered  regions  (called  remainder 
regions),  recursively  reducing  their  complexity  until  all 
have  at  most  four  sides. 

Disks  at  Corners.  The  first  step  preprocesses  P  so 
that  we  need  only  consider  arc-gons  with  angle  0  at  each 
vertex.  At  every  convex  vertex  of  P,  we  add  a  small 
disk  tangent  to  both  edges,  as  shown  in  Figure  2(a).  At 
every  concave  vertex  of  P,  we  add  two  disks  of  equal 
radii,  tangent  to  the  edges,  and  tangent  to  the  angle 
bisector  at  the  corner,  as  shown  in  Figure  2(b).  We  can 
handle  point  holes  by  centering  small  disks  at  them. 
We  choose  radii  small  enough  that  disks  lie  within  P, 
and  none  overlap  (that  is,  intersect  at  interior  points). 
This  step  isolates  a  small  3-  or  4-sided  remainder  region 
at  each  corner  of  P.  The  large  remainder  region  is  an 
arc-gon  of  2n  -1-  r  =  0{n)  sides,  where  n  is  the  number 
of  vertices  of  P  and  r  is  the  number  of  concave  corners. 

The  first  step  can  be  implemented  in  time  0{n  logn) 
using  the  GVD  of  P.  By  checking  the  adjacencies  of 
GVD  cells,  we  can  determine  the  nearest  non-incident 
edge  for  each  vertex  v  of  P;  one-eighth  this  distance 
gives  a  safe  radius  for  the  disks  next  to  v.  (Our  imple¬ 
mentation  actually  uses  some  other  choices  of  radii  to 
reduce  output  size.) 

Connecting  Holes.  The  second  step  connects  polyg¬ 
onal  holes  to  the  outer  boundary  by  repeatedly  adding 
a  disk  tangent  to  two  or  more  connected  components 
of  the  boundary.  In  this  step,  previous  disks  touching 


a  hole  boundary  are  considered  to  be  part  of  the  hole. 
At  the  end,  the  large  remainder  region  is  bounded  by  a 
simply-connected  arc-gon  with  0{n)  sides.  Every  cor¬ 
ner  of  this  arc-gon  has  angle  0,  since  each  results  from 
a  tangency. 

The  second  step  can  be  implemented  in  time 
0(n  log^n).  We  use  a  data  structure  that  answers 
queries  of  the  following  form:  given  a  query  point  p, 
which  data  object  (vertex,  edge,  or  disk)  will  be  hit 
first  by  an  expanding  circle  tangent  to  a  vertical  line 
through  p  (tangent  at  p  and  to  the  left  of  the  line)?  Such 
a  query  can  be  answered  using  Fortune’s  *-map  [13],  a 
sort  of  warped  Voronoi  diagram. 

The  initial  set  of  data  objects  includes  the  edges,  ver¬ 
tices,  and  disks  attached  to  the  outer  boundary  of  the 
input  polygon.  The  first  query  point  is  the  leftmost 
point  on  a  hole.  The  answer  determines  a  disk  D  en¬ 
tirely  contained  within  the  polygon,  touching  both  the 
hole  and  the  outer  boundary.  Disk  D  is  inserted  into 
the  query  data  structure,  along  with  the  vertices,  edges 
and  disks  of  the  hole.  Each  subsequent  query  is  per¬ 
formed  using  the  leftmost  point  of  all  remaining  holes. 
Altogether,  the  queries  yield  a  set  of  disks  connecting 
all  holes  and  the  exterior  of  the  polygon. 

For  a  static  set  of  data  objects,  the  =t:-map  can  be 
built  in  time  O(nlogn)  [13],  and  standard  planar  sub¬ 
division  search  techniques  [21]  yield  an  O(logn)  query 
time.  In  our  case,  the  set  of  data  objects  is  not  fixed, 
since  edges  and  a  disk  are  added  following  each  query.  A 
trick  due  to  Bentley  and  Saxe  [5]  allows  dynamic  inser¬ 
tions  to  the  query  structure,  with  query  time  0(log^  n) 
and  amortized  insert  time  0(log^  n).  The  trick  is  to  di¬ 
vide  the  n  data  objects  among  0(log  n)  data  structures, 
one  for  each  bit  in  the  binary  representation,  of  n.  A 
query  searches  all  data  structures  in  0(log^  n)  time.  An 
insertion  rebuilds  all  the  data  structures  corresponding 
to  bits  that  change.  The  total  time  required  for  n  in¬ 
sertions  is  0(nlog^  n). 

Reducing  to  3-  and  4-Sided  Remainder  Regions. 

After  the  first  two  steps,  there  is  one  simply-connected 
remainder  region  A  with  0{n)  sides,  and  0{n)  remain¬ 
der  regions  in  corners  with  three  or  four  sides.  The  final 
step  of  the  disk-packing  stage  recursively  subdivides  A 
by  adding  disks.  The  result  will  be  a  linear  number  of 
remainder  regions  of  three  and  four  sides. 

To  subdivide  arc-gon  A,  we  add  a  disk  tangent  to 
three  of  its  sides.  Such  a  disk  divides  the  region  enclosed 
by  the  arc-gon  into  four  pieces:  the  disk  itself  and  three 
smaller  regions  bounded  by  arc-gons.  We  choose  a  disk 
tangent  to  three  sides  of  A,  not  all  of  them  consecutive, 
thereby  ensuring  that  each  of  the  three  smaller  arc-gons 
has  at  most  n  —  1  sides.  As  shown  in  Figure  3,  a  disk 
tangent  to  three  sides  of  an  arc-gon  must  be  centered 
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Figure  4.  Remainder  regions  with  vertices  of  P. 


Figure  3.  A  disk  tangent  to  three  edges  of  an  arc-gon  is 
centered  at  a  vertex  of  the  GVD. 

at  a  vertex  of  the  GVD.  Since  A  is  simply  connected, 
the  edges  of  its  GVD  form  a  tree,  a  fact  that  will  be 
useful  in  bounding  the  running  time. 

Lemma  1.  It  is  possible  to  reduce  all  remainder 
regions  to  at  most  4  sides,  by  packing  0(n)  non¬ 
overlapping  disks  into  arc-gon  A. 

Proof:  Each  vertex  of  the  GVD  corresponds  to  a  disk 
tangent  to  three  sides  of  A.  If  A  has  at  least  five  sides, 
then  there  is  a  vertex  v  of  the  GVD  that  is  adjacent  to 
two  non- leaf  vertices  of  the  GVD;  a  disk  centered  at  v  is 
tangent  to  three  sides  of  A  that  are  not  all  consecutive. 

Now  let  d(n)  be  the  maximum  number  of  disks 
needed  to  reduce  an  n-sided  arc-gon  to  3-  and  4-sided 
remainder  regions.  We  prove  d(n)  <  n  —  4  by  induction 
on  n.  The  base  cases  are  d(3)  =  0  and  d(4)  =  0. 

For  the  inductive  step,  notice  that  adding  one  disk 
produces  three  new  arc-gons.  (We  can  simply  ignore 
extra  tangencies  in  the  degenerate  case  of  four  or  more 
tangencies.)  Suppose  the  new  arc-gons  have  k,  I,  m 
sides,  respectively,  with  3  <  k  <  I  <  m.  Since  we 
are  choosing  non-consecutive  sides,  m  <  n.  Counting 
1  for  the  added  disk,  we  have  that  d{n)  <  1  +  d{k)  -|- 
d(l)  -f  d(m).  Since  the  disk  divides  three  sides,  and  is 
itself  divided  in  three  places,  we  have  k-\-l-\-m  =  n-|-6. 

First  suppose  k  =  3.  Since  we  are  choosing  non- 
consecutive  sides,  /  >  4,  so 

d(n)  ^  1  -f-  d(3)  -f  d(/)  4-  d(?Tz) 

<  1  -I-  0  4-  (/  -  4)  -f  (m  —  4) 

=  (/  -f  m)  —  7  =  (n  4-  3)  —  7  =  n  —  4. 

When  fc  >  4,  we  have  d(n)  <  1 d{k) d{l)  +  d{m) .  By 
induction,  d{n)  <  1  4-  (A;  —  4)  -f  (/  —  4)  4-  (m  —  4),  which 

is  equal  to  (k  +  I  +  m)  —  ll  =  (n4-6)  —  ll  =  n  —  5.  ■ 

Finally  we  comment  on  running  time.  Any  tree  con¬ 
tains  a  vertex,  called  a  centroid,  whose  removal  leaves 
subtrees  of  size  at  most  two-thirds  the  original  size.  By 


choosing  a  disk  centered  at  a  centroid  of  the  GVD  of  A, 
we  split  A  into  arc-gons  Ai,  A2,  and  A3.  We  imagine 
splitting  Ai,  A2,  and  A3  in  parallel,  so  that  altogether 
there  will  be  O(logn)  splitting  stages,  each  involving  a 
set  of  arc-gons  of  total  complexity  0{n).  If  we  recom¬ 
pute  GVD’s  from  scratch  after  each  splitting  stage,  we 
obtain  total  time  O(nlog^  n).  This  can  be  improved  to 
O(nlogn)  by  rebuilding  GVD’s  in  linear  time,  using  an 
adaptation  of  the  algorithm  due  to  Aggarwal  et  al.  [1]. 

4.  Triangulating  the  Pieces 

We  now  describe  the  second  stage  of  our  algorithm.  At 
this  point,  polygonal  region  P  has  been  partitioned  into 
disks  and  remainder  regions  with  three  or  four  sides, 
either  straight  or  circular  arcs.  Each  circular  arc  of  a 
remainder  region  R  is  naturally  associated  with  a  pie¬ 
shaped  sector,  namely  the  convex  hull  of  the  arc  and 
the  center  of  the  circle  containing  the  arc.  We  denote 
the  union  of  R  and  its  associated  sectors  by  R'^ .  These 
augmented  remainder  regions  define  a  decomposition 
of  P  into  simple  polygons  with  disjoint  interiors.  In 
an  augmented  remainder  region,  we  retain  vertices  at 
circle  tangencies;  vertices  such  as  these,  at  which  the 
angles  measure  tt,  are  called  subdivision  points. 

In  this  section,  we  show  how  to  triangulate  each  72+ 
region.  All  Steiner  points  will  lie  either  on  straight 
sides  of  R  (that  is,  along  P’s  boundary)  or  interior  to 
72+ .  Thus  we  never  place  Steiner  points  on  the  radii 
bounding  sectors,  and  triangulated  72+  regions  will  fit 
together  at  the  end.  Our  triangulation  method  is  given 
in  three  cases:  remainder  regions  with  vertices  of  P, 
three-sided  remainder  regions,  and  four-sided  remain¬ 
der  regions.  The  first  two  cases  are  easy,  but  the  last  is 
quite  intricate.  In  all  cases,  triangulating  a  single  72+ 
region  takes  0(1)  time,  so  altogether  the  running  time 
of  the  second  stage  is  0{n). 

Remainder  Regions  with  Vertices  of  P.  Every 
vertex  of  P  was  isolated  by  one  or  two  disks  in  the  first 
step  of  the  algorithm.  The  resulting  regions  72+  can  be 
triangulated  with  at  most  four  right  triangles,  as  shown 
in  Figure  4,  by  adding  edges  from  the  disk  centers  to 
the  points  of  tangency  and  the  vertex  of  P. 
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Figure  5.  Three-sided  remainder  regions:  (a)  with  a  straight 
side,  (b)  with  only  finite-radius  arcs. 


Figure  6.  The  good  case  for  four-sided  remainder  regions. 


Three-Sided  Rem£iinder  Regions.  A  three-sided 
remainder  region  R  without  a  vertex  of  P  is  bounded 
by  three  circular  arcs,  so  that  arcs  meet  tangentially  at 
the  vertices  of  R.  Here  we  are  considering  a  straight 
side  to  be  an  arc  of  an  infinitely  large  circle.  We  call 
a  Steiner  point  in  an  augmented  remainder  region  R^ 
safe  if  it  lies  either  interior  to  or  on  the  boundary 
of  P. 

Lemma  2.  If  R  is  a  three-sided  remainder  region, 
then  can  be  triangulated  with  at  most  six  right 
triangles,  adding  only  safe  Steiner  points. 

Proof:  First  assume  that  R  has  a  straight  side  (nec¬ 
essarily  at  most  one),  and  view  R  so  that  this  straight 
side  forms  a  horizontal  base.  The  augmented  region  iZ"*" 
is  a  trapezoid  with  two  vertical  sides,  and  a  subdivision 
point  p  along  its  slanted  top  side.  We  cut  perpendic¬ 
ularly  from  p  (that  is,  tangent  to  both  arcs)  across  R 
until  we  hit  the  base,  and  there  add  a  safe  Steiner  point 
s.  We  add  edges  from  s  to  the  centers  of  the  arcs’  cir¬ 
cles  to  divide  iZ+  into  four  right  triangles,  as  shown  in 
Figure  5(a). 

Now  assume  all  the  sides  of  R  are  arcs  of  finite  ra¬ 
dius.  Notice  that  is  a  triangle  with  subdivided  sides. 
Moreover,  the  subdivision  points  along  the  sides  of  iZ+ 
are  exactly  the  tangency  points  of  the  inscribed  circle 
of  iZ+.  (This  follows  from  the  fact  that  the  inscribed 
circle  makes  each  corner  of  incident  to  two  edges  of 
equal  length.)  So  we  add  the  circle’s  center  c  and  edges 
from  c  to  all  the  vertices  around  iZ+,  dividing  into 
six  right  triangles,  as  shown  in  Figure  5(b).  ■ 

Four-Sided  Remednder  Regions.  A  four-sided  re¬ 
mainder  region  R  is  bounded  by  four  circular  arcs  (pos¬ 
sibly  of  infinite  radius)  that  meet  tangentially  at  the 
vertices  of  R.  Lemma  3  states  two  interesting  proper¬ 
ties  of  these  regions. 

Lemma  3.  The  arcs  of  R  have  total  measure  27r.  The 
vertices  of  R  are  cocircular. 


Proof:  If  all  arcs  have  finite  radius,  then  the  sum  of 
the  measures  of  the  arcs  of  R  is  identical  to  the  snm  of 
the  mccisures  of  the  angles  at  the  corners  of  iZ"!".  For 
straight  sides,  we  imagine  further  augmenting  iZ+  with 
“infinite  sectors”  of  angle  0. 

Next  we  show  that  the  vertices  are  cocircular.  Let 
Cl  and  Cs  be  finite-radius  circles  containing  opposite 
arcs  of  R.  (Here  notice  that  if  R  has  two  straight  sides, 
they  must  be  opposite.)  Assume  the  two  lines  that 
are  externally  tangent  to  both  Ci  and  Cs  meet  at  a 
point  X.  There  exists  an  inversive  transformation  ([11], 
pp.  77-95)  of  the  projective  plane  that  maps  x  to  infin¬ 
ity  and  hence  the  two  external  tangent  lines  to  parallel 
lines.  The  transformed  circles  and  C3,  correspond¬ 
ing  to  Cl  and  C3,  have  equal  size,  so  the  vertices  of 
the  transformed  remainder  region  R'  form  an  isosceles 
trapezoid.  It  is  easy  to  see  that  any  isosceles  trapezoid 
has  cocircular  vertices.  The  inverse  of  the  original  in¬ 
versive  transformation  maps  the  circle  containing  the 
vertices  of  R'  to  a  circle  containing  the  vertices  of  R.  ■ 

Now  if  we  are  lucky,  the  region  iZ+  can  be  triangu¬ 
lated  with  16  right  triangles,  as  in  the  following  lemma. 

Lemma  4.  If  R  is  a  four-sided  remainder  region,  in 
which  each  arc  measures  at  most  w  and  the  center  of 
the  circle  through  R’s  vertices  lies  in  the  convex  hull  of 
R,  then  iZ+  can  be  triangulated  with  16  right  triangles, 
adding  only  safe  Steiner  points. 

Proof:  We  may  assume  that  all  arcs  of  R  have  finite 
radius.  If  R  has  a  straight  edge,  we  can  apply  the  trian¬ 
gulation  to  a  region  with  an  infinite  sector  attached  to 
the  straight  edge  and  then  simply  remove  the  resulting 
infinite  strips. 

The  construction  is  shown  in  Figure  6.  Here  we  have 
added  the  center  c  of  the  circle  through  R’s  vertices  in 
order  to  form  four  kites  (quadrilaterals  with  two  adja¬ 
cent  pairs  of  equal-length  sides) .  ■ 
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Figure  8.  (a)  MutucJ  tangents  and  mutual  chord  meet  at  a  point.  (b)  Triangulation. 


Figure  7.  New  circle  C*  breaks  up  a  reflex  remainder  region. 

The  triangulation  of  Figure  6  can  fail  in  two  different 
ways:  (1)  if  one  of  the  arcs  of  R  measures  more  than  (a 
reflex  arc),  then  i?+  has  a  reflex  vertex  at  which  angles 
will  measure  more  than  7r/2;  and  (2)  if  center  c  lies 
outside  the  convex  hull  of  R,  then  it  lies  on  the  wrong 
side  of  one  of  the  chords  and  will  introduce  unwanted 
intersections.  Each  of  these  difficulties  will  be  handled 
by  adding  yet  another  disk. 

First  assume  R  has  a  reflex  arc  on  circle  C3.  Add 
another  disk  C* ,  tangent  to  C3  and  Ci  (the  circle  con¬ 
taining  the  arc  opposite  Cs),  such  that  the  center  of 
C*  lies  on  the  line  joining  the  centers  of  Ci  and  C3. 
The  new  disk  C* — unlike  any  of  the  disks  used  up  un¬ 
til  this  point — may  overlap  an  old  disk  and  produce 
a  self-intersecting  remainder  region,  as  shown  in  Fig¬ 
ure  7.  Lemma  3  holds  without  modification  for  self- 
intersecting  remainder  regions.  Region  R'^ ,  formed  as 
before  by  adding  the  associated  sectors  to  R,  remains 
a  simple  polygon  with  subdivision  points  on  its  sides, 
specifically  a  triangle  with  three  subdivisions  on  one 
side  and  one  on  each  of  the  others.  The  next  lemma 
shows  how  to  triangulate  R+  with  a  generalization  of 
the  method  of  Lemma  2. 

Lemma  5.  Let  R  be  a  self-intersecting  four-sided  re¬ 
mainder  region  resulting  from  breaking  up  a  reflex  four¬ 
sided  remainder  region  by  the  addition  of  C* .  Then 
i?+  can  be  triangulated  with  at  most  12  right  triangles, 
adding  only  safe  Steiner  points. 


Proof:  Again  we  may  assume  that  all  arcs  of  R  have 
finite  radius,  as  a  solution  to  this  case  implies  a  trian¬ 
gulation  for  the  case  of  straight  sides. 

Consider  one  of  the  arcs  S  next  to  C* .  We  claim  that 
the  lines  tangent  to  S  at  its  endpoints  and  the  mutual 
chord  of  C*  and  its  opposite  arc  all  meet  at  a  single 
point  p  interior  to  R,  as  shown  in  Figure  8(a).  This 
claim  allows  the  triangulation  shown  in  Figure  8(b). 

Why  is  the  claim  true?  For  each  of  the  three  disks — 
C* ,  the  opposite  disk,  and  the  one  with  arc  S — we 
define  a  power  function.  The  power  function  of  a 
circle  with  center  {xcVc)  and  radius  r  is  P{x,y)  = 
{x  —  Xc)'^-{-{y—yc)^—r'^-  The  power  functions  of  two  tan¬ 
gent  circles  are  equal  along  their  mutual  tangent  line; 
the  power  functions  of  two  overlapping  circles  are  equal 
along  a  line  containing  their  mutual  chord.  The  point 
p  of  the  claim  is  the  point  at  which  all  three  power 
functions  are  equal.  ■ 

We  now  consider  the  second  difficulty.  Call  a  four¬ 
sided  remainder  region  R  centered  if  the  convex  hull 
of  R  contains  the  center  c  of  the  circle  through  i?’s 
vertices,  and  uncentered  otherwise.  Let  the  arc  of  R 
with  the  longest  chord  lie  along  circle  Ci,  and  denote 
the  other  circles  by  C2,  C3,  and  C4,  clockwise  around 
R.  (Circles  through  infinity  handle  the  case  of  straight 
sides.)  Assume  that  the  line  through  the  centers  of  Ci 
and  Cz  is  vertical  as  in  Figure  9.  If  i?  is  uncentered, 
then  c  must  lie  below  the  chord  on  Ci- 

Let  <12  be  the  vertex  of  R  at  which  Ci  and  C2  meet, 
and  similarly  define  <23,  ^34,  and  t^i.  For  a  disk  C*  tan¬ 
gent  to  both  Cl  and  C3,  let  Sl  (=  Sl{C*)  )  be  the  circu¬ 
lar  arc  with  endpoints  ti2  and  ^23  that  passes  through 
the  points  at  which  C*  meets  Ci  and  C3.  Lemma  3 
guarantees  that  such  an  arc  exists.  Similarly  define  Sr  ■ 
Let  Cl  and  be  the  centers  of  the  circles  containing 
Sl  and  Sr,  respectively. 
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Lemma  6.  There  exists  a  disk  C*  tangent  to  Ci  and 
C3,  such  that  Cl  lies  in  the  convex  hull  of  the  four  points 
of  tangency  around  Sl  ^nd  Cr  lies  in  the  convex  hull  of 
the  four  points  of  tangency  around  Sr- 

Proof:  First  let  C*  be  the  disk  that  is  tangent  to 

Cl  and  C3  such  that  the  center  of  C*  lies  on  the  line 
through  the  centers  of  Ci  and  C3. 

Centers  cl  and  Cr  lie  on  a  horizontal  line  through 
the  center  of  C* ,  hence  outside  Ci  and  C3.  But  the 
requirements  of  the  lemma  may  be  violated,  because  Cr 
may  lie  outside  the  chord  <34/41  of  Sr  (if  Sr  has  measure 
less  than  tt)  or  cl  may  lie  outside  the  chord  <i2<23  of  Sl 
(if  Sl  has  measure  less  than  tt).  We  assert  that  both 
of  these  bad  conditions  cannot  occur  at  the  same  time. 
Why?  It  suffices  to  show  that  the  sum  of  the  measures 
of  Sl  and  Sr  is  at  least  2ir.  Angle  Z<23<»3<34,  where 
<*3  is  the  point  of  tangency  of  C*  and  C3,  measures  at 
least  7r/2,  because  the  arc  of  R  on  C3  measures  at  most 
TT.  And  Z.<i2<*3<4i  measures  more  than  7r/2,  because 
the  center  of  the  circle  through  the  vertices  of  R  lies 
below  <i2<4i-  Hence  the  remaining  angles  at  <,3  (the 
two  subtended  by  the  endpoints  of  Sl  and  Sr)  sum  to 
less  than  tt. 

If  neither  bad  condition  occurs,  then  C*  satisfies  the 
conditions  of  the  lemma,  and  we  are  done.  But  if  one  of 
the  bad  conditions  does  occur,  then  we  sweep  C*  in  the 
direction  that  could  cure  the  condition,  while  keeping 
C*  tangent  to  both  Ci  and  C3.  If  Cr  lies  outside  <34/41, 
then  we  sweep  C*  to  the  left  in  Figure  9;  the  other  case 
is  symmetrical. 

During  the  leftward  sweep,  Cr  moves  towards  Ci 
along  the  perpendicular  bisector  of  <34/41  and  cl  moves 
towards  C2  along  the  perpendicular  bisector  of  <i2<23i  as 
shown  in  Figure  9.  These  bisectors  never  intersect  C3, 
so  Ci  and  Cr  can  never  lie  outside  their  chords  (<23<*3 
and  <*3/34)  on  C3.  The  chords  of  Sl  and  Sr  between 
tangency  points  on  C*  are  never  the  longest  chords  on 
these  arcs,  so  cl  and  Cr  also  lie  safely  inside  these  chords 
throughout  the  sweep. 

As  Cr  moves,  it  must  pass  through  <34/41  and  become 
good,  before  it  reaches  Ci  and  becomes  bad.  By  the 
arc-measure  argument  above,  Cr  must  cross  inside  <34/41 
before  cl  crosses  outside  <i2<23-  Hence  at  some  point 
in  the  sweep,  both  Cr  and  cl  satisfy  the  conditions  of 
the  lemma,  and  the  C*  at  this  point  is  C* .  ■ 

Lemma  6  breaks  up  uncentered,  non-reflex  remainder 
regions,  but  unless  C*  coincides  with  the  initial  C*  in 
the  sweep,  adding  C*  creates  a  new  reflex  remainder 
region.  The  following  lemma  finesses  this  final  difficulty 
(shall  we  say  circularity?)  by  triangulating  both  new 
augmented  regions  at  once. 


Figure  9.  The  trajectories  of  centers  cl  and  Cr  as  C*  sweeps. 


Lemma  7.  Let  R  be  a  non-reSex,  uncentered,  four¬ 
sided  remainder  region.  Then  R'^  can  be  triangulated 
into  at  most  28  right  triangles,  adding  only  safe  Steiner 
points. 

Proof:  Again  we  may  assume  that  all  arcs  of  R  have 
finite  radius.  We  start  by  adding  the  “centering”  disk 
C*,  guaranteed  by  Lemma  6.  As  above,  we  denote  the 
tangent  point  of  C*  and  Ci  by  <*i  and  the  tangent 
point  of  C*  and  C3  by  <*3.  In  addition  to  <*i  and  <*3, 
we  add  the  following  Steiner  points:  the  centers  cr  and 
Cr  associated  with  arcs  Sr  and  Sr,  and  the  midpoint  m 
of  segment  <,i<*3.  See  Figure  10. 

We  triangulate  by  adding:  all  chords  around  St  and 
Sr',  edges  from  ct  to  m  and  to  the  centers  of  Ci,  C2,  and 
C3;  and  edges  from  Cr  to  m  and  to  the  centers  of  C3, 
C4,  and  Cl.  Finally  we  add  an  edge  between  the  center 
of  Cl  and  <*i  and  between  the  center  of  C3  and  <*3. 

Resulting  triangles  come  in  sets  of  four,  each  set  form¬ 
ing  a  kite  and  its  diagonals.  Hence  all  triangles  are 
right.  (Notice  that  C*  is  treated  somewhat  differently 
than  the  other  circles,  since  we  do  not  use  its  center. 
Nevertheless  the  four  triangles  around  m  form  a  kite, 
because  <*i<,3  is  the  mutual  chord  of  C*,  S2,  and  S4.) 
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We  have  now  completed  the  proof  of  our  main  theo¬ 
rem,  linear-size  nonobtuse  triangulation. 

Theorem  1.  An  n-vertex  polygonal  region  can  be  tri¬ 
angulated  with  0(n)  right  triangles,  in  time  O(nlogn) 
for  simple  polygons  and  0(n  log^  n)  for  polygons  with 
holes.  I 


5.  Implementation 

We  implemented  our  algorithm  within  the  Matlab  en¬ 
vironment  [15].  The  implementation  differs  somewhat 
from  the  algorithm  described  in  the  text.  We  use  several 
heuristics  for  disk  placement  so  as  to  reduce  the  number 
of  triangles.  Also  we  do  not  bother  to  compute  general¬ 
ized  Voronoi  diagrams.  Rather  we  use  a  simple  0(hn) 
method  to  connect  h  holes  to  the  boundary,  and  we 
choose  arbitrary  disks  touching  three  non-consecutive 
sides,  rather  than  disks  centered  at  GVD  centroids.  To 
keep  the  user  entertained  during  the  worst-case  O(n^) 
running  time,  we  display  color-coded  disks  and  triangles 
as  they  are  added.  Finally,  although  Section  4  describes 
a  construction  using  only  right  triangles,  the  implemen¬ 
tation  produces  some  acute  triangles,  an  example  being 
the  large  downward-pointing  triangles  in  Figure  1(c). 

Experiments  with  a  variety  of  polygonal  regions  show 
that  an  n-vertex  input  typically  produces  about  22n 
triangles.  A  simple  polygon  with  n  —  3  reflex  corners 
can  produce  as  many  as  25n  triangles;  the  maximum 
for  polygons  with  holes  appears  to  be  about  33n.  Since 
a  floating  point  representation  entails  roundoff,  some  of 
the  right  angles  present  in  the  nonobtuse  triangulation 
become  slightly  obtuse.  The  worst  test  case  had  an 
angle  of  about  ^  -f  10“^^  radians  (Matlab  retains  16 
digits),  so  the  implementation  is  fairly  robust,  which 
is  somewhat  surprising  given  that  our  implementation 
often  places  very  small  disks  next  to  very  large  ones. 

6.  Parallelizing  the  Algorithm 

We  now  sketch  the  first  AfC  algorithm  for  nonobtuse 
triangulation.  We  give  a  straightforward  though  rather 
inefficient  algorithm,  with  parallel  time  0(log^  n)  and 
processor  requirement  O(n^).  Both  time  and  processors 
should  be  improvable.  One  bottleneck  subproblem  is 
the  computation  of  the  GVD  of  circular  arcs;  see  [14] 
for  the  GVD  of  line  segments. 

Theorem  2.  An  n-vertex  polygonal  region  P  (with 
holes)  can  be  triangulated  with  0(n)  right  triangles  in 
0(log^  n)  time  on  O(n^)  EREW  PRAM  processors. 


Proof:  Using  O(n^)  processors — one  for  each  vertex- 
edge  pair — and  time  0(log  n),  we  can  compute  the  near¬ 
est  non-incident  edge  for  each  vertex  and  hence  choose 
appropriate  radii  for  disks  to  pack  into  corners.  The 
second  step,  connecting  holes,  is  trickier.  We  first  com¬ 
pute  a  minimum  spanning  tree  (MST)  of  P’s  holes;  by 
this  we  mean  the  shortest  set  of  line  segments  S,  each 
segment  with  both  endpoints  on  the  boundary  of  P, 
such  that  the  union  of  S  and  the  exterior  of  P  is  a 
connected  subset  of  the  plane.  Using  O(n^)  proces¬ 
sors  and  time  0(log  n),  we  compute  for  each  vertex  the 
nearest  edge  lying  on  a  different  connected  component 
of  P’s  boundary.  We  use  this  information  to  compute 
distances  between  connected  components,  and  add  to 
S  the  shortest  component-joining  line  segment  incident 
to  each  component.  This  reduces  the  number  of  com¬ 
ponents  by  at  least  a  factor  of  two,  so  O(logn)  such 
merging  steps  suffices  to  complete  the  computation  of 
set  S. 

Now  it  is  not  hard  to  show  that  no  point  of  the 
plane  is  covered  by  more  than  0(1)  diameter  disks  of 
segments  in  S.  Hence  there  is  a  pairwise-disjoint  set 
of  diameter  disks  of  cardinality  a  constant  fraction  of 
|5|  [26] .  It  is  not  hard  to  find  these  disks  in  parallel 
time  O(logn)  using  separators.  We  repeat  the  process 
of  computing  the  MST  (of  the  new  connected  compo¬ 
nents,  holes  plus  disks)  and  finding  a  large  indepen¬ 
dent  set  of  diameter  disks.  After  O(logn)  cycles — for 
total  time  of  O(log^  n) — we  have  reduced  to  a  simply- 
connected  arc-gon. 

The  third  step  of  the  disk-packing  stage  uses  the 
generalized  Voronoi  diagram  in  order  to  find  centroid 
disks.  Using  O(n^)  processors  and  time  O(log^n),  we 
can  compute  the  GVD  of  a  set  of  n  circular  arcs  as  fol¬ 
lows.  We  compute  the  equal-distance  curve  (bisector) 
for  each  pair  of  arcs.  Then  for  each  arc  a,  we  com¬ 
pute  the  piecewise-polynomial  boundary  of  a’s  cell  re¬ 
cursively  by  dividing  the  set  of  bisectors  into  two  equal 
halves  and  then  merging  the  boundaries  for  each  half. 
Two  piecewise-polynomial  boundaries  of  0(n)  pieces 
can  be  merged  in  time  O(logn)  on  n  processors.  Once, 
the  GVD  has  been  computed,  a  centroid  can  be  found  in 
time  O(logn)  by  alternately  removing  leaves  and  merg¬ 
ing  degree-2  paths. 

Recall  that  the  algorithm  requires  a  “decomposition 
tree”  of  centroid  disks  of  height  O(logn),  so  by  simply 
recomputing  the  GVD  after  each  centroid,  we  obtain  an 
overall  time  for  the  third  disk-packing  step  of  0(log^  n). 
Finally,  the  triangulation  stage  consists  entirely  of  local 
operations,  so  it  is  trivially  parallelized.  ■ 
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7.  Conclusion 

We  have  presented  a  new  algorithm  for  nonobtuse  trian¬ 
gulation  of  polygons  with  holes.  The  number  of  trian¬ 
gles  produced  is  linear  in  the  number  of  vertices  of  the 
input,  a  significant  improvement  over  previous  meth¬ 
ods.  This  is  of  course  asymptotically  optimal,  resolving 
the  question  of  the  theoretical  complexity  of  nonobtuse 
triangulation  of  polygons. 

One  direction  for  further  work  is  extending  the  al¬ 
gorithm  to  inputs  more  general  than  polygons  with 
holes;  these  inputs  occur  in  modeling  domains  made 
of  more  than  one  material.  Currently,  there  is  an  algo¬ 
rithm  for  refining  a  triangulated  simple  polygon  into  a 
nonobtnse  triangulation  with  O(n^)  triangles,  and  also 
an  fi(n^)  lower  bound  [8].  There  is  still  no  algorithm 
for  polynomial-size  nonobtuse  triangulation  of  planar 
straight-line  graphs;  a  solution  to  this  problem  would 
give  another  solution  to  “conforming  Delaunay  trian¬ 
gulation”  [12].  Mitchell  [17]  recently  showed  how  to 
triangulate  planar  straight-line  graphs  with  maximum 
angle  at  most  Ttt/S,  using  at  most  logn)  triangles. 

Another  important  direction  is  exploring  whether  our 
ideas  can  be  used  for  related  mesh-generation  prob¬ 
lems.  For  instance,  disk-packing  may  yield  a  simpler 
algorithm  for  the  problem  of  no-small-angle,  nonobtuse 
triangulation  [3,  16].  Perhaps  we  can  use  our  meth¬ 
ods  to  produce  nonobtuse  meshes  with  skinny  trian¬ 
gles  aligned  with  the  boundary.  (See  [19]  for  aligned 
no-large-angle  meshes.)  Or  perhaps  our  methods  can 
be  allied  with  a  heuristic  method  called  “bubble  sys¬ 
tems”  [24]. 

Finally,  higher  dimensions  are  still  a  mystery.  Do 
three-dimensional  polyhedra  admit  polynomial-size  tri¬ 
angulations  without  obtuse  dihedral  angles?  Algo¬ 
rithms  for  point  sets  are  known  [6,  10]. 
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1  Introduction 

We  extend  the  work  of  Bern  [5]  on  triangulating  the 
region  between  three-dimensional  polyhedra.  Here  is  a 
summary  of  our  results: 

•  We  give  a  linear  algorithm  for  triangulating  the 
region  between  two  convex  polyhedra.  No  assump¬ 
tions  are  made  on  the  polyhedra:  they  can  be  dis¬ 
joint,  nested,  or  overlapping.  The  algorithm  pro¬ 
duces  0{n)  tetrahedra,  where  n  is  the  input  size. 
This  optimal  bound  improves  on  an  0{n  log  n)-size 
construction  of  Bern  [5]. 

•  We  give  a  method  for  triangulating  the  region  be¬ 
tween  a  convex  polyhedron  and  a  disjoint  polyhe¬ 
dral  terrain  (or  a  star-shaped  polyhedron).  The 
number  of  tetrahedra  produced  is  O(nlogn).  We 
prove  the  rather  surprising  result  that  the  bound 
is  tight.  More  generally,  we  show  that  any  convex 
decomposition  of  the  space  between  a  convex  poly¬ 
hedron  and  a  terrain  requires  n(nlogn)  pieces  in 
the  worst  case. 

•  We  prove  that  any  polyhedron  of  genus  g  must  have 
at  least  g  —  1  reflex  dihedral  angles.  To  our  surprise, 
this  result  appears  to  be  new.  Actually,  we  were 
not  even  able  to  find  any  proof  in  the  literature  that 
the  number  of  reflex  edges  is  0(ff).  Although  our 
proof  is  quite  simple  it  relies  on  the  Gauss- Bonnet 
curvature  formula  and  thus  it  is  not  completely  self- 
contained.  This  result  shows  that  the  triangulation 
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algorithm  of  Chazelle  and  Palios  [8]  works  just  the 
same  for  polyhedra  of  arbitrary  genus:  specifically, 
any  polyhedron  with  n  vertices  and  r  reflex  angles 
can  be  triangulated  with  0{n  +  r^)  tetrahedra,  re¬ 
gardless  of  its  genus.  The  bound  is  tight  in  the 
worst  case. 

Tetrahedralization  refers  to  the  triangulation  of 
three-dimensional  polyhedra  or  the  region  between 
them.  The  subject  has  been  heavily  researched  because 
of  its  relevance  to  the  finite-element  method,  mesh  gen¬ 
eration,  topology  of  3-manifolds,  interpolation  theory, 
tool  design,  etc.  [1,  2,  3,  4,  5,  8,  9,  14,  15].  An  odd  as¬ 
sortment  of  results  has  emerged.  For  example,  all  poly¬ 
hedra  can  be  tetrahedralized,  but  nonconvex  ones  might 
require  additional  vertices,  so-called  Steiner  points:  the 
canonical  example  is  the  Schonhardt  octahedron  [17], 
which  is  made  by  connecting  together  two  parallel, 
slightly  twisted  triangles.  Checking  whether  Steiner 
points  are  needed  or  not  has  been  shown  to  be  NP-hard 
by  Ruppert  and  Seidel  [16].  Steiner  points  are  some¬ 
what  of  a  nuisance  in  practice,  because  they  make  repre¬ 
sentations  more  complicated  and  cause  round-off  error 
problems.  Unfortunately  they  can  rarely  be  avoided. 
The  number  and  the  shape  of  the  tetrahedra  are  pa¬ 
rameters  that  are  both  more  critical  and  malleable.  The 
former  is  where  we  focus  our  attention  in  this  paper. 

It  was  shown  in  [8]  that  any  n- vertex  polyhedron  can 
be  triangulated  using  only  Q{n^)  tetrahedra.  This  is 
known  to  be  optimal  in  the  worst  case  [6].  Of  course,  in 
many  cases,  it  is  possible  to  do  much  better.  For  exam¬ 
ple,  a  convex  polyhedron  requires  only  0{n)  tetrahedra. 
Bern  [5]  provided  motivation  for  looking  at  the  region 
between  two  convex  polyhedra.  If  the  polyhedra  are  dis¬ 
joint  (side-by-side)  it  is  quite  easy  to  achieve  an  0{n)- 
size  triangulation.  The  nested  caise  is  more  difficult,  and 
Bern  proposed  a  construction  consisting  of  O(nlogn) 
tetrahedra.  Our  first  result  improves  this  to  0{n), 
which  is  optimal.  Furthermore,  the  method  allows  the 
polyhedra  to  be  intersecting.  It  is  based  on  a  dovetailed 
construction  of  the  Dobkin-Kirkpatrick  polyhedral  hi¬ 
erarchy  [10],  somewhat  in  the  style  of  [7]  though  much 
simpler.  Unlike  Bern’s  construction,  our  method  uses 
Steiner  points.  Whether  this  can  be  avoided  remains 
an  open  problem. 
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The  polyhedron  used  in  the  quadratic  lower  bound 
proof  of  [6]  is  built  by  placing  two  polyhedral  terrains/ 
one  on  top  of  the  other.  It  is  natural  to  ask  whether 
the  lower  bound  collapses  if  one  terrain  is  convex.  Our 
second  result  shows  that  0(nlogn)  is  the  exact  answer. 
The  upper  bound  is  obtained  by  carefully  pruning  the 
outer  polyhedral  hierarchy  at  critical  places  and  tracing 
its  interaction  with  the  terrain  by  way  of  a  fairly  delicate 
accounting  scheme. 

The  lower  bound  is  achieved  by  creating  an  art- 
gallery  that  requires  on  the  order  of  n  log  n  guards:  the 
roof  of  the  gallery  is  a  hangar-like  concave  structure;  the 
floor  plan  is  a  terrain  parameterized  by  a  permutation. 
We  prove  that  the  number  of  guards  needed  is  precisely 
equal  to  the  number  of  swaps  required  in  mergesorting 
that  permutation.  Expectedly,  we  use  the  bit-reversal 
permutation  to  achieve  the  best  possible  lower  bound. 
The  essence  of  our  proof  technique  is  to  be  able  to  in¬ 
terpret  a  triangulation  scheme  as  a  permutation  routing 
network,  whereby  each  node  corresponds  to  one  or  sev¬ 
eral  distinct  tetrahedra.  This  connection  allows  us  to 
classify  the  floor  plans  that  lead  to  easy  triangulations 
by  just  looking  at  the  complexity  of  their  corresponding 
permutations. 

2  Triangulating  between  Two  Convex 
Polytopes 

Let  P  and  Q  be  two  convex  polytopes, ^  with  a  total 
of  n  vertices.  We  wish  to  subdivide  the  region  between 
the  boundaries  of  P  and  Q  into  0{n)  tetrahedra.  Naive 
methods  such  as  merging  the  Dobkin-Kirkpatrick  hier¬ 
archies  of  the  two  poly  topes  are  doomed  to  fail.  Instead, 
we  define  a  process  for  “thickening”  the  boundaries  of 
the  polytopes  in  coordinated  fashion,  until  they  become 
so  wide  as  to  cover  all  of  during  the  growth  process 
we  maintain  a  triangulation  of  the  union  of  the  thick¬ 
ened  boundaries.  For  this  to  work,  the  two  boundaries 
have  to  grow  at  comparable  speeds. 

To  flesh  out  this  idea,  we  introduce  the  key  notion 
of  a  polyhedral  layer,  which  is  simply  the  region  be¬ 
tween  two  nested  convex  polytopes.  The  outer  and  in¬ 
ner  boundaries  might  share  faces,  so  a  layer  is  topologi¬ 
cally  equivalent  to  5^  x  [0, 1]  pinched  at  various  places. 
We  define  the  magnitude  of  a  layer  to  be  the  total  num¬ 
ber  of  faces  (of  all  dimensions)  among  its  inner  and 
outer  boundaries.  Layers  will  always  come  along  with  a 

polyhedral  terrain  is  the  graph  of  a  bivciriate  piecewise- 
linear  continuous  function:  any  vertical  line  intersects  it  in  exactly 
one  point. 

poly  tope  is  a  bounded  polyhedron:  our  restriction  to  the 
boimded  case  is  meant  only  to  simplify  the  explanation.  It  is 
routine  to  extend  our  results  to  unboimded  polyhedra.  Simi¬ 
larly,  we  assume  that  the  polytopes  are  nondegenerate  and  well- 
behaved.  In  particular,  it  is  assumed  that  no  vertex  is  completely 
surrounded  by  coplanar  facets:  if  this  were  the  case,  we  would 
remove  the  vertex  and  merge  the  incident  facets  together. 


cell  decomposition,  so  any  standard  data  structure  for 
three-dimensional  cell  complexes®  will  provide  an  ade¬ 
quate  representation  [11].  Specifically,  we  need  lists  of 
vertices,  edges,  facets,  and  cells,  along  with  their  inci¬ 
dence  relations. 

Thickening  the  Boundaries.  A  layer  can  grow  in¬ 
wards  or  outwards.  We  begin  with  the  inward  growth. 
Pick  a  maximal  independent  set  of  low-degree  vertices 
on  the  inner  boundary.  For  each  such  vertex  v,  form 
the  convex  hull  of  its  adjacent  vertices  and  keep  only 
those  faces  non  adjacent  to  v.  glue  the  resulting  patch 
to  the  layer.  The  space  between  the  patch  and  the 
layer  is  called  a  pocket  (Fig.  1).  It  is  a  polyhedron 
of  constant  description  size.  Growing  a  convex  poly¬ 
tope  inwards  in  this  fashion  produces  the  well-known 
Dobkin-Kirkpatrick  hierarchy  [10]. 


V 


The  outward  growth  [12,  13]  is  equivalent  to  the  in¬ 
ward  growth  in  dual  space.  We  select  a  maximal  in¬ 
dependent  set  of  nonadjacent,  low-complexity  facets  on 
the  outer  boundary  of  the  layer.  For  each  such  facet 
/,  we  intersect  the  halfspaces  bounded  by  /  and  its 
adjacent  facets  (on  the  relevant  sides).  This  creates  a 
pocket  with  at  most  a  constant  number  of  other  facets, 
one  of  which  is  /  (Fig.  2).  There  is  one  delicate  point 


to  address:  the  facets  of  the  pocket  are  coplanar  with 
the  facets  adjacent  to  /.  On  the  one  hand,  we  must 
merge  those  facets  together,  otherwise  the  complexity 
of  the  new  outer  boundary  does  not  decrease  (actually 
it  strictly  increases).  On  the  other  hand,  if  we  merge 

^Recall  that  a  cell  complex  is  a  subdivision  of  a  region  into 
relatively  open  faces  (i.e.,  vertices,  edges,  facets,  cells)  such  that 
the  closures  of  two  adjacent  faces  intersect  in  the  closure  of  a  face. 
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the  facets,  we  no  longer  have  a  cell  complex.  To  re¬ 
solve  this  dilemma,  we  enclose  the  new  outer  boundary 
into  a  slightly  enlarged  copy  of  itself,  and  connect  it  in 
the  obvious  way  (Fig.  3).  This  creates  a  layer  of  very 
thin  buffer  pockets  that  completely  surrounds  the  outer 
boundary;  such  a  pocket  is  a  symbolic  device  and  need 
not  have  actual  volume. 


A  simple  but  important  observation  is  that  thick¬ 
ening  both  the  inner  and  outer  boundaries  of  the  layer 
decreases  its  magnitude  by  a  constant  factor.  Note  that 
the  magnitude  of  a  layer  may  have  nothing  to  do  with 
its  complexity  as  a  cell  complex:  as  a  matter  of  fact, 
growing  a  layer  increeises  the  complexity  of  its  cell  com¬ 
plex  but  decreases  its  magnitude. 

Coordinated  Growth  of  Two  Layers.  Let  us 
now  look  at  the  thickening  of  two  layers,  L  and  L',  ema¬ 
nating  from  P  and  Q,  respectively.  We  do  not  make  any 
assumptions  about  the  relative  positioning  of  the  lay¬ 
ers.  We  assume  that  the  union  of  the  two  layers  is  fully 
available  as  a  cell  complex  (not  necessarily  simplicial); 
we  call  it  the  union  complex.  Note  that  it  also  provides 
a  cell  decomposition  of  the  boundary  of  LU  L' .  We  do 
not  assume,  however,  that  it  gives  a  cell  decomposition 
of  the  four  boundaries.  For  example,  portions  of  the 
boundaries  strictly  inside  L  L'  might  have  been  re¬ 
moved  and  have  no  representative  features  in  the  union 
complex. 

We  assume  that  all  four  boundaries  of  the  layers  are 
available  separately,  i.e.,  with  no  regard  to  the  fact  that 
some  might  intersect:  this  is  the  same  as  having  a  rep¬ 
resentation  of  four  distinct  convex  polytopes.  However, 
we  also  provide  pointers  from  the  facets  of  the  four 
boundaries  to  their  corresponding  facets  of  the  union 
complex:  for  this  to  make  sense,  we  require  that  any 
boundary  facet  that  does  not  lie  completely  inside  LVJL' 
should  have  its  “outside”  part  appear  as  the  union  of  at 
most  a  constant  number  of  facets  of  the  union  complex. 
(Just  one  is  not  enough,  because  such  a  facet  might 
be  homeomorphic  to  an  annulus,  so  it  needs  to  be  cut 
up  in  two  in  order  to  keep  all  the  facets  topologically 


equivalent  to  disks.) 

The  reason  we  need  this  correspondence  will  now  be¬ 
come  clear.  Suppose  that  we  wish  to  grow,  say,  the 
layer  L  inwards.  Our  first  task  is  to  determine  the 
intersection  of  the  new  pockets  of  L  with  the  current 
union  complex.  If  a  pocket  has  a  facet  in  (old)  dL  that 
intersects  L'  then  routine  navigation  through  the  cell 
complex  starting  at  that  intersection  does  the  job.  If 
not,  then  things  are  a  little  more  dicey,  as  the  pocket 
might  penetrate  L'  at  some  random  location.  We  use 
the  optimal  algorithm  of  [7]  to  intersect  the  boundaries 
of  L  and  L'  pairwise.  With  the  correspondence  men¬ 
tioned  above,  this  gives  us  access  to  all  the  union  com¬ 
plex  boundary  facets  that  intersect  the  pockets,  in  time 
linear  in  the  complexity  of  the  boundaries.  (There  are 
technical  details  which  we  omit.) 

What  do  we  do  with  the  new  pockets?  A  pocket  that 
does  not  intersect  either  boundary  of  L'  falls  in  one  of 
two  categories  (Fig.  4):  if  it  lies  entirely  inside  L' ,  it  is 
ignored,  i.e.,  it  has  no  eflTect  on  the  union  complex.  Oth¬ 
erwise,  it  is  simply  added  to  the  union  complex.  The 


more  interesting  case  arises  if  a  pocket  partly  intersects 
L'  (Fig.  5).  Then,  we  triangulate  the  complement  of 
L'  within  the  pocket  and  add  these  faces  to  the  current 
union  complex.  In  Fig.  5  this  complement  consists  of 
two  full-dimensional  components;  in  general,  it  can  have 
up  to  a  constant  number  of  them  (because  the  pocket 
has  0(1)  complexity).  We  should  also  observe  that  the 
union  complex  is  only  modified  by  growing:  once  it  con¬ 
tains  a  face  it  keeps  it  forever.  Also,  as  we  mentioned 
before,  we  should  not  expect  the  boundary  of  the  new 
layer  L  to  be  represented  in  the  union  complex  in  its 
totality. 

Triangulating  the  complement  of  L'  in  the  pocket  is 
quite  easy  because  the  boundary  of  L'  is  convex  and  the 
pocket  has  constant  size.  Outward  growth  is  handled 
similarly.  The  only  difference  is  that  a  pocket  might 
now  have  a  large  size  (remember  those  thin  buffer  pock¬ 
ets).  It  is  an  easy  exercise,  however,  to  carry  out  the 
triangulation  in  linear  time. 

The  Triangulation  Algorithm.  Returning  now 
to  our  original  problem,  we  grow  the  layer  of  P  first 
inwards  and  then  outwards.  Then,  we  switch  and  do 
the  same  with  respect  to  Q.  We  iterate  this  process 
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size  of  the  input  polyhedra.  The  size  of  the  iriangulation 
is  asymptotically  optimal  in  the  worst  case. 


back  and  forth  until  the  boundaries  become  of  constant 
size.  At  that  point,  a  simple  finishing  touch  adds  a 
few  unbounded  tetrahedra  to  create  a  full-fledged  union 
complex  that  subdivides  all  of  IR^.  Buffer  pockets  that 
have  not  yet  been  triangulated  can  be  handled  now, 
which  produces  the  desired  simplicial  complex.  The 
total  space  and  time  complexity  is  easily  seen  to  follow 
a  geometric  series  summing  up  to  0{n). 

Theorem  2.1  It  is  possible  to  triangulate  the  region 
between  the  boundaries  of  two  convex  polytopes  in  linear 
time,  using  a  linear  number  of  faces. 


Remark:  The  dovetailing  mechanism  can  be  replaced 
by  variants  such  as  always  thickening  the  bigger  layer. 
However,  it  would  be  a  big  mistake  to  grow  one  layer 
all  the  way,  and  then  work  on  the  other.  Similarly, 
simply  merging  the  two  hierarchies  fails  miserably.  In 
both  cases,  on  the  order  of  n  logn  tetrahedra  might  end 
up  being  produced.  As  one  should  note,  a  key  idea  to 
avoid  this  blowup  has  been  to  avoid  refining  the  interior 
of  the  layers  as  they  grow. 

3  Triangulating  between  a  Polytope 
and  a  Terrain 

Let  P  and  Q  be  respectively  a  convex  polytope  and  a 
polyhedral  terrain.  We  could  also  choose  an  arbitrary 
star-shaped  polyhedron  as  Q,  since  it  can  be  made  into 
a  terrain  by  a  suitable  projective  transformation.  We 
assume  that  P  and  Q  do  not  intersect,  or  if  they  do, 
that  the  input  size  n  accounts  for  all  the  intersection 
features  as  well. 

Theorem  3.1  It  is  possible  to  triangulate  the  region 
between  two  disjoint  polytopes  (one  convex,  the  other 
star-shaped),  using  0{n\ogn)  tetrahedra,  where  n  is  the 


3.1  The  Upper  Bound 

A  natural  idea  would  be  to  compute  the  outer  Dobkin- 
Kirkpatrick  hierarchy  of  P  and  merge  the  terrain  Q 
within  it.  Such  a  naive  approach  runs  into  problems, 
however,  because  terrain  facets  may  intersect  too  many 
features  of  the  hierarchy.  We  get  around  this  problem 
by  pruning  the  resulting  cell  complex  appropriately.  Re¬ 
call  from  the  previous  section  that  the  outer  hierarchy 
of  P  subdivides  IR^  into  constant-complexity  pockets 
and  arbitrarily  thin  buffer  pockets.  Suppose  that  two 
facets  /,  g  of  the  terrain  intersect  the  same  pocket  and: 

1.  Both  facets  intersect  the  same  set  of  pocket  edges. 

2.  No  edge  of  either  facet  intersects  the  pocket. 

3.  The  portion  D  of  the  pocket  between  the  two  facets 
does  not  intersect  the  terrain  (except  at  f,g). 

A  polytope  D  that  satisfies  all  these  conditions  is 
called  a  drum  (Fig.  6):  it  consists  of  two  facets  (which 
are  pieces  of  the  terrain  facets  /,  g  that  cut  all  across 
the  pocket)  and  at  most  a  constant  number  of  lateral 
facets  (pieces  of  pocket  facets):  the  pair  {f,g)  consti¬ 
tutes  the  type  of  the  drum.  We  say  that  two  drums 


are  adjacent  if  they  share  a  common  lateral  facet.  Our 
pruning  strategy  involves  removing  all  the  lateral  facets 
between  adjacent  drums  (along  with  the  incident  edges 
and  vertices,  unless  they  are  needed  by  neighboring  non¬ 
drum  cells).  Note  that  because  two  adjacent  drums  are 
always  of  the  same  type,  pruning  never  brings  in  con¬ 
tact  drums  of  different  type.  The  merged  drums  are 
called  superdrums.  Two  obvious  questions  are:  How  do 
we  turn  the  resulting  subdivision  into  a  triangulation? 
Why  is  the  triangulation  of  size  O(nlogn)? 


Completing  the  Triangulation.  Let  us  examine 
each  pocket  separately.  If  the  pocket  is  free  of  any  ter¬ 
rain  features,  then  triangulating  it  is  routine  (as  usual, 
one  must  be  careful  that  the  triangulations  of  a  facet 
should  agree  on  both  sides,  but  this  is  easy  to  enforce; 


234 


see  [5]  for  a  thorough  treatment  of  a  more  general  and 
difficult  version  of  that  issue.)  If  the  pocket  does  not 
give  rise  to  any  drum,  then  no  pruning  takes  place 
within  and  we  must  triangulate  a  portion  of  the  ter¬ 
rain  inside  a  single  pocket.  If  the  pocket  is  not  of  the 
buffer  type,  then  it  is  of  constant  size.  So,  we  can  simply 
erect  vertical  walls  from  the  edges  and  triangulate  the 
resulting  cylinders  (which  is  a  two-dimensional  prob¬ 
lem),  while  also  adding  enough  internal  Steiner  points 
to  make  the  facet  triangulations  compatible  (see  [1,  8] 
for  a  similar  idea).  If  the  pocket  is  a  buffer,  then  we 
can  further  assume  that  it  contains  no  terrain  vertex, 
so  there  is  not  even  any  need  to  erect  walls. 

The  only  difficult  case  is  when  a  pocket  contains  one 
or  several  drums.  The  portion  of  the  pocket  outside 
its  drums  can  be  handled  as  before.  The  question  is 
how  to  triangulate  a  superdrum  R.  Such  a  polyhedron 
resembles  a  thick  polygon  (Fig.  7).  It  consists  of  two 
facets  F,  G  (arbitrarily  complex  simple  polygons  within 
/  and  g)  connected  together  by  lateral  facets.  As  in  a 
drum,  the  edges  of  the  lateral  facets  provide  a  bijection 
between  the  vertices  of  the  polygons  F  and  G.  Note 
that  a  given  pair  {f,g)  can  give  rise  to  more  than  one 
polyhedron  R. 


Naive  methods  for  triangulating  R  seem  to  run  into 
all  sorts  of  snags,  eis  one  must  be  careful  to  keep  the  size 
small.  Recall  that  the  pockets  contributing  the  drums 
of  R  are  arranged  in  a  hierarchical  fashion.  The  cross- 
section  of  the  outer  hierarchy  of  P  with  the  plane  sup¬ 
porting  f  is  a  nonempty  collection  of  O(logn)  nested 
convex  polygons,  CojCi,  etc.  The  2d-pockets  between 
them  are  convex  polygons  consisting  of  a  base  made  of 
some  number  of  collinear  segments  and  a  constant-size 
polygonal  arc  (Fig.  8).  The  non-dotted  2d-pockets  in 
the  figure  indicate  the  drums.  The  non-dotted  region, 
which  is  precisely  the  facet  F  of  R,  is  surrounded  by  a 
set  of  blocking  2d-pockets  (the  dotted  polygons).  Note 
that  these  blocking  pockets  might  not  necessarily  be  all 
adjacent  to  R. 

We  describe  a  method  for  subdividing  F  and  explain 
how  it  can  be  carried  over  to  R.  Let  Bi  be  the  set 
of  vertices  of  Ci  incident  to  blocking  2d-pockets.  Note 
that  the  Bi  are  not  necessarily  disjoint.  By  analogy, 
the  points  of  Bi  are  called  blocking.  We  define  Di  to 
be  the  convex  hull  of  all  the  points  in  Bo  U  •  •  •  U  Bj-. 


An  easy  way  to  visualize  Di  is  to  think  of  the  blocking 
points  of  Bj  as  nails  in  a  board  and  Gi  as  a  tight  rubber 
band  which  is  to  be  snapped:  the  new  position  of  Ci  is 
precisely  Di. 

Recall  that  we  have  an  identical  structure  on  the 
facet  G,  so  we  can  similarly  define  a  polygon  B-  from 
C'i.  Note,  however,  that  although  Ci  and  C-  are  in  bi¬ 
jection,  the  same  need  not  be  true  of  Di  and  D'^.  We 
now  match  each  Di  and  by  taking  the  convex  hull 
Hi  of  their  union.  Since  the  sequences  Do,Di, . . .  and 
Dq,D'i,.  . .  are  nested,  the  same  is  true  of  the  sequence 
of  convex  hulls  Ho,  Hi,  etc.  Let  Ki  —  {Hi  fl  B)  U  B’,_i, 
so  that  R  =  U"_i(A't  \  Bj_i).  Note  that  hi  \  Bi_i  is 
the  region  between  two  nested  convex  polytopes:  it  is 
free  of  any  blocking  material,  so  we  can  apply  Theorem 
2.1  to  triangulate  it. 

The  Complexity  of  the  Triangulation.  An  edge 
of  the  terrain  can  appear  in  only  O(logn)  pockets. 
It  follows  immediately  that  the  number  of  tetrahedra 
produced  outside  of  the  drums  is  O(nlogn).  For  the 
same  reason  the  combined  complexity  of  all  the  merged 
drums  is  at  most  O(nlogn).  The  only  question  remain¬ 
ing  is  how  many  tetrahedra  are  produced  during  the  tri¬ 
angulation  of  the  superdrum  R.  The  final  triangulation 
step  adds  only  a  constant  factor  to  the  size,  so  it  suffices 
to  estimate  the  total  complexity  of  the  Ho,  Hi,  etc.  The 
size  of  Hi  is  proportional  to  the  total  number  of  edges 
in  Di  and  B-.  But  notice  that  the  edges  in  all  the  Bj’s 
form  a  planar  graph,  therefore  it  suffices  to  count  the 
number  of  vertices,  which  is  lBo|  -1-  |Bil  -t-  ■  ■  Unfor¬ 
tunately,  this  estimate  is  too  crude.  We  must  take  into 
consideration  the  fact  that  not  all  blocking  points  con¬ 
tribute  to  the  complexity.  In  particular,  it  is  clear  that 
only  points  on  blocking  2d-pockets  that  are  adjacent  to 
R  can  become  vertices  of  the  B  fl  B^’s.  Furthermore, 
if  more  than  two  vertices  lie  on  the  base  of  a  blocking 
2d-pocket,  only  the  two  endpoints  can  play  any  role.  It 
thus  appears  that  the  contribution  of  superdrums  to  the 
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overall  size  of  the  triangulation  is  at  most  proportional 
to  the  the  total  number  of  blocking  2c/-pockets  adjacent 
to  superdrums.  Such  a  2d-pocket  is  blocking  for  one  of 
two  reasons: 

1.  It  is  intersected  by  an  edge  of  /  or  y  (Fig.  9). 


2.  It  is  not  intersected  by  any  edge  of  /  or  p  but  it 
is  adjacent  to  what  would  be  a  drum  D,  if  it  were 
not  for  a  terrain  edge  that  intersects  D  (Fig.  10). 


Obviously,  there  are  at  most  O(nlogn)  2c/-pockets  of 
the  first  kind.  The  same  is  true  of  the  second  kind  of 
pockets,  but  to  see  why  is  a  little  more  subtle.  The  crux 
is  that  no  single  terrain  edge  can  create  more  than  two 
2d-pockets  of  type  2  within  the  same  three-dimensional 
pocket  G.  The  reason  for  this  is  that  D  cuts  all  across 
G  and  therefore  no  terrain  edge  can  intersect  both  D 
and  G\D.  We  conclude  that  the  triangulation  is  of  size 
0{n  log  n). 


The  naive  algorithm  for  constructing  the  triangula¬ 
tion  involves  intersecting  each  facet  of  the  terrain  with 
the  entire  outer  hierarchy,  which  certainly  takes  no  more 
than  O(n^)  time.  Within  that  running  time,  it  is  rou¬ 
tine  to  carry  out  the  entire  pruning.  It  is  possible  to 
improve  the  running  time  slightly,  by  using  fairly  com¬ 
plicated  batching  techniques  based  on  multidimensional 
searching.  It  is  not  worthwhile  pursuing  that  line  of  at¬ 
tack.  Whether  a  simple,  quasi-linear  algorithm  exists 
remains  an  open  question. 

3.2  The  Lower  Bound 

We  construct  a  convex  polytope  and  a  terrain  disjoint 
from  it,  such  that  any  convex  decomposition  of  the 
space  between  them  requires  on  the  order  of  n  log  n 
pieces,  where  n  is  the  combined  size  of  the  input. 

Building  the  Polyhedra.  The  polytope  is  a  cylin¬ 
der  with  half  of  a  regular  2m-gon  as  a  base,  where 
m  =  2P .  It  consists  of  (i)  a  horizontal  square  of  unit 
area  facing  up,  (ii)  m  slabs  facing  down,  (iii)  two  con¬ 
gruent  vertical  facets  whose  normals  are  parallel  to  the 
y-Axis  (Fig.  11). 


To  define  the  terrain  it  is  helpful  to  (conceptually) 
enclose  the  cylinder  inside  a  tall  box,  with  the  square  of 
the  cylinder  coinciding  with  the  top  of  the  box.  The  ter¬ 
rain  consists  of  the  plane  supporting  the  bottom  of  the 
box,  plus  m  blade-like  protrusions  abutting  the  slabs  of 
the  cylinder  (Fig.  11  shows  only  one  of  them).  Each 
protrusion  is  associated  with  a  slab  of  the  cylinder:  it 
is  essentially  a  trapezoid  (with  small  but  positive  thick¬ 
ness)  parallel  to  the  x-axis;  its  top  side  is  parallel  to 
a  slab  of  the  cylinder  and  comes  extremely  close  to  it 
without  touching  it.  The  trapezoids  are  in  one-to-one 
correspondence  with  the  m  slabs  of  the  cylinder.  We 
assume  that  each  trapezoid  extends  all  across  the  ver¬ 
tical  box.  This  is  why  it  is  necessary  to  make  the  box 
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tall  enough:  a  simple  calculation  shows  that  the  height 
of  the  box  should  be  on  the  order  of  m.  Clearly,  the 
total  description  size  of  the  input  scene  is  n  =  0(m). 

Our  construction  is  complete  once  we  specify  (i)  the 
permutation  that  defines  the  correspondence  between 
slabs  and  trapezoids,  and  (ii)  the  placement  of  the 
trapezoids  along  the  y-direction.  Because  the  box  is  tall 
enough,  any  permutation  on  m  elements  is  feasible.  We 
choose  the  bit-reversal  permutation  tt  =  (z'o,  •  ■  • , 
where  ik  is  1  plus  the  number  obtained  by  writing  k  in 
binary  over  p  bits  and  reversing  the  bits.  For  example, 
if  p  =  3,  we  obtain 

7r=  (1,5, 3, 7, 2, 6, 4, 8). 

Placing  Guards.  The  slabs  can  be  thought  of  as 
the  leaves  of  a  complete  binary  tree  of  depth  p.  An 
internal  node  v  thus  is  naturally  associated  with  a  se¬ 
quence  of  consecutive  slabs  Si, . .  .,Sj.  Let  W (v)  be 
the  wedge  formed  by  two  planes  tangent  to  the  cylin¬ 
der  along  the  segments  Sj_i  n  s,-  and  sj  fl  Sj+i.  We 
choose  the  planes  so  that  their  normals  bisect  the  an¬ 
gles  between  their  respective  slab  pairs.  If  Si  or  Sj  is  an 
extreme  slab  then  we  take  the  vertical  plane  support¬ 
ing  the  relevant  wall  of  the  box.  Fig.  12  shows  such 
a  wedge  (with  some  liberty  in  the  drawing  to  make  it 
clearer).  Given  a  node  v  (not  a  leaf,  not  a  parent  of  a 
leaf,  not  the  root),  consider  the  bounding  line  of  the 
wedge  W{v).  A  simple  geometric  observation  is  that 
since  the  trapezoids  are  extremely  close  to  the  slabs, 
the  line  L„  intersects  (strictly)  the  trapezoids  in  cor¬ 
respondence  with  the  slabs  Sj,...,Sj,  and  only  those. 
Furthermore,  because  of  the  bit-reversal  permutation, 
as  we  move  along  the  line  we  encounter  trapezoids  from 
the  left  and  right  children  of  v  in  alternation.  The  inter¬ 
section  with  the  trapezoids  are  such  tiny  segments  that 
we  might  as  well  as  think  of  them  as  points  qo, . . . ,  qj -i 
(given  in  ascending  y-order).  The  midpoint  of  a  seg¬ 
ment  of  the  form  qiqi+i,  for  0  <  /  <  j  —  i,  is  called  a 
guard  (the  crosses  in  Fig.  12).  Obviously,  there  is  a  total 
of  about  mlogm  guards  overall.  The  lower  bound  will 
follow  directly  from  the  fact  that  by  a  judicious  place¬ 
ment  of  the  trapezoids  in  the  y-direction  we  can  ensure 
that  no  two  guards  can  see  each  other. 

Instead  of  assigning  each  trapezoid  an  explicit  y- 
coordinate,  which  would  complicate  the  proof,  we  de¬ 
fine  their  placement  in  a  recursive  manner.  We  process 
the  tree  of  trapezoids  bottom-up.  For  a  node  to  be 
processed  means  that  the  entire  block  of  trapezoids  as¬ 
sociated  with  it  has  been  placed,  up  to  translation  (i.e., 
its  degrees  of  freedom  have  been  reduced  to  one).  Since 
we  proceed  bottom-up,  to  process  v  simply  means  de¬ 
ciding  the  relative  positioning  of  the  blocks  associated 
with  the  two  children  of  v.  To  simplify  matters  further, 
we  make  the  placements  of  all  the  blocks  at  a  given  level 
in  the  tree  identical  up  to  translation.  Thus,  it  suffices 


to  specify  the  placement  of  the  nodes  on  the  leftmost 
path  of  the  tree,  vq,  .  ■ Vp. 


Figure  13. 


Let  Ek  =  (c/m)*^,  for  some  small  enough  constant 
c  >  0.  For  any  1  <  fc  <  p,  we  place  the  block  for  the 
right  child  of  Vk  ahead  of  the  block  for  the  left  child  of  Vk 
by  precisely  Ck  (Fig.  13).  Observe  that  it  is  always  easy 
(for  c  small  enough)  to  fit  the  entire  block  of  trapezoids 
within  the  box.  The  lower  bound  follows  from  the  next 
lemma. 

Lemma  3.2  No  two  guards  can  see  each  other. 

Proof:  If  the  two  guards  p,  q  correspond  to  nodes  v,  w 
that  are  not  in  any  ancestral  relationship,  then  the  con¬ 
vex  cylinder  itself  obstructs  their  mutual  visibility.  If 
V  =  w,  then  the  trapezoids  passing  through  the  points 
qQ,qi,  etc.,  hide  the  guards  from  one  another.  So,  the 
only  case  remaining  is  if,  say,  v  is  an  ancestor  of  w.  Let 
j  and  k  {j  <  k)  be  the  heights  of  w  and  v,  respectively. 
If  we  project  trapezoids  and  guards  onto  the  sz-plane, 
we  find  that  not  only  the  projection  of  p  lies  in  that 
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of  every  trapezoid  associated  with  v,  but  actually  the 
projected  point  is  at  least  at  a  distance  proportional  to 
D/m  away  from  any  edge  of  the  trapezoid’s  projection 
(Fig.  14),  where  D  is  the  distance  from  the  bounding 
line  Z/„  to  either  of  the  two  lines  of  contact  between  the 
wedge  W(v)  and  the  cylinder. 


P 

Figure  14. 


Thus,  there  exist  two  disks  of  radius  Q(D/m)  which 
any  infinite  ray  of  visibility  from  p  must  necessarily 
avoid:  these  disks  are  centered  on  and  normal  to  the 
line  it,!  and  they  lie  on  each  side  of  p  at  a  distance 
£jt/2  from  it  (Fig.  15). 


Let  p'  be  the  point  of  nearest  to  p  (Fig.  16).  Be¬ 
cause  of  the  two  obstructing  disks,  p  cannot  see  any 
point  of  itu  away  from  p'  by  more  than  0{\pp'\ekm/ D), 
which  is  0{c\pp'\ej/D).  But  \pp'\  is  at  most  D,  so  by 
setting  c  small  enough,  we  find  that  no  point  on  itu 
away  from  p'  by  more  than  ej/3  can  see  p.  But  q  lies 
at  least  {ej  —  Ek)/ 2  away  from  p' ,  which  gives  a  contra¬ 
diction.  □ 


Figure  16. 


4  Genus  and  Reflex  Angles 

Intuitively,  it  seems  clear  that  a  polyhedron  with  large 
genus  should  have  many  reflex  dihedral  angles  (we  call 
the  corresponding  edges  reflex).  Recall  that  a  poly¬ 
hedron  of  genus  p  is  a  3-manifold  with  piecewise-linear 
boundary  that  is  homeomorphic  to  a  torus  with  g  holes. 
If  r  is  the  number  of  reflex  edges,  we  will  show  that 

g<r+l.  (1) 

The  curvature  at  the  vertex  v  is  defined  to  be 

where  di  is  the  angle  between  two  consecutive  edges 
incident  to  v.  The  Gauss-Bonnet  formula  relates  the 
total  curvature  to  the  genus: 

J2ky  =  2-2g, 

V 

Inequality  (1)  follows  easily  from  the  next  lemma. 

Lemma  4.1  The  number  of  reflex  edges  incident  to  a 
vertex  v  is  at  least  —k^. 

Proof:  Project  the  facets  incident  to  v  onto  a  unit 
sphere  centered  at  v.  This  produces  a  simple,  closed 
curve  made  of  geodesic  pieces.  In  other  words,  it  is  a 
“polygon”  on  the  sphere  made  of  arcs  of  great  circles. 
A  reflex  edge  gives  rise  to  a  reflex  angle  on  the  polygon 
and  vice-versa.  It  is  easy  to  see  that 

L  <  27r(f2 -p  1),  (2) 

where  L  is  the  length  of  the  curve  and  R  is  its  number 
of  reflex  angles.  Indeed,  if  R  is  zero  then  the  inequality 
simply  states  the  classical  fact  that  the  curvature  at  a 
locally  convex  point  is  nonnegative.  If  R  is  nonzero. 
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then  we  can  draw  arcs  of  great  circles  from  each  reflex 
point  along  the  bisector  of  its  reflex  angle,  and  thus 
decompose  the  unit  sphere  into  at  most  +  1  convex 
regions  (i.e.,  regions  where  any  two  points  are  visible 
along  an  arc  of  great  circle  connecting  them).  The  case 
i?  =  0  can  now  be  used  /?  +  1  times  to  establish  (2). 
The  lemma  follows  immediately  from  the  fact  that 

L  =  '^6i  =  27r(l  -  k^). 

i 

□ 

Chazelle  and  Palios  [8]  have  given  an  algorithm  for 
triangulating  a  genus-0  polyhedron  with  n  vertices  and 
r  reflex  angles,  using  0{n  -h  r^)  tetrahedra.  The  al¬ 
gorithm  works  for  arbitrary  genus  g,  but  the  bound 
becomes  0{n  -t-  (r  -|-  g)"^)  tetrahedra.  The  inequality 
fif  <  r  -t- 1  implies  that  the  true  bound  is  still  0{n  +  r^). 
In  other  words,  the  complexity  bound  for  genus  0  ap¬ 
plies  to  polyhedra  of  arbitrary  genus  as  well. 
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Abstract 

This  paper  shows  that  for  any  plane  geometric  graph  Q 
with  n  vertices,  there  exists  a  triangulation  T  conforms 
to  G,  i-e.  each  edge  of  G  is  the  union  of  some  edges  of  T , 
where  T  has  O(n^)  vertices  with  angles  of  its  triangles 
measuring  no  more  than  Additionally,  T  can  be 

computed  in  0(n^  logn)  time.  The  quadratic  bound  on 
the  size  of  its  vertex  set  is  within  a  constant  factor  of 
worst  case  optimal. 

1  Introduction 

In  many  engineering  applications  such  as  finite  ele¬ 
ment  analysis  [Cave74],  surface  interpolation  [Laws77], 
and  shape  reconstruction  [Bois88],  two  and  higher¬ 
dimensional  domains  are  frequently  decomposed  into 
small  and  simple  elements  before  numerical  computa¬ 
tion.  One  particularly  important  class  of  decomposi¬ 
tions  is  the  so-called  simplicial  cell  complex,  usually 
referred  to  as  triangulation,  where  the  domain  is  de¬ 
composed  into  simplices — triangles  in  two  and  tetrahe- 
dra  in  three  dimensions — so  that  the  intersection  of  two 
simplices  is  either  empty  or  a  face  of  both. 

For  a  given  domain  such  as  a  polygonal  region  or, 
more  generally,  a  plane  graph  with  straight  edges,  there 
are  clearly  many  ways  to  decompose  or  triangulate  it 
into  a  triangulation,  with  or  without  the  addition  of 
new  vertices  to  the  domain.  But  accuracy  and  efficiency 
of  an  engineering  computation  impose  optimal  criteria 
such  as  the  triangle  shape  (with  bounds  on  triangle  an¬ 
gles  away  from  0  and  tt)  and  vertex  size  (with  bounds 
on  the  number  of  new  vertices) ,  respectively.  These  cri- 
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teria  of  shape  and  size  are  somewhat  conflicting — good 
triangles  may  be  the  result  of  adding  new  vertices.  Au¬ 
tomatic  generation  of  triangulations  has  been  a  subject 
for  research  since  the  1960s.  Nevertheless,  many  in¬ 
teresting  results,  both  practical  and  theoretical  ones, 
have  been  discovered  in  the  recent  years  too;  see,  for 
example,  [BDE92,  BEG90,  BeEp91,  BMR94,  Dey91, 
MeSo92,  Mitc93a,  Mitc93b,  Mitc94,  Rupp93,  Tan93] 
and  the  survey  [BeEp92]. 

This  paper  considers  triangulating  a  plane  geomet¬ 
ric  graph  (i.e.  plane  graph  with  straight  edges)  using 
triangles  with  no  large  angles.  Such  resulting  triangu¬ 
lations  have  potential  applications  in  the  area  of  finite 
element  and  surface  approximation;  see,  for  example, 
[BaAz76,  BaLi84,  Greg75].  It  shows  that  triangulating 
a  plane  geometric  graph  of  n  vertices  using  angles  no 
larger  than  requires  at  most  O(n^)  new  vertices. 
A  previous  result  of  Mitchell  [Mitc93b]  achieves  angle 
bound  of  In  with  O(n^logn)  new  vertices.  This  pa¬ 
per  thus  provides  significantly  better  bounds  on  triangle 
shape  and  vertex  size.  We  also  note  that  the  quadratic 
bound  on  the  vertex  size  is  within  a  constant  factor  of 
worst  case  optimal.  Our  construction  is  conceptually 
simple  though  its  proof  is  quite  involved.  It  has  some 
flavor  of  Edelsbrunner  and  Tan  [EdTa93]  and  Mitchell 
[Mitc93b].  The  complete  proof  indeed  uses  some  results 
derived  from  the  latter. 

The  paper  is  organized  as  follows.  Section  2  formal¬ 
izes  the  problem.  Section  3  provides  the  outline  of  the 
method  that  proves  the  quadratic  bound,  and  Sections 
4  to  9  discuss  its  details  for  angle  bound  a  =  fw.  Then, 
Section  10  provides  details  on  implementing  the  proof 
and  extending  it  to  a  =  y^TT,  and  Section  11  concludes 
the  paper.  This  extended  abstract  omits  most  proofs; 
the  fufl  details  can  be  found  in  [Tan94]. 

2  Some  Terminology 

We  first  introduce  some  notions,  then  define  the  prob¬ 
lem. 
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Plane  Geometric  Graphs.  Let  S  be  a  set  of  n  points 
or  vertices  in  3?^.  An  edge  is  a  closed  line  segment 
connecting  two  points.  Let  £'  be  a  collection  of  edges 
determined  by  vertices  of  S.  Then  Q  =  (5,  E)  is  a  plane 
geometric  graph  if 

(i)  no  edge  contains  a  vertex  other  than  its  endpoints, 
that  is,  abnS  =  {a,  6}  for  every  edge  ab  £  E,  and 

(ii)  no  two  edges  cross,  that  is,  a6ncd  G  {a,  b}  for  every 
two  edges  ab  ^  cd  in  E. 

One  example  of  a  plane  geometric  graph  is  a  poly¬ 
gon  where  S  and  E  form  a  single  cycle.  The  cycle  is 
the  boundary  of  the  polygon.  It  subdivides  3?^  into  a 
bounded  face,  its  interior,  and  an  unbounded  face,  its 
exterior.  A  polygon  with  four  edges  or  sides  is  a  quadri¬ 
lateral. 

Tri angulations.  A  triangulation  is  a  plane  geometric 
graph  T  =  (5,  E)  so  that  E  is  maximal.  By  maxi- 
mality,  edges  in  E  bound  the  convex  hull  ch{S)  of  S 
(i.e.  the  smallest  convex  set  in  3?^  that  contains  S)  and 
subdivide  its  interior  into  disjoint  faces  bounded  by  tri¬ 
angles.  These  triangles  of  T  are  referred  simply  as  tri¬ 
angles,  and  their  angles  as  angles.  With  reference  to  a 
polygon,  we  talk  about  its  triangulation  as  restricted  to 
only  its  interior. 

Conforming  Triangulations.  A  plane  geometric 
graph  T  =  {S',E')  conforms  to  another  such  graph 
Q  =  (5,  £■)  if  5  C  5'  C  ch{S)  and  each  edge  in  £  is  a 
union  of  edges  in  E' .  Each  vertex  in  S'  —  S,  also  termed 
a  Steiner  vertex,  may  or  may  not  subdivide  an  edge  of 
E.  A  triangulation  T  conforms  to  a  plane  geometric 
graph  Q  is  called  a  conforming  triangulation  of  Q.  The 
problem  studied  here  is  as  follows: 

Given  a  plane  geometric  graph  Q  =  {S,E),  find 
a  conforming  triangulation  T  of  G  with  small 
vertex  set  and  with  angles  measuring  at  most  a 
where  a  <  tt. 

Previous  Results.  Two  results  are  directly  related  to 
the  problem  studied  here.  Bern,  Mitchell  and  Ruppert 
[BMR94]  have  recently  shown  how  to  augment  a  poly¬ 
gon  (with  or  without  holes)  by  linear  number  of  Steiner 
vertices  so  that  it  can  be  triangulated  with  no  obtuse 
angles.  For  a  general  plane  geometric  graph  with  n  ver¬ 
tices,  the  recent  result  of  Mitchell  [Mitc93b]  constructs 
0(n^  logn)  Steiner  vertices  to  achieve  the  angle  bound 
of  Itt.  This  bound  on  the  cardinality  of  Steiner  ver¬ 
tices  is  within  a  logarithmic  factor  from  the  lower  bound 
of  n(n^);  see  [BeEp91,  Mitc93b]  for  discussions  on  the 
lower  bound  construction  (for  any  angle  bounded  away 
from  tt)  due  to  M.  S.  Paterson. 


3  The  Outline  of  Construction 

Given  a  plane  geometric  graph  Q  —  {S,  E)  where  |5|  = 
n,  the  algorithm  of  Edelsbrunner,  Tan  and  Waupotitsch 
[ETW92]  can  augment  it  by  edges  to  a  triangulation 
that  minimizes  its  maximum  angle  over  all  possible  aug¬ 
mentations.  This  triangulation  is  the  so-called  min-max 
angle  triangulation  T  of  Q.  So,  for  a  given  Q,  we  are 
done  if  T  has  angles  measuring  at  most  a,  the  targeted 
angle  bound;  otherwise,  we  proceed  to  refine  this  trian¬ 
gulation  by  adding  Steiner  vertices  until  it  has  no  bad 
angle.  We  say  an  angle  is  bad  if  it  has  measure  larger 
than  a;  otherwise  it  is  good.  A  good  triangle  is  a  triangle 
with  only  good  angles  whereas  a  bad  triangle  has  some 
bad  angle.  A  polygon  (or  polygonal  region)  that  can 
be  triangulated  with  no  bad  triangles  is  said  to  have  a 
good  triangulation.  For  our  discussion  till  Section  10,  a 
is  set  to  |7r. 

Let  us  assume  now  that  T  has  some  bad  angles.  Sup¬ 
pose  Apqr  is  one  with  a  bad  angle  at  q.  Then,  in  refining 
T,  we  need  to  find  a  vertex  t  so  that  qt  subdivides  the 
angle  at  q  into  two  smaller  ones.  Notice  that  such  a  t 
may  not  be  any  of  the  existing  vertices  of  T.  So,  t  may 
possibly  be  a  Steiner  vertex.  Also,  using  only  one  ver¬ 
tex  t  inside  Apqr  does  not  improve  the  situation  since 
Iptr  >  Ipqr.  In  general,  t  on  pr  is  chosen  to  subdivide 
the  angle  at  g  as  well  as  Apqr  into  two  new  triangles. 
Such  a  refinement  results  in  the  other  triangle  sharing 
pr,  say  Aprs,  to  have  a  vertex  on  pr.  So  a  subdivision 
on  Aprs  is  now  necessary  to  get  rid  of  the  “large  angle” 
of  n  at  t.  This  situation  may  continue  to  propagate  to 
other  triangles.  The  main  idea  of  our  construction  is  to 
bound  such  propagations  of  vertices. 

The  proposed  construction  has  six  major  phases.  The 
first  phase  (Section  4)  subdivides  each  triangle  of  T  into 
three  quadrilaterals  by  adding  edges,  termed  spokes, 
perpendicular  to  its  three  sides  from  the  center  of  its  in¬ 
scribing  circle.  Some  of  these  quadrilaterals  are  marked 
as  fences-,  they  together  with  traps  set  up  in  the  sec¬ 
ond  phase  (Section  5)  help  in  controlling  propagations 
of  vertices.  The  third  phase  (Section  6)  propagates  ver¬ 
tices  of  spokes  and  traps.  From  one  vertex,  we  may  get 
a  sequence  of  vertices  where  one  propagates  to  the  next. 
This  sequence  can  be  viewed  as  a  kind  of  path,  called 
propagation  path,  whose  vertices  are  possibly  Steiner 
vertices.  The  fourth  phase  (Section  7)  removes  redun¬ 
dant  subpaths  and  the  fifth  phase  (Section  8)  untan¬ 
gles  crossings  of  paths.  The  remaining  vertices  of  paths 
form  the  set  of  required  Steiner  vertices,  and  the  final 
phase  (Section  9)  triangulates  each  quadrilateral  having 
Steiner  vertices  on  its  boundary  using  edges  of  paths  as 
guides. 

We  note  that  the  construction  above  as  it  is  may  add 
unnecessarily  many  Steiner  vertices.  This  can  neverthe- 
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less  be  avoided  by  modifying  some  steps  as  described  in 
Section  10.  Let  us  next  define  a  few  more  notions  before 
discussing  the  details  of  the  construction. 

Fences  and  Traps.  A  fence  is  a  polygon  that  has  a 
good  triangulation  even  if  its  edges  contain  Steiner  ver¬ 
tices.  A  trap  is  a  polygon  with  a  base  edge  ab  and  dead 
ends  a',  b'  where  a  and  6  are  connected  to  the  respective 
dead  ends  via  paths  Paa>  and  P^b'-  Other  than  a'  and  b' 
may  be  the  same  point,  Paa'  and  Pbb',  also  termed  the 
boundary  paths  of  the  trap,  do  not  intersect.  Addition¬ 
ally,  Paa'  with  vertices  a  =  po,pi,p2, .  ■ .  ,Pk  =  a'  and 
Pbb'  with  vertices  6  =  go,  Qi,  g2,  ,1k  =  b'  are  such  that 

Pi-iPi  and  qi-iQi,  for  1  <  i  <  A:,  lie  in  the  closure  of 
some  quadrilateral.  This  condition  implies  that  a  trap 
encloses  neither  vertices  of  S  nor  endpoints  of  spokes. 
Vertices  of  a  trap  are  not  necessarily  Steiner  vertices  in 
the  final  construction. 

The  construction  of  fences  and  traps  in  Sections  4 
and  5  considers  the  following  issues: 

•  building  linear  number  of  fences  and  traps,  and 

•  ensuring  each  trap  has  at  most  linear  number  of 
vertices. 

Paths.  A  path  with  origin  po  is  a  sequence  of  vertices 
po,pi, . .  .,pi,pi+i, . . .  where  p,-  and  p,+i  are  points  on 
two  different  edges  of  some  quadrilateral.  Two  consec¬ 
utive  vertices  pi,p,+i  of  the  path  define  an  edge  p,p,+i 
or,  more  specifically,  a  directed  edge  p^Vi+i  with  tail  p,- 
and  head  p,+i.  Each  directed  edge  defines  four  angles 
at  its  two  endpoints  with  edges  of  the  quadrilateral  con¬ 
taining  it.  For  our  purposes,  we  use  a  path  to  refer  to 
one  where  the  two  angles  at  the  tail  of  each  directed 
edge  are  good,  and  analogously  an  inverse  path  to  one 
where  the  two  angles  at  the  head  of  each  directed  edge 
are  good.  And,  vertices  of  paths  and  inverse  paths  are 
always  some  points  on  edges  of  T ■ 

The  length  of  a  path  or  an  inverse  path  is  its  number 
of  vertices  minus  one,  and  the  direction  is  the  direction 
of  its  directed  edges.  Two  paths  are  coherent  if  they  tra¬ 
verse  through  the  same  set  of  quadrilateral  edges,  and 
are  parallel  if  they  are  coherent  and  their  two  edges  in 
the  same  quadrilateral  are  parallel.  We  also  use  coher¬ 
ent  and  parallel  to  describe  two  inverse  paths,  or  one 
path  with  an  inverse  path. 

Two  directed  edges  ab  and  cd  (of  some  paths)  in 
quadrilateral  pq'cr'  are  vertex  disjoint  if  {a,  6}n{c,  d]  = 
0,  and  cross  if  they  are  vertex  disjoint  and  abDcd^f:  0. 
They  are  said  to  be  of  the  same  orientation  if  a  and  c 
are  on  the  same  edge  of  pq'cr'  (so  are  b  and  d) ,  otherwise 
opposite  orientation. 

Major  issues  regarding  propagation  paths  in  Section 
6  are; 


•  tracing  linear  number  of  paths,  and 

•  ensuring  each  path  has  length  at  mtKt  linear. 

Pebbles.  Propagation  paths  leave  traces  of  its  vertices 
on  boundaries  of  quadrilaterals.  A  quadrilateral  with  all 
traces  on  its  boundary  taken  as  Steiner  vertices  may  not 
have  a  good  triangulation  for  traces  that  are  too  near  to 
each  other  (unless  the  quadrilateral  is  a  fence).  Thus, 
portions  of  some  propagation  paths  need  to  be  removed 
and  the  remaining  portions  reconnected  to  nearby  ones 
in  the  same  direction.  The  set  of  propagation  paths  re¬ 
mained  plus  the  set  of  centers  of  inscribing  circles  form 
the  required  set  of  Steiner  vertices  (or  pebbles)  for  T. 

Two  major  concerns  of  Sections  7  to  9  are: 

•  ensuring  paths  terminate  on  convex  hull  edges, 
fence  edges,  endpoints  of  spokes  or  dead  ends,  and 

•  triangulating  quadrilaterals  whose  boundaries  con¬ 
tain  Steiner  vertices  using  no  bad  triangles. 

Analyzing  Cost.  Once  all  the  above  issues  are  re¬ 
solved,  the  number  of  Steiner  vertices  used  is  clearly 
quadratic  in  the  number  of  vertices  of  T. 

4  Erecting  Fences 

The  purpose  of  a  fence  is  to  block  or  terminate  propa¬ 
gations  of  vertices.  We  can  consider  convex  hull  edges 
of  T  =  (5,  E)  as  degenerate  fences  since  no  propagation 
of  vertices  out  of  the  convex  hull  is  necessary.  They  are, 
however,  insufficient  to  block  all  propagations;  we  need 
the  following  type  of  fences  plus  traps  discussed  in  the 
next  section.  Throughout  the  discussion,  we  refer  to 
some  specific  measures  of  angle.  As  mentioned,  a  =  |7r 
is  the  targeted  angle  bound.  Let  (f>  =  ^  and  =  f.  No¬ 
tice  that  a  —  Tr  —  j3  and  <p  +  2P  —  tt.  We  use  the  term 
isosceles  triangle  to  mean  a  triangle  with  apex  angle  4> 
and  two  base  angles  /?. 

Consider  a  triangle  pqr  G  T.  Let  c  be  the  center  of  its 
inscribing  circle,  i.e.  c  is  the  point  common  to  the  three 
angle  bisectors.  Let  p',q',r'  be  the  three  pverpendicular 
projections  of  c  onto  respectively  qr,rp,pq-,  see  Figure 
4.1.  Edges  cp',cq’  and  cr'  are  spokes  of  Apqr. 

Step  1.  Erecting  Fences. 

For  each  /\pqr  G  T  with  the  largest  angle  at  say  q, 
add  the  center  c  of  its  inscribing  circle  as  a  Steiner 
vertex.  Also,  add  spokes  cr',  cp'  and  cq'  to  sub¬ 
divide  Apqr  into  three  quadrilaterals  pq'cr' ,  rq'cp' , 
and  qr'cp'.  Then,  mark  qr'cp'  as  a  fence.  And, 
mark  pq'cr'  if  Ip  >  /?,  and  rq'cp'  if  Zr  >  /?. 


The  above  step  marks  quadrilaterals  that  indeed  sat¬ 
isfy  the  definition  of  a  fence  as  long  as  no  Steiner  vertices 
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Figure  4.1:  Trianglep^r  is  subdivided  into  three  quadrilaterals 
pq'cr' ,  rq'cp'  and  qr'cp' .  It  is  simple  to  check  that  Ipr'|  = 
|P9'|.  kp'l  =  kg'l,  kr'l  =  |gp'|,  and  |cr'|  =  |cp'|  =  |cg'|  =  p 
where  p  is  the  radius  of  the  inscribing  circle  of  Apqr. 

will  be  added  to  spokes  of  triangles.  This  follows  imme¬ 
diately  from  the  next  lemma.  Those  quadrilaterals  that 
are  not  marked  as  fences  are  referred  cis  non-fences. 

Lemma  4.1  Suppose  quadrilateral  pq'cr'  is  such  that 
Lq'  =  Ir'  =  ^  and  Zp  >  /?.  Then  it  has  a  good  triangu¬ 
lation  even  if  pr'  and  pq'  contain  some  Steiner  vertices. 
0 

5  Setting  Traps 

To  supplement  fences  that  block  some  propagations,  we 
step  up  traps  whose  dead  ends  can  serve  to  terminate 
the  other  propagations.  To  do  this,  we  first  identify 
portions  of  quadrilateral  edges,  other  than  spokes,  that 
must  be  bases  of  traps.  Clearly,  we  need  only  consider 
those  that  are  not  parts  of  the  boundaries  of  fences. 
We  thus  talk  about  quadrilateral  pq'cr'  in  the  following 
as  one  with  right  angles  at  r'  and  q'  and  Zp  <  /?,  un¬ 
less  stated  otherwise.  Also,  we  assume  pq'cr'  lies  inside 
Apqr  of  T. 

To  plan  for  traps,  we  introduce  a  special  type  of  in¬ 
verse  path,  termed  i-path,  where  all  its  directed  edges 
define  good  angles  of  (3  at  their  heads  and  all  such  good 
angles  are  on  the  same  side  of  the  inverse  path.  We  refer 
to  vertices  of  i-paths  as  i-vertices.  These  vertices  may 
not  be  Steiner  vertices  in  the  final  construction.  In  the 
next  step,  each  i-path  constructed  has  length  bounded 
by  K  where  K  is  one  more  than  four  times  the  number 
of  edges  of  T. 


Step  2.  Planning  Traps. 

Consider  ecich  quadrilateral  (non-fence)  pq'cr'-,  see 
Figure  5.1.  Trace  an  i-path  at  r'  with  its  second 
vertex  r"  on  pq'  so  that  /.r'r"q'  =  /?.  Similarly, 
tr8ice  an  i-path  at  q'  with  its  second  vertex  q"  on  pr' 
so  that  Lq'q"r'  =  0.  Suppose  ri  and  q^  are  vertices 


on  pq  and  pr,  respectively  of  the  two  quadrilaterals 
adjeicent  to  pq'cr'  and  incident  to  p.  If  92  lies  on 
pr" ,  trace  out  two  i-paths  at  92  such  that  the  first 
edges  of  the  i-paths  meet  pr'  at  two  points  that 
form  an  isosceles  triangle  with  apex  at  92-  Simi¬ 
larly,  trace  out  two  i-paths  at  ri  if  ri  lies  on  pq". 
In  all  the  above,  we  terminate  an  i-path  when  it  hits 
either  a  convex  hull  edge,  a  fence  edge,  a  spoke,  or 
when  its  length  has  reached  K. 


Figure  5.1:  For  pq'cr',  one  i-path  each  is  traced  at  r'  and  9', 
and  two  at  92  but  none  at  ri. 

The  above  step  traces  out  a  maximum  of  four  i-paths  at 
each  endpoint  of  a  spoke — one  each  into  the  quadrilat¬ 
erals  (non-fences)  where  the  endpoint  is  a  vertex,  and 
two  into  the  other  quadrilateral  with  the  endpoint  on 
one  of  its  edges.  Notice  that  the  latter  two  i-paths  at 
an  endpoint  may  not  be  coherent,  though  they  started 
off  by  moving  to  the  same  edge.  With  all  the  i-vertices 
in  place,  we  can  identify  (next  two  paragraphs)  some 
portions  of  quadrilateral  edges  as  bases  of  traps  to  be 
constructed. 

Consider  a  directed  edge  pip,-.).i  of  an  i-path  inside 
a  quadrilateral  pq'cr'  with  p,-  €  pr'  and  p,+i  €  pq'. 
The  head  pj+i  subdivides  pq'  into  two  segments:  one 
that  form  an  angle  of  0  with  PiPi+i  and  the  other  of 
IT  —  0.  The  former  is  a  positive  segment  and  the  latter 
(excluding  point  Pi+i)  a  negative  segment.  The  collec¬ 
tion  of  heads  of  directed  edges  of  i-paths  (inside  pq'cr') 
on  pq'  subdivide  points  on  pq'  into  two  types:  a  p>os- 
itive  point  is  one  in  at  least  a  positive  segment,  and 
a  negative  point  otherwise.  Points  p  and  9'  are  (arbi¬ 
trarily)  said  to  be  positive.  By  the  formulation,  pq'  is 
subdivided  into  three  (possibly  degenerate)  portions  by 
markers  say  mi  and  m2  where  mi  6  pm2.  And,  the 
two  portions  pmy  and  m2q'  are  I-types  containing  only 
positive  points,  and  mi  m2  (excluding  points  mi  and 
m2)  is  0-type  containing  only  negative  points. 
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Notice  that  the  above  description  of  an  edge  is  with 
reference  to  a  quadrilateral  incident  to  it.  So,  an  edge  is 
subdivided  into  0-types  and  I-types  on  its  two  sides  due 
to  at  most  three  incident  quadrilaterals.  So  there  are 
at  most  three  portions  of  O- types.  These  are  possible 
bases  of  traps  to  be  constructed.  Let  us  assume  without 
loss  of  generality  that  T  contains  no  horizontal  edges. 


Step  3.  Setting  Traps. 

For  each  vertex  p  E.  S,  consider  the  at  most  two 
quadrilaterals  that  overlap  an  arbitrarily  small  hor¬ 
izontal  line  segment  having  p  eis  an  interior  point. 
Pick  from  each  one  edge  say  pq'  as  a  candidate  edge 
to  construct  traps;  for  each  quadrilateral  incident 
to  pq',  construct  a  trap  with  base  xy  C  pq'  into  the 
quadrilateral  where  xy  is  0-type. 


The  above  construction  of  a  trap  with  base  xy  on  pq' 
in  the  direction  into  piq'iCir'^  is  as  follows.  We  need  the 
notion  of  a  t-path  (the  reverse  of  an  i-path).  A  t-path 
is  a  path  where  all  its  directed  edges  define  good  angles 
of  /?  at  their  tails  and  all  such  good  angles  are  on  the 
same  side  of  the  path. 

Trace  two  (coherent)  t-paths  at  x  and  y  into 
Piq'iCir'i  so  that  those  good  angles  of/?  at  tails 
of  directed  edges  are  on  the  same  side  with  re¬ 
spect  to  each  t-path.  Both  t-paths  continue  until 
they  have  just  reached  say  xi  (possibly  x)  and 
yi  (possibly  y),  on  piqj  of  quadrilateral  piqlcir'i 
where  one  of  the  following  is  true: 

1.  the  t-paths  intersect  at  a  point,  say  zi,  in 
the  interior  of  piqlcirj  when  continue  to  be 
extended; 

2.  piq'iCir'i  is  a  fence  or  is  a  convex  hull 
edge;  or 

3.  piri  is  the  very  first  edge  visited  for  the  fifth 
times  by  the  t-paths  where  Apiriqi  E  T 
contains  piq'iCir'i. 

For  cases  2  and  3,  xi  and  yi  are  the  dead  ends 
and  both  t-paths  are  the  boundary  paths  of  the 
trap.  For  case  1,  consider  z  E  pir'i  where  zzi  is 
perpendicular  to  xiyi.  We  take  z  as  the  dead  end 
if  zzi  does  not  intersect  c;r,';  otherwise,  take  r,' 
as  the  dead  end.  And,  we  complete  the  boundary 
paths  of  the  trap  by  extending  both  t-paths  till 
the  dead  end. 

Though  case  1  modifies  t-paths  slightly  in  constructing 
traps,  it  is  easy  to  check  that  each  trap  does  have  its 
base  connected  to  its  dead  end(s)  by  two  paths;  see  also 
the  next  lemma. 


Lemma  5.1  In  the  construction  of  a  trap  with  base 
xy,  the  two  t-paths  from  x  and  y  are  coherent.  0 

Lemma  5.2  The  number  of  traps  constructed  is  at 
most  6n.  0 

Lemma  5.3  Ecich  trap  has  at  most  K  edges.  0 

6  Propagating  Paths 

The  necessity  to  propagate  vertices  to  compute  quality 
triangulation  is  a  consequence  of  the  following  simple 
lemma,  which  was  introduced  in  Edelsbrunner,  Tan  and 
Waupotitsch  [ETW92]. 

Lemma  6.1  If  xy  is  an  edge  in  a  triangulation  T  of 
a  point  set  S,  then  p(T)  >  max,g5  Ixsy  where  p{T) 
denotes  the  largest  angle  in  T.  0 

In  effect,  propagation  is  to  subdivide  each  long  edge 
into  small  ones  so  to  remove  large  angles  extended  by 
the  edge. 

To  trace  a  path,  we  adapt  a  few  terminology  intro¬ 
duced  in  [Mitc93b].  Consider  a  quadrilateral  p^'cr'  with 
Ip  <  P  and  right  angles  at  q'  and  r'.  Let  s  be  a  point 
on  pq'.  The  cone  at  s  consists  of  all  points  t  of  pq'cr' 
such  that  both  Ltsp  and  Itsq'  are  good  (i.e.  at  most 
a).  Next,  the  maw  of  s  is  the  portion  of  the  cone  at 
s  on  the  boundary  of  pq'cr' ,  except  for  s.  Notice  that 
each  point  on  the  maw  of  s  is  a  candidate  for  extending 
a  path  from  s.  Next,  the  iterative  construction  of  union 
of  cones  at  points  in  maws,  starting  with  s,  results  in 
the  so-called  horn  of  s.  Initially  at  stage  0,  the  horn  is 
the  cone  at  s.  The  horn  at  stage  f  -f  1  is  the  union  of 
cones  at  points  in  the  union  of  maws,  or  simply  maw, 
at  stage  i.  It  is  clear  that  the  two  paths  bounding  the 
horn  of  s  are  actually  t-paths  starting  at  s.  They  are 
referred  as  the  boundary  paths  of  the  horn.  The  center 
path  of  the  horn  is  the  sequence  of  line  segments  where 
each  connects  the  midpoints  of  the  maws  (which  are 
center  path  points)  for  two  consecutive  stages,  starting 
at  s.  (We  can  Eissume  because  of  Lemma  6.2  below  that 
each  maw  contains  only  points  on  one  edge  of  a  quadri¬ 
lateral,  so  the  midpoint  of  the  maw  at  a  stage  is  well 
defined.) 

Given  some  point  t  in  the  maw  of  s  at  stage  j,  it  is 
trivial  to  work  from  t  back  to  s  to  obtain  a  path  of  length 
y  -F  1  with  origin  s  and  last  vertex  t.  This  is  how  the 
propagation  phase  constructs  a  propagation  path  for  a 
vertex  s  by  generating  the  horn  of  s  iteratively  until  it 
intersects  (or  covers)  some  point  t  that  is  an  endpoint 
of  a  spoke,  a  dead  end,  or  a  point  on  a  fence  or  a  convex 
hull  edge.  Besides  these,  t  may  possibly  be  some  point 
common  to  cones  of  the  horn  from  different  stages — 
which  happens  when  the  horn  is  self-intersecting  inside 
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some  quadrilateral.  In  this  case,  the  chosen  t  has  to 
appear  as  the  last  vertex  as  well  as  some  intermediate 
vertex  of  the  constructed  propagation  path.  One  such 
possible  t  is  a  point  in  the  current  maw  of  stage  j  that 
is  also  a  center  path  point  of  stage  i  for  i  <  j  and  the 
horn  defined  from  t  contains  t.  A  propagation  path 
constructed  using  this  condition  has  length  j  +  1  and 
consists  of  a  subpath  from  s  to  t  followed  by  a  loop 
containing  t  as  derived  from  the  horn  of  t. 

Due  to  a  technical  reason  (Section  7),  we  need  to  dis¬ 
tinguish  two  types  of  self-intersecting  horns — one  inter¬ 
sects  while  passing  through  a  quadrilateral  for  the  sec¬ 
ond  time  in  the  same  orientation  to  one  intersects  while 
passing  through  in  opposite  orientation.  It  is  the  latter 
that  we  do  not  use  for  constructing  propagation  path. 
This  is  to  avoid  terminating  a  propagation  path  with  di¬ 
rected  edge  qirj  such  that  the  last  vertex  rj  (other  than 
an  origin  of  some  propagation  path)  being  also  a  tail  of 
other  directed  edge  inside  the  quadrilateral  containing 
qfrj.  (On  the  other  hand,  it  is  possible  that  a  vertex, 
other  than  dead  ends  and  endpoints  of  spokes,  is  passed 
through  by  two  different  propagation  paths  in  opposite 
directions.  This  does  not  create  technical  difficulties  for 
Section  7.  But,  for  convenience,  we  will  interpret  such 
a  vertex  as  two  nearby  vertices  in  Section  7.) 

We  say  a  propagation  path  P  terminates  properly  if 
its  last  vertex  rj  is  an  endpoint  of  a  spoke,  a  dead  end, 
a  point  on  a  fence  or  a  convex  hull  edge,  or  a  head 
common  to  another  directed  edge  of  P  or  other  propa¬ 
gation  path  inside  the  quadrilateral  containing  the  last 
directed  edge  of  P. 


Step  4.  Propagating  Paths. 

Take  endpoints  of  spokes  and  dead  ends  of  traps 
as  the  current  Steiner  vertices  of  T.  These  Steiner 
vertices,  except  for  centers  of  inscribing  circles,  are 
origins  of  propagation  paths  to  be  generated.  For 
an  endpoint  of  a  spoke  on  an  edge  of  T,  we  generate 
a  propagation  path  in  the  direction  out  of  the  tri¬ 
angle  containing  the  spoke.  For  a  dead  end  (which 
is  not  also  an  endpoint  of  a  spoke),  we  generate 
two  distinct  propagation  paths — one  each  into  the 
two  quadrilaterals  containing  the  dead  end  on  their 
boundaries.  All  propagation  paths  are  to  terminate 
properly. 


The  following  two  addenda  to  Step  4  are  to  ensure 
that  the  lengths  of  all  propagation  paths  are  bounded 
by  0(n).  Firstly,  Step  4a  is  performed  to  generate  some 
new  origins  of  propagation  paths  before  the  normal  pro¬ 
cessing  of  Step  4.  This  extra  effort  is  to  avoid  using 
horns  self-intersect  in  opposite  orientations  to  create 
propagation  paths.  Secondly,  Step  4b  handles  an  ex¬ 
ceptional  case  due  to  Step  4  so  to  cut  short  some  prop>- 


agation  paths. 


Step  4a.  Propagating  Paths — Beginning  Case. 

For  each  endpoint  s  of  a  spoke  that  lies  on  some 
edge  of  T,  we  consider  the  two  t-paths  with  origin 
s  in  the  direction  of  a  propagation  path  to  be  con¬ 
structed  for  s  (in  Step  4).  These  t-paths  sandwich 
an  angle  of  —  2/?  at  s  and  each  terminates  once  (1) 
it  visits  an  edge  of  T  twice  or  (2)  it  crosses  some 
spoke  or  arrives  at  a  fence  or  a  convex  hull  edge. 
Only  for  (1),  the  last  vertex  of  the  t-path  becomes 
an  origin  as  if  it  is  a  dead  end  which  Step  4  uses  to 
generate  two  propagation  paths. 

Step  4b.  Propagating  Paths — Exceptional  Case. 

Suppose  in  Step  4,  a  propagation  path  remains  un¬ 
determined  for  s  after  its  horn  has  since  entered  the 
base  xy  of  a  trap,  say  T,  and  traversed  the  same 
number  of  edges  as  T  to  reach  the  edge  containing 
the  two  dead  ends,  and  yd,  of  T.  Let  w  be  any 
point  in  the  intersection  of  the  horn  of  s  with  xy, 
and  let  and  b^j  be  the  two  points  of  the  bound¬ 
ary  paths  of  the  horn  of  w  (in  T)  on  XdPd-  Treat 
Ou,  and  6u,  as  two  additional  dead  ends  just  like  the 
other  dead  ends  identified  earlier  on.  So  each  is  an 
origin  of  two  propagation  paths  to  be  generated  by 
Step  4,  one  into  T  through  Xdyd  and  one  out  of 
T.  And,  we  complete  the  construction  for  s  by  a 
propagation  path  that  terminates  at  a^,  or  6u,. 


A  few  remarks  are  in  order.  First,  the  two  t-paths 
of  s  in  Step  4a  actually  bounds  the  horn  at  s  initially, 
i.e.  they  are  coherent.  But  they  are  not  necessarily 
coherent  in  extending  new  directed  edges  and  one  may 
actually  be  longer  than  the  other  eventually.  Note  that 
we  talk  of  these  t-paths  without  requiring  their  vertices 
be  Steiner  vertices  in  the  construction  so  far.  Second, 
when  both  t-paths  end  in  case  (1),  we  can  actually  cre¬ 
ate  just  one  new  dead  end  instead  of  two  as  stated.  For 
this,  we  then  fix  one  t-path  as  the  initial  part  of  the 
propagation  path  to  be  constructed  for  s,  and  apply 
Step  4  to  the  last  vertex  of  the  t-path  to  complete  the 
remaining  part  of  the  propagation  path  of  s.  This  way 
avoids  creating  two  other  propagation  paths.  Third,  we 
will  see  that  the  above  special  arrangement  of  Step  4b  is 
needed  at  most  once  for  each  trap  (with  two  dead  ends). 
So  we  generate  only  0(n)  propagation  paths  in  total. 
Fourth,  the  horn  of  ic  in  T  is  properly  contained  in  T, 
as  the  horn  of  s  does  not  meet  the  boundary  paths  of 
T\  otherwise,  we  would  have  a  propagation  path  with 
origin  s  that  terminates  properly  at  one  dead  end  of  T. 
So,  Uu,  and  b^,  are  well  defined.  Again,  we  talk  of  the 
horn  of  w  without  requiring  vertices  on  its  boundary 
paths  be  Steiner  vertices. 
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The  next  result  implies  that  once  a  horn  intersects 
some  spokes,  we  can  construct  from  the  horn  a  prop> 
agation  path  that  terminates  properly.  So,  we  avoid 
propagation  paths  that  cross  spokes. 

Lemma  6.2  Let  s  be  a  point  on  pq'  of  quadrilateral 
pq'cr'.  If  the  maw  of  s  contains  points  on  cq'  or  cif, 
then  the  maw  of  s  contains  r'.  yj 

Lemma  6.3  For  a  propagation  path  P  to  return  to  an 
edge  of  T,  P  has  to  visit  some  candidate  edge  (identified 
in  Step  3).  0 

Lemma  6.4  If  the  horn  of  s  self-intersects  in  opposite 
orientation  at  stage  i,  then  there  exists  a  propagation 
path  with  origin  s  that  terminates  properly  and  has  less 
than  f  -I-  3n  vertices.  0 

Lemma  6.5  Each  propagation  path  constructed  W 
Step  4  has  less  than  15n  vertices.  0 

Lemma  6.6  Step  4  computes  less  than  66n  propaga- 
tion  paths.  0 

7  Removing  Complications 

The  result  of  the  propagation  phase  (Step  4)  is  another 
plane  geometric  graph  conforms  to  T.  Each  vertex  s  on 
an  edge,  say  pq' ,  of  a  quadrilateral  (non-fence)  pq'cr' 
is  such  that  there  is  another  vertex  t  on  pr'  where  st 
is  an  edge  of  some  propagation  path  passing  through 
pq'cr' .  So,  either  the  two  angles  at  s  or  that  at  t  in¬ 
side  pq'cr'  are  good.  Because  of  this,  quadrilaterals 
with  vertices  on  their  boundaries  can  be  triangulated 
with  no  bad  triangles  except  when  there  are  vertices 
due  to  nearby  directed  edges  of  the  same  orientation. 
As  such,  we  use  the  following  merging  step  to  remove 
this  “complication” .  Vertices  of  pq'cr'  along  pq'  are 
labeled  as  p  =  qa,qi,q2,  ■  •  ■  ^qi  =  <}'  and  along  pr'  as 
P  =  ^oTiT2,---Tm  =  r'.  A  neighboring  pair  of  di¬ 
rected  edges  are  two  directed  edges  of  the  same  ori¬ 
entation  such  that  the  segment  defined  by  their  tails 
contains  no  other  tail. 

Step  5.  Merging  Paths. 

For  each  quadrilateral  pq'cr' ,  repeat  the  following 
so  long  there  exists  a  neighboring  pair  qirii  and 
qfrji,  with  i  <  j  and  good  Ariiqjp  and  /.riiqjq' 
or,  analogously,  a  neighboring  pair  r,g,-  and  rjqy 
with  i  <  j  and  good  Iqiirjp  and  Iqiirjr' .  We  just 
describe  the  former  case.  Let  P,  be  the  shortest 
subpath  of  some  propagation  path  P  starting  with 
qjrji  until  a  vertex  of  P  that  is  a  head  common 
to  two  (or  more)  directed  edges,  or  a  vertex  on 
a  fence  or  convex  hull  edge,  or  an  origin  of  other 


propagation  paths.  We  remove  edges  and  vertices 
of  P,  except  possibly  for  the  laist  vertex  of  P,  that  is 
shared  by  other  propagation  paths,  and  terminate 
P  by  new  edge  9jr,'. 

Vertices  of  spokes  and  of  propagation  paths  remained 
after  Step  5  form  the  final  set  of  Steiner  vertices  for  T- 

Lemma  7.1  All  propagation  paths  still  terminate 
properly  with  each  merging  of  paths.  0 

Lemma  7.2  Once  Step  5  has  completed,  any  two  di¬ 
rected  edges  of  the  same  orientation  inside  a  quadrilat¬ 
eral  do  not  cross.  0 

Lemma  7.3  Once  Step  5  has  completed,  any  two  di¬ 
rected  edges  that  are  vertex  disjoint  and  of  the  same 
orientation  inside  a  quadrilateral  define  a  region  that 
has  a  good  triangulation.  0 

8  Untangling  Crossings 

For  this  and  the  next  section,  we  view  each  directed 
edge  as  an  individual  with  two  good  angles  at  its  tail, 
without  associating  it  to  a  particular  propagation  path. 
Though  Step  5  (Section  7)  also  removes  some  crossings, 
directed  edges  of  opposite  orientations  may  still  cross 
inside  a  quadrilateral.  With  reference  to  a  quadrilat¬ 
eral  (non-fence)  pq'cr' ,  the  next  step  is  to  get  rid  of 
all  crossings  by  modifying  edges  while  maintaining  the 
following  invariants: 

1-1.  each  edge  is  assigned  an  orientation  with  its  tail  defin¬ 
ing  two  good  angles; 

1-2.  each  Steiner  vertex  is  an  endpoint  of  some  directed 
edge; 

1-3.  no  Steiner  vertex  is  a  tail  of  more  than  one  directed 
edge; 

1-4.  directed  edges  of  the  same  orientation  do  not  cross; 

1-5.  for  two  (non-crossing)  vertex  disjoint  directed  edges 
g,r,/  and  qjrji  with  i  <  j,  the  region  defined  by  these 
four  vertices  can  be  subdivided  into  two  good  trian¬ 
gles  using  riiqj.  Analogous  assertion  applies  to  two 
vertex  disjoint  directed  edges  riTqi  and  rjTqj, 

These  invariants  are  true  initially.  In  particular,  1-1,  1-2 
and  1-3  are  trivially  true,  and  1-4  and  1-5  are  true  due  to 
Lemma  7.2  and  Lemma  7.3  respectively.  In  Step  6,  the 
next  available  index  from  i  refers  to  the  smallest  index, 
starting  at  i  and  in  increasing  order,  whose  correspond¬ 
ing  vertex  can  be  used  as  the  head  of  a  new  directed 
edge  while  maintaining  1-4.  We  say  an  ordered  pair 
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(a,  6)  is  lexicographically  smaller  than  another  ordered 
pair  (o',  6')  if  a  <  o',  or  a  =  a'  and  b  <  b'. 

Step  6.  Removing  Crossings 

Consider  each  quadrilateral  pq'cr'  in  turn  to  re¬ 
move  crossings.  The  following  two  cases  are  re¬ 
peated  until  all  crossings  are  removed.  Each  iter¬ 
ation  involves  edges  g,r,j  crosses  such  that 

{i  +  j,  ii  +  ji)  is  lexicographically  the  smallest  or¬ 
dered  pair.  See  Figure  8.1. 

Case  A.  9,rij  crosses  rjqj^. 

We  first  add  qiVj  with  the  appropriate  orientation 
and  remove  qirt^  and  rjqj^.  Next,  there  are  two 
symmetrical  cases  depending  on  whether  de¬ 

fines  two  good  angles  at  qj^  or  at  r,  j .  Let  us  discuss 
only  the  former.  If  qj^  is  no  longer  a  vertex  of  any 
directed  edge,  we  add  qj^r,^  where  is  the  next 
available  index  from  ii-  Next,  if  r,-,  is  no  longer  a 
vertex  of  any  directed  edge,  we  add  ri~qj^  where  j2 
is  the  next  available  index  from  ji. 

Case  B.  r.l*?,  crosses  qj^Vj. 

We  add  qirj  if  qi  is  not  a  tail  of  any  directed  edge, 
else  add  rfqi  if  rj  is  not  a  tail  of  any  directed  edge. 
Next,  if  r,i  {qj^ ,  respectively)  is  a  vertex  of  other  di¬ 
rected  edges,  we  are  done  by  removing  r.fg,-  {qj^rj, 
respectively).  Otherwise,  move  all  directed  edges 
(inclusive  of  ri^qi)  crossing  qj^rj  with  heads  at  qt 
to  qji  where  f  =  i  +  1. 


Figure  8.1:  Since  directed  edges  of  the  same  orientation  do 
not  cross  (invariant  1-4),  the  underlying  directed  edges  based 
on  the  two  chosen  line  segments  q.r,,  and  must  have 

opposite  orientations.  Any  directed  edge  e  with  endpoint  on  r,/ 
for  j  <  i'  <  ii  must  cross  gir,,  by  the  choice  of  (i  + J,«i  -bji)- 
As  such  and  because  of  1-4,  the  other  endpoint  of  e  must  be 
for  j2  >  ji  with  strict  inequality  for  Case  B,  and  e  must 
have  the  same  orientation  as  the  directed  edge  based  on  rygj,. 
A  similar  statement  applies  to  directed  edges  with  endpoints 
on  qji  for  i  <  j'  <  ji  ■ 

It  is  clear  that  each  iteration  of  either  Case  A  or  Case 
B  may  introduce  new  crossings.  Still,  the  process  can 
be  shown  to  terminate. 


Lemma  8.1  When  applied  to  a  quadrilateral  pq'cr' , 
Step  6  terminates  by  removing  all  crossings.  0 

Lemma  8.2  Invariants  1-1  to  1-5  are  maintained  after 
each  iteration  of  Case  A.  0 

Lemma  8.3  Invariants  1-1  to  1-5  are  maintained  after 
each  iteration  of  Case  B.  0 

9  Triangulating  Quadrilaterals 

We  now  have  for  each  quadrilateral  pq'cr'  some  Steiner 
vertices  on  its  boundary  and  directed  edges  in  its  in¬ 
terior.  Each  Steiner  vertex  lies  on  some  directed  edge, 
and  each  directed  edge  defines  two  good  angles  at  its 
tail.  Since  directed  edges  do  not  cross,  they  subdivide 
pq'cr'  into  regions  of  three,  four,  or  five  sides.  We  la¬ 
bel  these  regions  in  their  natural  sequence  starting  with 
that  containing  c  as  iii  then  its  adjacent  one  cis  R2  and 
so  on  until  that  containing  p  as  Rm‘-  All  except  pos¬ 
sibly  for  R\  have  straightforward  good  triangulations. 
We  must  however  first  handle  Ri  by  merging  nearby 
regions  to  Ri  until  we  have  obtained  a  region  with  a 
good  triangulation.  As  such,  some  directed  edges  may 
not  be  in  the  final  triangulation — they  serve  as  guides 
in  our  construction. 

Step  7.  Triangulating  Quadrilaterals. 

Quadrilaterals  with  no  Steiner  vertices  or  are  fences 
can  easily  be  triangulated  with  no  bad  triangles;  re¬ 
fer  to  Lemma  4.1.  We  next  consider  each  non-fence 
pq'cr'.  Label  its  regions  subdivided  by  directed 
edges  from  that  containing  c  to  that  containing  p  as 
Ri ,  R2, . . . ,  Rm'  ■  Starting  with  7?  =  0  and  ^  =  0, 
we  then  repeat  incrementing  k  and  including  Rk 
into  Tl  until  we  can  produce  a  good  triangulation 
for  Tl  =  Then  for  the  remaining  regions 

Rj^j  =  fc  -f-  \,k  2  . . .  ,m'  of  three  or  four  sides, 

we  triangulate  each  in  a  straightforward  way. 


Lemma  9.1  Each  region  Ti/:  for  fc  =  2,  3, .  . . ,  m'  has  a 
good  triangulation.  0 

Lemma  9.2  There  exists  a.  k  <  m'  such  that 
Tl  =  uf-jJ?,-  has  a  good  triangulation.  0 

Till  here,  we  have  the  following  theorem: 

Theorem  9.3  Triangulating  a  plane  geometric  graph 
Q  =  (5,  E)  of  |S|  =  n  vertices  and  IFJI  =  0{n)  edges 
using  angles  no  larger  than  requires  O(n^)  Steiner 

vertices.  0 


10  Implementing  Construction 

In  this  section,  we  describe  an  efficient  algorithm  to 
implement  the  above  constructive  proof.  Also,  we  dis¬ 
cuss  ways  to  avoid  some  redundant  Steiner  vertices, 
and  extend  the  construction  to  a  better  angle  bound 
of  a  =  Let  us  assume  that  each  point  coordinate 
can  be  stored  in  a  constant  amount  of  storage  and  that 
basic  geometric  operations,  such  as  projecting  a  point 
onto  a  line  can  be  carried  out  in  a  constant  amount  of 
time. 

Theorem  10.1  Triangulating  a  plane  geometric  graph 
Q  =  {S,E)  of  |5|  =  n  vertices  and  IT"!  =  0(n)  edges 
using  angles  no  larger  than  requires  O(n^)  storage 
and  0(n^  log  n)  time.  PI 

Reducing  Steiner  Vertices.  The  above  construction 
generates  substantially  many  Steiner  vertices,  though 
it  achieves  the  worst  case  optimal  bound.  As  implied 
by  Lemmas  6.5  and  6.6,  the  constant  factor  in  the 
quadratic  bound  is  relatively  large.  Fortunately,  we 
see  in  the  following  a  modified  construction  that  avoids 
many  redundant  propagation  paths  and  Steiner  ver¬ 
tices.  Let  us  start  with  Step  1.  Instead  of  subdivid¬ 
ing  each  triangle  pqr  of  T  with  largest  angle  at  q  into 
three  quadrilaterals,  we  subdivide  it  into  two  or  three 
triangles  as  follows  depending  on  Iq. 

Case  a.  Iq  >  a. 

Subdivide  Apqr  into  Apqq'  and  Arqq'  where  q'  G  pr 
and  Ipqq'  =  Lrqq' .  We  treat  qq'  as  a  spoke. 

Case  b.  2P  <  Iq  <  a. 

Same  as  Case  a.  Additionally,  mark  Apqq'  as  a  fence  if 
Lp  >  /?,  and  similarly  mark  Arqq'  if  Zr  >  /?. 

Case  c.  Lq  <  2(3. 

Subdivide  Apqr  into  Apr'q',  Aqp'r',  Arq'p'  and 
Ap'q'r'  where  p'  G  qr,  q'  G  pr,  r'  G  pq  are  so  that 
pq'  =  pr',qp'  =  qr',rq'  =  rp' .  Also,  treat  edges  of 
Ap'q'r'  as  spokes,  and  mark  each  of  the  other  three  tri¬ 
angles  as  a  fence  if  its  angle  opposite  its  edge  incident 
to  Ap'q'r'  is  no  less  than  (3. 

With  this  subdivision,  we  then  plan  and  set  up  traps 
in  Step  2  and  Step  3,  and  create  other  dead  ends  in  Step 
4a.  As  before,  we  next  generate  propagation  paths  in 
Step  4  (and  Step  4b)  but  only  for  selected  vertices.  Each 
q'  obtained  in  Case  a  is  an  origin  of  a  propagation  path 
out  of  the  triangle  containing  the  spoke  with  q'  as  an 
endpoint.  We  next  consider  whether  more  propagation 
paths  are  to  be  created  for  s  where  s  is  either  q'  in 
Case  b,  p'  or  q'  or  r'  in  Case  c,  or  a  dead  end.  When  s 
terminates  some  propagation  path  P'  generated  so  far, 
we  generate  a  propagation  path  P"  for  it  following  P' 
(so  we  can  imagine  P'  to  include  also  the  part  of  P"); 


otherwise,  it  is  ignored  in  Step  4  and  is  not  considered 
a  Steiner  vertex  in  later  steps. 

Step  5  and  Step  6  are  the  same  as  before.  The  for¬ 
mer  actually  further  removes  away  some  unnecessary 
Steiner  vertices.  Note  that  a  dead  end  can  be  treated 
as  an  ordinary  vertex  subject  to  removal  by  Step  5  if 
the  dead  end  originates  one  instead  of  two  propagation 
paths  (in  opposite  directions)  at  the  completion  of  Step 
4.  A  similar  consideration  applies  to  an  endpoint  of 
a  spoke.  Finally,  Step  7  works  quite  the  same  as  be¬ 
fore:  for  a  non-fence,  we  just  use  the  straightforward 
method  in  Lemma  9.1;  otherwise,  we  have  a  fence  with 
(3  <  Ip  <  2(3  that  can  be  solved  with  Lemma  4.1.  How¬ 
ever,  there  is  one  problem — let  us  pick  Case  c  for  the 
discussion,  and  note  that  a  similar  situation  occurs  for 
Case  b.  Vertices  p',q',r'  used  in  Step  1  to  subdivide 
Apqr  into  three  triangles  are  not  necessarily  Steiner 
vertices  as  mentioned.  So  the  resulting  triangulation 
obtained  from  Lemma  9.1  or  Lemma  4.1  may  have  tri¬ 
angles  with  vertices  that  are  not  actually  created.  These 
triangles  have  to  be  discarded.  As  a  result,  we  obtain 
an  untriangulated  region  R  supported  by  edges  of  Apqr 
and  directed  edges  inside  Apqr.  It  can  be  proved  that 
R  resulted  from  Case  b  or  Case  c  always  has  a  good  tri¬ 
angulation  and  one  can  thus  be  constructed  in  constant 
time  since  R  has  at  most  six  edges.  In  summary,  the  im¬ 
provement  still  runs  with  O(n^)  storage  and  0(n^  log  n) 
time. 

Reducing  Angle  Bound.  The  angle  bound  of  |7r  can 
slightly  be  improved  to  a  =  -^Tr  (i.e.  132°),  but  with  a 
larger  constant  in  the  quadratic  bound  on  vertex  size. 
In  this  case,  isosceles  triangles  has  base  angles  (3  =  -Air 
(i.e.  48°)  and  apex  angle  c/i  =  (i.e.  84°).  All  re¬ 

sults  developed  starting  from  Section  4  are  valid  except 
for  Lemma  4.1  and  Lemma  9.2.  We  need  not  worry 
about  Lemma  9.2  since  it  is  no  longer  relevant  to  the 
above  improved  construction.  As  for  Lemma  4.1,  prob¬ 
lem  arises  when  0  <  Zp  <  2(3.  One  way  to  resolve  this  is 
to  perform  the  following  before  Step  1:  subdivide  each 
triangle  pqr  of  T  with  largest  angle  less  than  2(3  by  a 
new  vertex  s  inside  pqr  into  triangles  with  angles  at  s 
equal  to  |7r.  The  existence  of  such  a  subdivision  can 
easily  be  verified. 

11  Concluding  Remarks 

Given  a  plane  geometric  graph,  this  paper  shows  that 
a  conforming  triangulation  with  a  quadratic  bound  on 
its  vertex  set  and  the  |7r  bound  on  its  angles  can  be 
computed  in  slightly  more  than  quadratic  time.  Addi¬ 
tionally,  the  result  applies  to  the  better  angle  bound  of 
yItt,  but  with  a  larger  constant  factor  in  the  quadratic 
bound  on  the  vertex  set.  This  paper  improves  the  re- 
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suit  of  Mitchell  [Mitc93b]  by  reducing  the  asymptotic 
bound  on  the  vertex  set  by  a  logarithmic  factor  and  the 
angle  measure  by  at  least  ^ .  Further  reducing  the  angle 
bound  closer  to  ^  is  still  very  interesting. 

The  main  idea  of  the  paper  is  on  the  control  of  the 
lengths  of  propagation  paths  using  fences  and  traps. 
This  is  a  similar  strategy  used  in  Edelsbrunner  and  Tan 
[EdTa93],  where  the  corresponding  concept  is  termed 
walls.  The  current  paper  is,  however,  much  more  com¬ 
plex;  major  difficulties  include  the  construction  of  the 
linear  number  of  traps  and  the  modification  of  prop¬ 
agation  paths,  whereas  these  issues  do  not  appear  in 
[EdTa93]  due  to  the  nice  feature  of  constructing  walls 
inside  disks  that  are  mutually  disjoint  and  using  only 
non-crossing  propagation  paths. 
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Abstract 

Motivated  by  practical  VLSI  routing  applications, 
we  study  the  maximum  vertex  degree  of  a  minimum 
spanning  tree  (MST).  We  prove  that  under  the  Lp 
norm,  the  maximum,  vertex  degree  over  all  MSTs  is 
equal  to  the  Hadwiger  number  of  the  corresponding 
unit  ball;  we  show  an  even  tighter  bound  for  MSTs 
where  the  maximum  degree  is  minimized.  We  give  the 
best-known  bounds  for  the  maximum  MST  degree  for 
arbitrary  Lp  metrics  in  all  dimensions,  with  a  focus 
on  the  rectilinear  metric  in  two  and  three  dimensions. 
We  show  that  for  any  finite  set  of  points  in  the  Man¬ 
hattan  plane  there  exists  an  MST  with  maximum  de¬ 
gree  of  at  most  4,  and  for  three-dimensional  Manhat¬ 
tan  space  the  maximum  possible  degree  of  a  minimum- 
degree  MST  is  either  13  or  14. 

1  Introduction 

Minimum  spanning  tree  (MST)  construction  is  a 
classic  optimization  problem  for  which  several  efficient 
algorithms  are  known  [8]  [14]  [17].  Solutions  of  many 
other  problems  hinge  on  the  construction  of  an  MST 
as  an  intermediary  step  [3],  with  the  time  complex¬ 
ity  sometimes  depending  exponentially  on  the  MST’s 
maximum  vertex  degree,  as  in  the  algorithm  of  Geor- 
gakopoulos  and  Papadimitriou  [7].  Applications  that 
would  benefit  from  MSTs  with  low  maximum  vertex 
degree  include  Steiner  tree  approximation  [13]  as  well 
as  VLSI  global  routing  [1]  [16].  With  this  in  mind,  we 
seek  efficient  methods  to  construct  an  MST  with  low 
maximum  vertex  degree: 
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The  Bounded  Degree  Minimum  Spanning  Tree 
(BDMST)  problem:  Given  a  complete  weighted 
graph  and  an  integer  D  >  2,  find  a  minimum-cost 
spanning  tree  with  maximum  vertex  degree  <  D. 

Finding  a  BDMST  of  maximum  degree  D  —  2  is 
equivalent  to  solving  the  traveling  salesman  problem, 
which  is  known  to  be  NP-hard  [6].  Papadimitriou  and 
Vazirani  have  shown  that  that  given  a  planar  pointset, 
the  problem  of  finding  a  Euclidean  BDMST  with  D  = 
3  is  also  NP-hard  [15].  On  the  other  hand,  they  also 
note  that  a  BDMST  with  £)  =  5  in  the  Euclidean 
plane  is  actually  a  Euclidean  MST  and  can  therefore 
be  found  in  polynomial  time.  The  complexity  of  the 
BDMST  problem  when  D  =  4  remains  open.  Ho, 
Vijayan,  and  Wong  [11],  proved  that  an  MST  in  the 
rectilinear  plane  must  have  maximum  degree  of  D  <  8, 
and  state  (without  proof)  that  the  maximum  degree 
bound  may  be  improved  to  £>  <  6.  The  results  of 
Guibas  and  Stolfi  [9]  also  imply  the  D  <  8  bound. 

In  this  paper  we  settle  the  bounded-degree  MST 
problem  in  the  rectilinear  plane:  we  show  that  given  a 
planar  pointset,  the  rectilinear  BDMST  problem  with 
D  <  3  is  NP-hard,  but  that  the  rectilinear  D  >  4  case 
is  solvable  in  polynomial  time.  In  particular,  we  prove 
that  in  the  rectilinear  plane  there  always  exists  an 
MST  with  maximum  degree  of  £)  <  4,  which  is  tight. 
We  also  analyze  the  maximum  MST  degree  in  three 
dimensions  under  the  rectilinear  metric,  which  arises 
in  three-dimensional  VLSI  applications  [10],  where  we 
show  a  lower  bound  of  13  and  an  upper  bound  of  14 
on  the  maximum  MST  degree  (using  the  previously 
known  techniques,  the  best  obtainable  lower  and  up¬ 
per  bounds  for  three  dimensions  under  the  rectilinear 
metric  were  6  and  26,  respectively). 

More  generally,  for  arbitrary  dimension  and  Lp 
metrics  we  investigate  :  (i)  the  maximum  possible  ver¬ 
tex  degree  of  an  MST,  and  (ii)  the  maximum  degree 
of  MSTs  in  which  the  maximum  degree  is  minimized. 
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We  relate  the  maximum  MST  degree  under  the  Lp 
metric  to  the  so-called  Hadwiger  number  of  the  cor¬ 
responding  Lp  unit  ball.  The  relation  between  MST 
degree  and  the  packing  of  convex  sets  has  not  been 
elucidated  before,  though  Day  and  Edelsbrunner  [5] 
studied  the  related  “attractive  power”  of  a  point.  For 
general  dimension  we  give  exponential  lower  bounds 
on  the  Hadwiger  number  and  on  the  maximum  MST 
degree. 

Our  results  have  several  practical  applications.  For 
example,  our  efficient  algorithm  to  compute  an  MST 
with  low  maximum  degree  enables  an  efficient  im¬ 
plementation  of  the  Iterated  1-Steiner  algorithm  of 
Kahng  and  Robins  for  VLSI  routing  [13],  which  af¬ 
fords  a  particularly  effective  approximation  to  a  rec¬ 
tilinear  Steiner  minimal  tree  (within  0.5%  of  optimal 
for  typical  input  pointsets  [18]),  and  where  the  central 
time-consuming  loop  depends  on  the  maximum  MST 
degree  [1]  [2].  Our  results  also  have  implications  to 
newly  emerging  three-dimensional  VLSI  technologies 
[10],  as  well  as  for  any  other  algorithms  that  use  an 
MST  as  the  basis  for  some  other  construction. 

The  remainder  of  the  paper  is  as  follows.  Section  2 
establishes  the  terminology  and  relates  the  maximum 
MST  degree  to  the  Hadwiger  numbers  (the  central 
result  is  Theorem  4).  Section  3  studies  the  Lj  Had¬ 
wiger  numbers  and  the  maximum  MST  degree  for  the 
Li  metric.  Section  4  considers  the  Hadwiger  numbers 
and  the  maximum  MST  degree  for  arbitrary  Lp  met¬ 
rics.  We  conclude  in  Section  5  with  open  problems. 

2  Hadwiger  and  MST  Numbers 

A  collection  of  open  convex  sets  forms  a  packing  if 
no  two  sets  intersect;  two  sets  that  share  a  boundary 
point  in  the  packing  are  said  to  be  neighbors.  The 
Hadwiger  number  H{B)  of  an  open  convex  set  B  is 
the  maximum  number  of  neighbors  of  B  considered 
over  all  packings  of  translates  of  B  (a  translate  of  B 
is  a  congruent  copy  of  B  moved  to  another  location  in 
space  while  keeping  B’s  original  orientation). 

There  is  a  vast  literature  on  Hadwiger  numbers 
(e.g.,  Croft  et  al.  [4],  Fejes  Toth  [19]).  Most  re¬ 
sults  address  the  plane,  but  there  are  several  results 
for  higher  dimensions.  In  particular,  if  5  is  a  convex 
set  in  (i.e.,  fe-dimensional  space),  the  Hadwiger 
number  H (S)  satisfies: 

k^  +  k<  H{S)  <3*^-1 


It  is  known  that  the  regular  A;-simplex  realizes  the 
lower  bound  and  the  fc-hypercube  realizes  the  up¬ 
per  bound  [19].  Tighter  bounds  are  known  for  k- 
hyperspheres;  Wyner  [20]  showed  that  the  Hadwiger 
number  for  spheres  is  at  least  and  Ka- 

batjansky  and  Levenstein  [12]  showed  that  it  is  at 
most  Only  four  Hadwiger  numbers  for 

spheres  are  known  exactly;  these  are  the  numbers  in 
dimensions  2,  3,  8,  and  24,  and  they  are  6,  12,  240,  and 
196560,  respectively.  The  three-dimensional  Hadwiger 
number  has  a  history  dating  back  to  Newton  and  was 
only  determined  much  later. 

For  two  points  x  —  (xi,  X2,  ■  ■  ■ ,  Xk)  and  y  = 
(2/1, 2/2,  •  ■  •  >  2/ib)  fc-dimensional  space  9^^,  the  Lp  dis- 

tance  between  x  and  y  is  \\xy\\p  =  \xi  —  yi]^- 

For  convenience,  if  the  subscript  p  is  omitted,  the  rec¬ 
tilinear  metric  is  assumed  (i.e.,  p  =  1).  Let  B{k,p,x) 
denote  an  open  Lp  unit  ball  centered  at  a  point  x  in 
(we  use  to  denote  fc-dimensional  space  where 
distances  are  computed  under  the  Lp  metric).  When 
X  is  the  origin,  B{k,p,  x)  is  denoted  as  B{k,p),  and  we 
use  H{k,p)  to  denote  the  Hadwiger  number  o{B{k,p). 

Let  I{k,p)  be  the  maximum  number  of  points  that 
can  be  placed  on  the  boundary  of  an  Lp  ^-dimensional 
unit  ball  so  that  each  pair  of  points  is  at  least  a  unit 
apart.  With  respect  to  a  finite  set  of  points  P  C  ^p 
and  an  MST  T  for  P,  let  p{k,p,P,T)  be  the  maxi¬ 
mum  vertex  degree  of  T  (also  referred  to  as  simply 
the  degree  of  T).  Let  p{k,p,P)  =  maxp{k,p,  P,T), 
where  r  is  the  set  of  all  MSTs  for  P ,  and  let 

p(k,p)=  max  p{k,p,P).  In  other  words,  v{k,p,P) 
PCSt’' 

denotes  the  maximum  degree  of  any  MST  over  P,  and 
i'{k,p)  denotes  the  highest  possible  degree  in  any  MST 
over  any  finite  pointset  in  ^;-dimensional  space  under 
the  Lp  metric. 

We  will  need  the  following  result,  which  is  easily 
established. 

Lemma  1  ||3;y||p  >  2  if  and  only  if  B{k,p,x)  C 
B{k,p,y)  =  0.  Further,  ||xy||p  =  2  if  and  only  if 
B{k,p,x)  and  B{k,p,y)  are  tangent. 

Lemma  2  The  Hadwiger  number  H{k,p)  is  equal  to 
I[k,p),  the  maximum  number  of  points  that  can  be 
placed  on  the  boundary  of  the  unit  ball  B{k,p)  so  that 
all  interpoint  distances  are  at  least  one  unit  long. 

Proof:  We  first  show  that  I{k,p)  <  H{k,p).  Suppose 
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that  I{k,p)  points  are  on  the  boundary  of  B{k,p)  and 
are  each  at  least  a  unit  distance  apart.  Consider  plac¬ 
ing  an  Lp  ball  of  radius  ^  around  each  point,  includ¬ 
ing  one  at  the  origin.  By  Lemma  1,  these  balls  form 
a  packing,  and  all  the  balls  touch  the  ball  containing 
the  origin.  Therefore,  I{k,p)  <  H{k,p). 

Next  we  show  that  H{k,p)  <  I{k,p).  Consider  a 
packing  of  Lp  unit  balls,  and  choose  one  to  be  centered 
at  the  origin.  Consider  the  edges  connecting  the  origin 
to  each  center  of  the  neighboring  balls.  The  intersec¬ 
tions  of  these  edges  with  the  boundary  of  B(k,p)  yield 
a  pointset  where  each  pair  of  points  is  separated  by  at 
least  a  unit  distance,  otherwise  we  would  not  have  a 
packing  of  Lp  unit  balls.  ^ 

Lemma  3  The  Hadwiger  number  H(k,p)  is  equal  to 
v{k,p),  the  maximum  MST  degree  over  any  finite 
pointset  in  IStp. 

Proof:  We  show  that  i'{k,p)  <  I(k,p),  the  maximum 
number  of  points  that  can  be  placed  on  the  bound¬ 
ary  of  an  Lp  fc-dimensional  unit  ball  so  that  each 
pair  of  points  is  at  least  a  unit  apart.  Let  a;  be  a 
point,  and  let  yi, ...  ,yi,  be  points  adjacent  to  x  in  an 
MST,  indexed  in  order  of  increasing  distance  from  x. 
Note  that  (j/;,  pj)  must  be  a  longest  edge  in  the  trian¬ 
gle  {x,yi,yj),  and  that  the  MST  restricted  to  x  and 
yi,...,yu  is  a  star  centered  at  x.  Draw  a  small  Lp 
ball  around  x,  without  loss  of  generality  a  unit  ball, 
and  consider  the  intersection  of  the  segments  {x,yi), 

I  <  i  <  v,  with  B{k,p,x).  Let  these  intersection 
points  be  called  yi,  1  <  i  <  and  suppose  there  is 
a  pair  pi  and  pj ,  i  <  j,  with  \\yiyj\\p  <  1.  Note  that 
{in, Pi)  is  the  shortest  edge  on  the  triangle  {x,yi,yj), 
and  (x,  pj)  is  a  longest  edge.  Now  consider  similar  tri¬ 
angle  {x,  Pi,  z),  where  z  is  a  point  on  the  edge  {x,  yj). 
The  path  from  yj  to  z  to  j/j  is  shorter  than  the  length 
of  {x,yj),  so  (yijPj)  is  not  a  longest  edge  in  triangle 
{x,  Pi,  Pj),  a.  contradiction.  We  note  this  bound  is  tight 
for  pointsets  that  realize  I{k,p).  U 

We  next  consider  a  slightly  different  number 
0{k,p),  which  is  closely  related  to  v{k,p).  Recall 
that  v{k ,p,  P,T)  denotes  the  maximum  vertex  de¬ 
gree  of  the  tree  T.  Let  v{k,p,P)  =  mmi'{k,p,P,T), 

T^t 

where  r  is  the  set  of  all  MSTs  for  P,  and  let 
i){k,p)=  max  i'(k,p,  P).  In  other  words,  i'{k,p,P) 

denotes  the  degree  of  an  MST  over  P  that  has  the 
smallest  possible  degree,  and  k{k,p)  denotes  the  max¬ 


imum  of  the  degrees  of  all  minimum-degree  MSTs  over 
all  finite  pointsets  in  9?^  (recall  that  we  use  the  phrase 
“the  degree  of  T”  to  refer  to  the  maximum  vertex  de¬ 
gree  of  the  tree  T). 

Although  it  is  clear  that  v{k,p)  <  p{k,p),  it  is 
not  clear  when  this  inequality  is  strict.  In  order  to 
count  i>(k,  p),  we  define  the  MST  number  M {k,  p)  sim¬ 
ilarly  to  the  Hadwiger  number  H{k,p),  except  that  the 
translates  of  the  Lp  unit  ball  B{k,p)  are  slightly  mag- 
nihed.  The  underlying  packing  consists  of  B{k,p)  as 
well  as  multiple  translated  copies  of  (1  -f  e)  •  B{k,p), 
and  M{k,p)  is  the  supremum  over  all  £  >  0  of  the 
maximum  number  of  neighbors  of  B{k,p)  over  all  such 
packings.  Clearly  M{k,p)  <  H{k,p).  We  also  define 
I{k,p)  as  the  number  of  points  that  can  be  placed  on 
the  boundary  of  B{k,p)  so  that  each  pair  is  strictly 
greater  than  one  unit  apart. 

Consider  a  set  S  =  {xi, . . . ,  Xn}  of  7i  points  in 

For  convenience,  let  N  -  2(2),  and  let  Si,.  .  .,Sn  be 
the  set  of  sums  of  the  interdistances,  one  sum  for  each 
distinct  subset.  Let 

0  <  ^  =  min  {|5,:  -  5,- 1  :  \Si  -5,1  >  0} 

A  perturbation  of  a  pointset  5  is  a  bijection  from  5 
to  a  second  set  S'  =  {sj, . .  . ,  (for  convenience, 
assume  that  the  indices  indicate  the  bijection);  we  say 
that  a  perturbation  of  5  is  small  if 

<  I 

!  =  1 

In  discussing  spanning  trees  of  5  and  the  perturbed 
set  5',  we  assume  that  the  vertex  set  [n]  consists  of  the 
integers  1  to  n,  where  vertex  i  corresponds  to  point 
or  point  x'^.  The  topology  of  a  tree  over  vertex  set  [n] 
is  the  set  of  edges  in  the  tree. 

Theorem  4  Let  S  be  a  set  of  points  in  and  let  S' 
be  a  set  of  points  corresponding  to  a  small  perturbation 
of  S.  Then  the  topology  of  an  MST  for  S'  is  also  a 
topology  for  an  MST  for  S. 

Proof:  Let  T  be  an  MST  for  5,  and  let  T'  be  an 
MST  for  S' .  Let  1{T)  and  1{T')  be  the  lengths  of  T 
andT',  respectively.  Then  l{T)-~^  <  1{T')  <  /(r)-p|. 
Consider  the  tree  T  with  the  same  topology  as  T'  but 
with  respect  to  pointset  5.  Now,  1{T')  -  |  <  1{T)  < 
1{T')  -f-  |,  so  1{T)  —  6  <  1{T)  <  1{T)  +  6.  Since  6  is 
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the  minimum  positive  difference  between  the  sums  of 
any  two  distinct  subsets  of  interdistances,  1{T)  =  1{T), 
and  T  is  also  an  MST  for  S.  □ 

Lemma  5  The  maximum  of  ihe  degrees  of  all 
minimum- degree  MSTs  over  all  finite  poinisets  in  Iftp, 
is  equal  to  the  maximum  number  of  slightly  magnified 
unit  balls  that  can  be  packed  around  a  given  unit  ball; 
that  is,  P{k,p)  =  M{k,p). 

Proof:  Let  5  be  a  set  of  points,  and  let  6  be  defined 
as  above.  Place  a  small  Lp  ball  about  each  point  x  G 
S  (without  loss  of  generality  a  unit  ball,  though  the 
intent  is  that  x  is  the  only  point  inside  B(k,p,x)), 
and  connect  each  distinct  pair  {x,y),  x,y  G  5,  with 
a  line  segment.  Consider  the  intersections  of  these 
edges  with  B{k,p,x).  Perform  a  small  perturbation 
on  S  so  that  no  two  intersection  points  have  length  1. 
Repeat  the  argument  used  in  the  proof  of  Lemmas  2 
and  3,  this  time  with  balls  of  the  form  (l+e)  ■  B{k,p), 
for  small  e  >  0.  The  first  part  shows  that  I{k,p)  = 
M{k,p),  and  the  second  that  i'{k,p)  <  I{k,p).  This 
bound  is  tight  for  pointsets  that  realize  I{k,p).  q 

3  The  Maximum  Li  MST  Degree 

Hadwiger  numbers  are  notoriously  difficult  to  com¬ 
pute.  In  this  section,  we  determine  the  2  and  3  dimen¬ 
sional  Hadwiger  numbers  for  the  diamond  and  octa¬ 
hedron,  respectively.  The  first  of  these  numbers  is 
well-known,  but  we  could  not  find  any  reference  for 
the  octahedron.  We  also  study  the  MST  numbers, 
obtaining  a  value  of  4  in  two  dimensions,  and  bounds 
in  higher  dimensions.  For  notational  convenience,  we 
define: 

The  Uniqueness  Property:  Given  a  point  p,  a 
region  R  has  the  uniqueness  property  with  respect 
to  p  if  for  every  pair  of  points  u,w  G  R,  ||ww|  < 

max(||u;pl|,up||). 

A  partition  of  space  into  a  finite  set  of  disjoint  regions 
is  said  to  have  the  uniqueness  property  if  each  of  its 
regions  has  the  uniqueness  property. 

Define  the  diagonal  partition  of  the  plane  as  the 
partition  induced  by  the  two  lines  oriented  at  45  and 
-45  degrees  through  a  point  p  (i.e.,  partitioning  3?^  — 
{p}  into  8  disjoint  regions,  four  “wedges”  of  dimension 
two  (labeled  R1  through  RA  in  Figure  1(a)),  and  four 
“half-lines”  of  dimension  one  (labeled  i?5  through  R8 
in  Figure  1(a)).  It  is  easy  to  show  that  the  diagonal 


partition  has  the  uniqueness  property,  which  in  turn 
implies  an  upper  bound  of  8  on  the  maximum  MST 
degree  in  the  Manhattan  plane. 

Lemma  6  Given  a  point  p  in  ihe  Manhattan  plane, 
each  region  of  the  diagonal  partition  with  respect  to  p 
has  the  uniqueness  property. 

Proof:  We  need  to  show  that  for  any  two  distinct 
points  u  and  w  that  lie  in  the  same  region,  Hmul  < 
max(||«)p||,  up||).  This  is  obvious  for  the  one  dimen¬ 
sional  regions.  Consider  m,  in  in  one  of  the  two  dimen¬ 
sional  regions.  Assume  without  loss  of  generality  that 
||up||  <  ||inp||  (otherwise  swap  the  roles  of  u  and  v  in 
this  proof).  Consider  the  diamond  D  with  left  corner 
at  p  and  center  at  c,  such  that  u  is  on  the  bound¬ 
ary  of  D  (see  Figure  1(c)).  Let  a  ray  starting  at  p  and 
passing  through  w  intersect  D  at  6.  By  the  triangle  in¬ 
equality,  ||inu||  <  ||in6|| -f  ||6u||  <  ||in61| -|- ||6c|| -)- ||cu||  = 
||w6||  +  ||6c||  -h  ||cp||  =  ||u^p||-  Thus  every  one  of  the 
8  regions  of  the  diagonal  partition  has  the  uniqueness 
property.  □ 


Figure  1:  A  partition  of  K  —  {p}  into  8  regions  such 
that  for  any  two  points  u  and  w  that  lie  in  the  same 
region,  either  ||uui||  <  ||inp||  or  else  ||miu||  <  i|«p||- 
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Corollary  7  The  maximum  possible  degree  of  an 
MST  over  any  finite  poiniset  in  the  rectilinear  plane 
is  equal  to  8;  that  is,  iy(2, 1)  =  8. 

Proof:  By  Lemma  6  the  diagonal  partition  has  the 
uniqueness  property,  which  implies  that  1(2,1)  <  8. 
The  pointset  {(0,  0),  (±1,0),  (0,±1),  (±^,  ±|)}  shows 
that  this  bound  is  tight.  Lemmas  2  and  3  imply  that 
the  maximum  MST  degree  in  the  Manhattan  plane  is 
equal  to  8.  □ 

We  now  show  an  analogous  result  for  three- 
dimensional  Manhattan  space.  Consider  a  cuboctahe- 
dra/ partition  of  into  14  disjoint  regions  correspond¬ 
ing  to  the  faces  of  a  truncated  cube  (Figure  2(a-b)), 
i.e.,  6  congruent  pyramids  with  square  cross-section 
(Figure  2(c))  and  8  congruent  pyramids  with  trian¬ 
gular  cross-section  (Figure  2(d)).  Most  of  the  region 
boundaries  are  included  into  the  triangular  pyramid 
regions  as  shown  in  Figure  3(c),  with  the  remaining 
boundaries  forming  4  new  regions  (Figure  3(d)),  to  a 
total  of  18  regions.  Following  the  same  strategy  as 
in  the  two-dimensional  case,  we  first  show  that  the 
uniqueness  property  holds. 

Lemma  8  Given  a  point  p  in  three-dimensional  Man¬ 
hattan  space,  each  region  of  the  cuboctahedral  partition 
with  respect  to  p  has  the  uniqueness  property. 

Proof:  We  need  to  show  that  for  any  two  points  u  and 
w  that  lie  in  the  same  region  of  the  cuboctahedral 
partition,  <  max(||rr;p||,  «p||).  This  is  obvious 

for  the  2  dimensional  regions  that  are  the  boundaries 
between  the  pyramids  (by  an  argument  analogous  to 
that  of  Lemma  6). 

Consider  one  of  the  square  pyramids  R  with  respect 
to  p  (Figure  2(c)),  and  let  u,w  £  R.  Assume  without 
loss  of  generality  that  ||up||  <  ||rcp||  (otherwise  swap 
the  roles  of  u  and  re).  Consider  the  locus  of  points 
D  C  R  that  are  distance  ||up||  from  p.  (Figure  2(e)); 
D  is  the  upper  half  of  the  boundary  of  an  octahedron. 
Let  c  be  the  center  of  the  octahedron  determined  by 
D,  so  that  c  is  equidistant  from  all  points  of  D.  Let 
b  be  the  intersection  of  the  surface  of  D  with  a  ray 
starting  from  p  and  passing  through  w.  By  the  triangle 
inequality,  ||tfw||  <  UrcfeU  ±  ||&m||  <  H'U'^ll  +  ||^’c||  ± 
||cu||  =  ||u;6||  ±  ||6c||  ±  ||c;?||  =  UmpH  (recall  that  the 
square  pyramid  regions  do  not  contain  their  boundary 
points).  Thus,  w  is  closer  to  u  than  it  is  to  p,  which 
implies  that  the  region  R  has  the  uniqueness  property. 


Figure  2:  A  truncated  cube  (a-b)  induces  a  3- 
dimensional  cuboctahedral  partition  of  space  into  14 
regions:  6  square  pyramids  (c),  and  8  triangular  pyra¬ 
mids  (d).  Using  the  triangle  inequality,  each  region 
may  be  shown  to  contain  at  most  one  candidate  point 
for  connection  with  the  origin  in  an  MST  (e-f). 


To  show  the  uniqueness  property  for  the  triangular 
pyramids,  consider  one  of  the  triangular  pyramids  R 
with  respect  to  p  (Figure  2(d)),  and  let  u,w  £  R. 
Assume  without  loss  of  generality  that  ||wp||  <  ||u:p|| 
(otherwise  swap  the  roles  of  u  and  w).  Consider  the 
locus  of  points  D  in  R  that  are  at  distance  ||up||  fromp 
(Figure  2(f)).  Let  b  be  the  intersection  of  D  with  a  ray 
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Figure  3:  Assigning  the  boundary  points  to  the  var¬ 
ious  region:  a  hollow  (solid)  dot  indicates  an  open 
(closed)  interval.  A  topological  mapping  of  the  cuboc- 
tahedron  (a)  is  shown  in  (b).  The  various  boundaries 
are  included  into  the  triangular  pyramid  regions  (c), 
while  the  square  pyramids  do  not  contain  any  bound¬ 
ary  points.  The  remaining  boundaries  form  4  new  re¬ 
gions  (d),  bringing  the  total  to  18. 


starting  from  p  and  passing  through  w.  By  the  triangle 
inequality,  ||u;u||  <  ||w;6||-|-||6u||  <  ||u;6||  +  ||6p||  =  ||wp|| 
(recall  that  each  triangular  region  is  missing  one  of  its 
boundary  faces,  as  shown  in  Figure  3(a-b)).  Thus,  w 
is  closer  to  u  than  it  is  to  p,  which  implies  that  the 
region  R  has  the  uniquenes  property. 

Thus  every  one  of  the  18  regions  of  the  cuboctahedral 
partition  has  the  uniqueness  property.  q 

Corollary  9  The  maximum  possible  degree  of  an 
MST  over  any  finite  pointset  in  three-dimensional 
Manhattan  space  is  equal  to  18/  that  is,  j/(3, 1)  =  18. 

Proof:  By  Lemma  8  the  cuboctahedral  partition  has 
the  uniqueness  property,  which  implies  that  1(3, 1)  < 
18.  The  pointset  {(0,0,0),  (±1,0,0),  (0,±1,0), 

(0,0,±1),  (±|,±i0),  (0,±|,±i),  (±i,0,±i)} 


shows  that  this  bound  is  tight.  Lemmas  2  and  3  imply 
that  the  maximum  MST  degree  in  three-dimensional 
Manhattan  space  is  18.  q 

We  can  further  refine  the  maximum  MST  degree 
bound  of  Corollary  7  by  applying  the  perturbative  ar¬ 
gument  of  Theorem  4. 

Theorem  10  The  maximum  degree  of  a  minimum- 
degree  MST  over  any  finite  pointset  in  the  rectilinear 
plane  is  4;  that  is,  j>(2, 1)  =  4. 

Proof:  The  pointset  {(0,0),  (±1,0),  (0,  ±1)}  estab¬ 
lishes  a  lower  bound  of  4.  To  get  the  upper  bound  of 
4,  consider  1(2, 1),  the  number  of  points  that  can  be 
placed  on  boundary  of  a  unit  ball  (i.e.,  a  diamond)  in 

such  that  each  pair  of  points  is  strictly  greater  than 
one  unit  apart.  Consider  the  diagonal  partition,  as  in 
the  proof  of  Lemma  6;  at  most  one  point  can  be  in 
the  closure  of  each  of  the  four  2-dimensional  regions, 
proving  the  result.  ^ 

Theorem  10  has  an  interesting  consequence  on  the 
complexity  of  the  BDMST  problem  restricted  to  the 
rectilinear  plane: 

Instance:  A  planar  pointset  P  =  {xi,. .  .,Xn},  and 
integers  D  and  C. 

Question:  Is  there  a  rectilinear  spanning  tree  with 
maximum  degree  <  D  and  cost  <  Cl 

li  D  =  A,  the  question  can  be  decided  in  polyno¬ 
mial  time.  In  fact,  our  methods  establish  that  such 
a  bounded-diameter  MST  can  be  computed  as  effi¬ 
ciently  as  an  ordinary  MST.  On  the  other  hand,  if 
D  =  2,  the  problem  is  essentially  a  rectilinear  travel¬ 
ing  salesman  problem  (a  wandering  salesman  problem, 
since  the  tour  is  a  path  rather  than  a  circuit),  and  it 
is  therefore  NP-complete.  It  turns  out  that  the  D  =  3 
question  is  also  NP-complete,  using  a  proof  identical 
to  the  one  appearing  in  Papadimitriou  and  Vazirani 
[15]  (their  result  is  for  the  corresponding  Euclidean 
problem,  but  since  they  restrict  their  construction  to 
a  special  type  of  grid  graph,  their  proof  holds  in  the 
rectilinear  metric  as  well).  We  can  summarize  the  rec¬ 
tilinear  and  Euclidean  results  in  the  following  table: 


Complexity  of  the  B 

DMST  Problem 

D 

Euclidean 

rectilinear 

2 

N  P-complete 

NP-complete 

3 

NP-complete 

NP-complete 

4 

open 

polynomial 

>  5 

polynomial 

polynomial 
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Next,  we  refine  the  maximum  MST  degree  bound 
of  Corollary  9. 

Theorem  11  The  maximum  degree  of  a  minimum- 
degree  TIST  over  any  finite  poinisei  in  i^-dimensionai 
rectilinear  space  is  either  13  or  14;  that  is,  13  < 
z>(3,l)  <  14. 

Proof:  For  the  lower  bound,  the  following  pointset 
shows  that  the  maximum  degree  of  an  MT  is  I{k,p)  > 
13:  {(0,0,0),  (±100,0,0),  (0, ±100,0),  (0,0, ±100), 
(47,  _4, 49),  (-6,  -49, 45),  (-49,  8, 43),  (-4, 47,  -49), 
(_49,_6,-45),  (8, -49, -43).  (49,49,2)},  since  for 
this  pointset,  all  non-origin  points  are  strictly  closer 
to  the  origin  than  they  are  to  each  other,  forcing  the 
MST  to  be  unique  with  a  star  topology. 

To  obtain  the  upper  bound  of  14  on  the  maximum 
MST  degree,  consider  the  cubeoctahedral  partition. 
Any  two  points  lying  in  the  closure  of  one  of  the  14 
main  regions  of  the  cuboctahedral  partition  must  be 
within  distance  1  of  each  other.  □ 

We  note  that  there  is  an  elementary  means  to  set¬ 
tle  the  13  vs.  14  question  raised  in  Theorem  11. 
Suppose  we  are  trying  to  decide  whether  14  points 
can  be  placed  on  the  surface  of  a  unit  octahedron 
so  that  each  pair  is  greater  than  a  unit  distance 
apart.  The  relationship  between  point  (xi,yi,Zi)  and 
point  (xj,yj,Zj)  can  be  phrased  by  the  inequality 
\xi  -  Xj\-{-\yi  -  yj\  +  \zi  -  I  >  1  subject  to  the  con¬ 
straints  \xi\  ±  \yi\  ±  \zi\  =  1  and  \xj\  ±  \yj\  ±  \zj\  =  1. 
The  absolute  values  can  be  removed  if  the  relative  or¬ 
der  between  Xi  and  x^,  etc.,  is  known.  We  can  there¬ 
fore  consider  all  permutations  of  the  coordinates  of 
the  14  points  and  produce  the  corresponding  inequal¬ 
ities.  If  the  inequalities  corresponding  to  a  particular 
permutation  are  simultaneously  satisfied,  7'(3, 1)  =  14, 
otherwise  7'(3, 1)  =  13.  Feasibility  can  be  settled  by 
determining  whether  a  particular  polytope  contains  a 
nonempty  relative  interior  (this  approach  is  easily  ex¬ 
tended  to  any  dimension  k).  We  have  not  settled  the 
13  vs.  14  question,  and  the  above  procedure  seems 
impractical  due  to  the  large  number  of  resulting  in¬ 
equalities. 

We  now  address  the  Hadwiger  and  MST  Numbers 
for  the  fc-crosspolytope. 

Theorem  12  The  maximum  degree  of  a  minimum- 
degree  MST  over  any  finite  pointset  in  SRj  is  at  least 

Q(20.0312i)  .  p(fc,  1)  =  Q(20.0312J:) 


Proof:  Consider  the  family  F{j)  of  points 

(±4, . . . ,  ±i,  0, . . . ,  0),  where  j  is  an  integer  between 
1  and  k.  (Here,  the  j  nonzero  terms  can  be  arbitrarily 
interspersed  in  the  vector.) 

Each  member  of  F[j)  is  distance  1  from  the  origin; 
the  distance  between  x  £  F{j)  and  y  E  F{j)  depends 
on  the  positions  and  signs  of  the  nonzero  terms.  Given 
X  =  {xi,X2,  ...,Xk)E  F{j),  let  X  be  the  binary  vector 
containing  a  1  in  bit  i  if  Xi  yf  0  and  a  0  in  bit  i  if 
Xi  =  0.  If  the  Hamming  distance  between  x  and  y  is 
at  least  j,  then  |lxp||  >  1.  (The  Hamming  distance 
between  two  bit  vectors  is  the  number  of  bit  positions 
in  which  they  differ.)  We  want  to  find  a  large  set  of 
X  that  are  mutually  Hamming  distance  greater  than  j 
apart. 

Consider  the  set  V{f)  of  bit  vectors  containing  ex¬ 
actly  j  I’s;  \V{j)\  =  Form  a  graph  G{t)  = 

{V{t),  E{t))  for  which  {x,  y)  E  E{t)  if  and  only  if  the 
Hamming  distance  between  x  and  y  is  at  most  j.  Note 

Lj72J 

that  G{t)  is  regular  with  degree  d{j)  - 

To  see  this,  we  determine  the  number  of  edges  adja¬ 
cent  to  X  =  (1,...,1,0,...,0),  where  there  are  j  I’s. 
The  set  of  vectors  in  V{j)  adjacent  to  x  can  be  par¬ 
titioned  into  vectors  that  contain  i  O’s  in  the  first  j 
positions,  1  <  i  <  {§}  ±  1.  For  a  given  i,  there  are  ({) 
ways  to  choose  the  0  positions  and  positions  to 

place  the  displaced  I’s  in  the  last  k  —  j  positions. 

Here  is  our  strategy  to  find  a  subset  ofV{  j)  of  large 
cardinality  that  are  mutually  far  apart:  choose  a  ver¬ 
tex,  delete  its  neighbors,  and  continue.  The  number 
of  vertices  chosen  must  exceed  \V{j)\/d{j).  Suppose 
that  cj  =  l&\/ej  =  k.  Then 

(?)  '  0 
i(i)  -(i  +  i)(l)(‘:')  + 

Here,  c'  and  c"  are  constants;  the  approximation  to 
({)  is  from  Graham  et  al.  [8].  Substituting  for  c  gives 
the  result.  □ 

4  The  Maximum  MST  Degree 

In  this  section,  we  provide  bounds  on  M{k,p)  for 
general  Lp  metrics. 
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Theorem  13  The  maximum  degree  of  a  minimum- 
degree  MST  over  any  finite  pointset  in  is  at  least 
iy{k,p)  =  where  a  =  ^  and  E{x)  = 

x\g^  +  {l-x)\gj^. 


Proof:  Consider  the  vertices  of  the  ^-hypercube 
(±1, . . . ,  ±1).  Each  of  these  points  is  from  the 
origin.  On  the  other  hand,  if  points  x  and  y  differ 
from  each  other  in  j  positions,  they  are  distance  2j 
from  each  other.  If  2j^^P  >  k^/P ,  then  x  and  y  are 
further  from  each  other  than  they  are  from  the  origin. 

We  need  to  find  the  largest  cardinality  set  of  points 
on  the  ;l:-hypercube  that  differ  in  at  least  J  =  ^  po¬ 
sitions.  To  do  this,  construct  a  graph  G  whose  vertex 
set  is  the  set  of  binary  strings  of  length  k,  and  for 
which  there  is  an  edge  between  string  a  and  string  b 
if  and  only  if  the  Hamming  distance  between  a  and  b 
is  at  most  J .  Proceed  in  the  same  manner  as  in  the 

proof  of  Theorem  12,  except  that  d{J)  =  The 

i=i 

number  of  vertices  chosen  must  exceed  2^ /d{J).  Now, 


2tB(a)-^lgi+0(l) 


for  0  <  a  <  i  (see  Graham  et  al.  [8],  Chapter  9, 
Problem  42).  Note  that  a  =  ^,  so 


2k 

W) 


□ 

Theorem  13  shows  that  for  any  fixed  p  >  I,  i>{k,p) 
grows  exponentially  in  the  dimension.  Note  that  this 
bound  is  less  than  the  bound  obtained  by  Wyner  [20] 
(for  H{k,2),  it  is  0(2°-^®®'')  since  E{\)  =  |lg3-  1  » 
0.189),  but  it  is  sufficient  for  our  purposes. 

It  is  well  known  that  H{k,p)  <  3^’  —  1  (e.g.,  [19]).  In 
2-dimensional  space,  the  Hadwiger  number  is  largest 
for  Li  and  L^o,  the  only  planar  Lp  metrics  with  Had¬ 
wiger  number  8.  For  all  other  Lp  metrics,  the  Had¬ 
wiger  number  is  6.  On  the  other  hand,  the  planar  MST 
number  is  smallest  for  Li  and  Loo,  having  a  value  of 
4,  and  it  is  easily  seen  to  be  5  for  all  other  Lp  metrics. 

These  observations  raise  an  interesting  question; 
how  does  the  MST  number  behave  as  a  function  of  p? 
Note  that  the  maximum  Hadwiger  number  is  achieved 
by  parallelotopes.  Next  we  derive  the  MST  number 
for  the  Loo  unit  ball  (i.e.,  the  fc-hypercube),  and  show 


that  the  MST  number  is  not  maximized  in  the  Loo 
metric  in  any  dimension. 

Theorem  14  The  maximum  degree  of  a  minimum- 
degree  MST  over  any  finite  pointset  in  is  2^ ;  that 
is,  i'{k,  oo)  =  2*^. 

Proof:  We  first  show  the  result  for  p  =  oo;  note  that 
the  Lp  unit  ball  is  a  fc-hypercube.  The  upper  bound 
is  established  by  considering  i{k,p),  the  number  of 
points  that  can  be  placed  on  boundary  of  a  unit  ball 
in  such  that  each  pair  of  points  is  strictly  greater 
than  one  unit  apart.  Note  that  at  most  one  point  can 
be  placed  in  each  fc-ant  (the  fc-dimensional  analogue 
of  “quadrant”).  The  lower  bound  is  established  by 
considering  the  set  of  2’^  vertices  of  a  Ar-hypercube.  □ 

Theorem  15  For  each  k,  there  is  a  p  such  that  the 
maximum  degree  of  a  minimum- degree  MST  over  any 
finite  pointset  in  Sftp  space  exceeds  2^;  that  is,  for  all 
k  there  exists  a  p  such  that  i/{k,p)  >  2^ . 

Proof:  Consider  the  pointset  (-1,  ±1, . . . ,  ±1), 

(e,±^, . .  .,±^),  and  (^*^''^,0, . .  .,0),  where  (e?  +  {k  - 
1)6?)  =  k.  It  is  possible  to  choose  e,  8,  and  p  so  that 
each  pair  of  points  is  on  the  surface  of  a  Lp  ball  of  ra¬ 
dius  k^^P,  and  all  interdistances  are  greater  than  k'^^P. 

□ 

5  Conclusion 

Motivated  by  practical  VLSI  applications,  we 
showed  that  the  maximum  possible  vertex  degree  in 
an  Lp  MST  equals  the  Hadwiger  number  of  the  corre¬ 
sponding  unit  ball,  and  we  determined  the  maximum 
vertex  degree  in  a  minimum-degree  Lp  MST.  We  gave 
an  exponential  lower  bound  on  the  MST  number  of 
a  Ccrosspolytope,  and  showed  that  the  MST  number 
for  an  Lp  unit  ball,  p  >  1,  is  exponential  in  the  dimen¬ 
sion.  We  concentrated  on  the  L\  metric  in  two  and 
three  dimensions  due  to  its  significance  for  VLSI:  for 
example,  we  showed  that  for  any  finite  pointset  in  the 
Manhattan  plane  there  exists  an  MST  with  maximum 
degree  of  at  most  4,  and  that  for  three-dimensional 
Manhattan  space  the  maximum  possible  degree  of  a 
minimum-degree  MST  is  either  13  or  14. 

We  solved  an  open  problem  regarding  the  complex¬ 
ity  of  computing  bounded-degree  MSTs  by  providing 
the  first  known  polynomial-time  algorithm  for  con¬ 
structing  a  MST  with  maximum  degree  4  for  an  ar¬ 
bitrary  pointset  in  the  rectilinear  plane.  Moreover, 
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our  techniques  can  be  used  to  compute  a  bounded- 
diameter  MST  as  efficiently  as  an  ordinary  MST.  Fi¬ 
nally,  our  results  also  enable  a  significant  execution 
speedup  of  a  number  of  common  VLSI  routing  algo¬ 
rithms.  Remaining  open  problems  include: 

1.  Whether  the  MST  number  for  Li  in  three  dimen¬ 
sions  is  13  or  14; 

2.  The  complexity  of  computing  a  planar  Euclidean 
MST  with  maximum  degree  4; 

3.  Tighter  bounds  on  the  Hadwiger  and  MST  num¬ 
bers  for  arbitrary  k  and  p. 
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Optimal  Linear-Time  Algorithm  for  the  Shortest  Illuminating  Line 

Segment  in  a  Polygon 

(Extended  Abstract) 

Gautam  Das  Giri  Narasimhanf 


Abstract 

Given  a  simple  polygon,  we  present  an  optimal 
linear-time  algorithm  that  computes  the  shortest  il¬ 
luminating  line  segment,  if  one  exists;  else  it  reports 
that  none  exists.  This  solves  an  intriguing  open 
problem  by  improving  the  0{n  log  n)-time  algorithm 
[Ke87]  for  computing  such  a  segment. 


1  Introduction 


In  this  paper  we  present  an  optimal  linear-time  al¬ 
gorithm  to  do  the  following:  given  a  simple  polygon 
with  n  vertices,  it  computes  in  0{n)  time  the  short¬ 
est  illuminating  line  segment  inside  the  polygon,  if 
one  exists;  otherwise  it  reports  that  no  such  seg¬ 
ment  exists.  In  other  words,  it  computes  the  short¬ 
est  line  segment  within  a  polygon  that  needs  to  be 
illuminated  in  order  for  the  interior  of  the  polygon 
to  be  lit.  The  algorithm  solves  an  outstanding  open 
problem  by  improving  the  0{n  log  n)-time  algorithm 
presented  by  Ke  [Ke87]  for  this  problem. 

The  concept  of  weak  visibility  in  polygons  was  in¬ 
troduced  by  Avis  and  Toussaint  [AT].  Since  then  it 
has  received  much  attention  from  researchers  [AT, 
BMT,  SS,  TL,  DHNl,  DHN2,  Che,  DC,  IK,  Ke87]; 
also  see  the  survey  article  by  J.  O’Rourke  [O’Rj. 
Two  sets  of  points  are  weakly  visible  from  each  other 
if  every  point  in  either  set  is  visible  from  some  point 
in  the  other  set.  Thus,  our  algorithm  computes  the 
shortest  line  segment  that  is  weakly  visible  from  a 
given  simple  polygon. 
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An  interesting  related  result  due  to  Bhattacharya 
and  Mukhopadhyay  [BM]  is  a  linear-time  algo¬ 
rithm  to  compute  a  single  weakly-visible  line  seg¬ 
ment  in  a  polygon.  Although  a  linea.r-time  algo¬ 
rithm  was  presented  for  this  problem  in  [DHN2],  the 
strength  of  their  result  lies  in  the  fact  that  their 
algorithm  does  not  use  the  linear-time  triangula¬ 
tion  algorithm  [Cha],  or  the  linear-time  algorithm 
to  compute  shortest  paths  in  a  triangulated  polygon 
[LP,  GHLSTj.  The  present  algorithm,  in  contrast, 
uses  both  the  triangulation  as  well  as  the  shortest 
path  algorithms,  but  manages  to  compute  more;  it 
computes  the  shortest  weakly-visible  segment  in  a 
polygon.  Another  related  result  is  an  optimal  linear¬ 
time  algorithm  due  to  Bhattacharya  et  al.  [BMT] 
for  computing  the  shortest  line  segment  from  which 
the  exterior  of  a  simple  polygon  is  weakly  visible.  It 
may  be  noted  that  we  are  concerned  with  internal 
weak  visibility. 

Besides  resolving  a  long-standing  open  problem, 
our  paper  is  also  interesting  because  of  the  tech¬ 
niques  used.  The  results  in  this  paper  build  on 
some  of  our  previous  work  on  optimal  linear-time 
algorithms  for  weak-visibility  problems  in  polygons. 
The  linear-time  algorithms  for  computing  all  LR- 
visible  pairs  of  points  [DHNl]  and  for  computing  all 
weakly-visible  chords  [DHN2]  output  a  mass  of  infor¬ 
mation  related  to  visibility  within  a  polygon.  Our 
present  algorithm  shows  how  to  exploit  this  wealth 
of  information  to  answer  more  interesting  questions 
related  to  weak  visibility  in  polygons.  We  achieve 
our  results  by  studying  the  structure  of  minimal 
weakly-visible  segments  and  identifying  the  bound¬ 
ing  chords  for  such  segments.  As  described  later, 
one  of  the  by-products  of  our  algorithm  in  this  pa¬ 
per  is  a  linear-time  algorithm  to  generate  all  minimal 
weakly-visible  segments.  These  techniques  were  also 
used  in  [DHN2]  to  obtain  a  linear-time  recognition 
of  L^-convexity  of  simple  polygons. 

The  shortest  illuminating  line  segment  in  a  poly- 


gon  can  also  be  thought  of  as  the  shortest  straight 
line  path  that  a  watchman  could  patrol  along  in  or¬ 
der  to  watch  over  a  polygonal  art  gallery.  There  have 
been  a  number  of  papers  on  the  shortest  watchman 
tour  problem  [CN,  PV].  Our  algorithm  finds  the 
shortest  straight-line  watchman  tour,  if  one  exists. 

2  Notation 

We  define  notation  for  this  paper.  A  polygonal  chain 
is  a  concatenation  of  line  segments.  The  endpoints 
of  the  segments  are  called  vertices,  and  the  segments 
themselves  are  edges.  If  the  segments  intersect  only 
at  the  endpoints  of  adjacent  segments,  then  the 
chain  is  simple,  and  if  a  polygonal  chain  is  closed 
we  call  it  a  polygon.  In  this  paper,  we  deal  with 
a  simple  polygon  P  of  n  vertices,  and  its  interior, 
int[P).  The  segment  between  two  points  x  and  y  is 
denoted  xy,  and  inti^)  =  xy\  {®,  y}.  Two  points 
x,y  £  P  are  visible  (or  co-visible)  ifxy  C  PUint(P). 
Two  sets  of  points  A  and  B  are  said  to  be  weakly 
visible  from  each  other  if  every  point  in  A  is  visible 
from  some  point  of  E,  and  vice  versa.  A  polygon  is 
said  to  be  ij-convex  if  for  every  pair  of  points  in  the 
polygon,  there  exists  another  point  from  which  the 
first  two  are  visible.  The  (Euclidean)  distance  be¬ 
tween  two  points  X  and  y  is  denoted  dist{x,y).  We 
assume  that  the  input  is  in  general  position,  which 
means  that  no  three  vertices  are  collinear,  and  no 
three  lines  defined  by  edges  intersect  in  a  common 
point. 

If  X  and  y  are  points  of  P,  then  Pcw{x,y) 
{Pccwi^^y))  is  the  subchain  obtained  by  travers¬ 
ing  P  clockwise  (counterclockwise)  from  x  to  y.  The 
subchains  Pcw{^,y)  and  Pccw{^,y)  includes  their 
endpoints  x  and  y.  A  polygon  P  is  said  to  be  LR- 
visible  with  respect  to  a  pair  of  points  x  and  y,  if 
Pcw{x,y)  is  weakly  visible  from  Pccw{x.,y). 

We  let  r[x,  a)  represent  the  ray  rooted  at  x  in 
direction  a.  For  a  vertex  x  of  P,  let  z'*’  be  the  ver¬ 
tex  adjacent  to  x  in  the  clockwise  direction,  and  x~ 
the  vertex  adjacent  in  the  counterclockwise  direc¬ 
tion.  Informally,  the  ray  shot  from  a  point  x  £  P  in 
direction  a  consists  of  “shooting”  a  “bullet”  from  x 
in  direction  a  which  travels  until  it  hits  a  point  of  P. 
The  first  point  where  this  shot  hits  P  is  called  the 
hit  point  of  the  ray  shot.  The  line  segment  from  a 
shooting  point  x  to  its  hit  point  x'  is  called  a  chord 
of  the  polygon.  A  weakly-visible  chord  is  a  chord 
that  is  weakly  visible  from  the  rest  of  the  polygon. 

A  weakly-visible  segment  is  simply  any  line  segment 
in  PU  int[P)  that  is  weakly  visible  from  the  rest  of 
the  polygon. 


Figure  1:  A  clockwise  component  and  its  C-polygon 

Each  reflex  vertex  v  defines  two  special  ray  shots 
as  follows.  If  a  is  the  direction  from  v~  to  v,  we  let 
Pcwi'ii)  —  r(u,  a)  represent  the  clockwise  ray  shot 
from  V.  If  v'  is  the  hit  point  of  the  clockwise  ray 
shot,  then  the  subchain  Pcw(v,v')  is  the  clockwise 
component  of  v  (see  Fig.  1).  Counterclockwise  ray 
shots  and  components  are  defined  in  the  same  way. 
A  component  is  redundant  if  it  is  a  superset  of  an¬ 
other  component. 

The  clockwise  component  of  v  also  defines  a  sub¬ 
polygon  called  the  clockwise  C-polygon  of  v,  which 
is  the  subpolygon  of  P  bounded  by  the  polygonal 
chain  Pcw(v,v')  and  the  chord  vv' .  The  clock¬ 
wise  C-polygon  of  v  is  shown  as  a  shaded  region 
in  Fig.  1.  The  counterclockwise  C-polygons  are  de¬ 
fined  in  a  similar  fashion.  We  define  the  envelope  of 
a  C-polygon  to  be  the  boundary  (in  the  interior  of 
P)  of  the  C-polygon  of  v.  Note  that  the  envelope 
of  a  C-polygon  is  simply  the  straight-line  segment 
(chord)  consisting  of  the  ray  shot  from  v.  Later,  we 
will  refer  to  the  envelope  of  an  intersection  of  a  set 
of  C-polygons,  which  is  the  boundary  (inside  of  P) 
of  the  intersection  of  the  set  of  C-polygons. 

3  Preliminaries 

In  this  section  we  describe  some  of  the  geometric 
properties  of  a  weakly-visible  line  segment. 

It  was  noted  in  [IK]  that  the  family  of  non- 
redundant  components  completely  determines  LR- 
visibility  of  P,  since  a  pair  of  points  s  and  t  ad¬ 
mits  LR-visibility  if  and  only  if  each  non-redundanl 
component  of  P  contains  either  s  or  t.  A  similar 
result  from  [DHN2]  states  that  the  family  of  non- 
redundant  components  also  determines  all  weakly- 
visible  chords,  since  a  chord  st  is  a  weakly-visible 
chord  if  and  only  if  each  non-redundant  component 
of  P  contains  either  s  or  i.  Lemma  1  below  shows 
that  the  family  of  non-redundant  components  also 
determines  the  family  of  weakly- visible  segments, 
while  lemma  2  describes  another  property  satisfied 
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by  all  weakly-visible  segments  in  P. 

Lemma  3.1  A  line  segment  uv  -  I  is  weakly  visible 
from  P  iff  the  segment  I  intersects  the  C-polygon 
corresponding  to  every  non-redundant  component  of 

P. 

Lemma  3.2  If  a  line  segment  uv  =  I  is  weakly  vis¬ 
ible  from  P,  then  the  chord  I',  which  is  obtained  by 
extending  I  in  both  directions  until  it  hits  P,  is  a 
weakly-visible  chord,  and  the  endpoints  of  I  form  a 
LR-visible  pair  of  points  with  respect  to  P. 

The  obvious  implication  of  Lemma  3.2  is  that  a 
polygon  has  at  least  one  weakly-visible  chord  iff  it 
has  at  least  one  weakly-visible  segment  and  conse¬ 
quently  a  shortest  weakly-visible  segment. 

Before  giving  an  overview  of  the  algorithm,  we 
describe  the  peculiar  output  of  the  0(n)-time  algo¬ 
rithm  for  computing  all  weakly-visible  chords  of  a 
polygon  [DHN2]  (the  chords  algorithm,  in  short), 
since  this  algorithm  is  used  by  our  scheme.  The 
chords  algorithm  generates  k  =  0(n)  pairs  of  the 
form  (Ai,  Bi).  Note  that  for  the  rest  of  the  paper  k 
will  be  used  to  denote  the  number  of  pairs  output  by 
the  chords  algorithm.  Here  Ai  is  a  line  segment  on 
P  that  is  described  in  terms  of  a  parameter  x  (for 
example,  2:  =  0  (1)  refers  to  the  left  (resp.  right) 
endpoint  of  Ai).  Bi  is  a  polygonal  chain  on  P  such 
that  every  line  segment  joining  a  point  on  Ai  and 
a  point  on  Bi  forms  a  weakly-visible  chord.  How¬ 
ever,  Bi  has  endpoints  that  are  linear  functions  of 
the  parameter  x.  It  may  also  be  noted  that  each  of 
the  .4tS  are  disjoint  line  segments,  while  the  BiS  are 
possibly  overlapping  polygonal  chains. 

4  Overview  of  algorithm 

The  first  step  of  our  algorithm  is  to  run  the  chords 
algorithm.  If  the  polygon  has  no  weakly-visible 
chords,  then  the  algorithm  stops  and  declares  that 
there  are  no  weakly-visible  segments  either. 

Note  that  for  the  rest  of  the  paper  all  refer¬ 
ences  to  the  term  components  are  references  to  non- 
redundant  components.  For  every  Ai  output  by  the 
chords  algorithm,  let  SAi  be  the  set  of  components 
that  contain  Ai,  and  let  CAi  be  the  intersection  of 
the  C-polygons  corresponding  to  the  components  in 
Sv4i.  Let  the  envelope  of  CAi  be  denoted  by  Oi.  Let 
SBi  be  the  set  of  all  components  not  in  SAi,  ^nd 
let  CBi  be  the  intersection  of  the  C-polygons  of  all 
the  components  in  SBi.  Let  Pi  denote  the  envelope 
oiCBi. 


Figure  2: 

It  is  clear  that  any  line  segment  inside  P  that 
touches  both  Oi  and  Pi  must  intersect  every  C- 
polygon  and  by  lemma  3.1  must  be  a  weakly-visible 
segment.  However,  the  converse  is  not  so  obvi¬ 
ous.  Lemma  4.1  below,  which  implies  the  con¬ 
verse,  proves  that  the  shortest  weakly-visible  seg¬ 
ment  must  join  a  point  on  cti  and  a  point  on  Pi,  for 
some  i.  This  vital  property  is  necessary  to  make  our 
algorithm  work  in  linear  time.  Note  that  SAi  cor¬ 
responds  to  a  subsequence  of  the  sorted  sequence  of 
components.  What  lemma  4.1  proves  is  that  only 
such  subseguences  (and  not  an  arbitrary  subset)  of 
non-redundant  components  need  to  be  considered 
for  computing  the  shortest  weakly-visible  segment. 

Lemma  4.1  If  It  =  I  is  the  shortest  weakly-visible 
segment,  then  s  must  lie  on  oti  and  t  must  he  on  Pi, 
for  some  i  =  1, .  .  . ,  fc. 

Proof: 

Extend  the  line  segment  I  =  st  until  it  hits  P  at 
points  p  and  q  as  shown  in  Fig.  2.  By  Lemma  3.1, 
segment  I  must  intersect  every  C-polygon.  This  im¬ 
plies  that  for  every  C-polygon  C,  one  of  the  tvm  line 
segments  ^  and  qt  must  lie  completely  in  C.  Also, 
if  the  envelope  of  C  intersects  I,  it  must  include 
precisely  one  of  the  two  segments  ps  and  qi. 

By  the  minimality  of  I,  it  is  clear  that  the  point 
s  (t)  lies  on  the  envelope  of  one  or  two  C-polygons. 
If  s  (resp.  t)  lies  on  the  envelope  of  some  C-polygon 
C,  (resp.  Ct),  then  since  the  envelope  of  C,  (resp. 
Ct)  intersects  I',  Cj  (resp.  Ct)  includes  the  point  p 
(resp.  q),  and  does  not  include  q  (resp.  p). 

Let  I'  =  pq.  By  Lemma  3.2,  /'  is  a  weakly-visible 
chord.  Consider  the  output  of  the  chords  algorithm. 
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Let  p  G  Ai.  By  the  correctness  of  the  chords  al¬ 
gorithm,  q  G  Bi.  Note  that  C,  must  include  all  of 
Ai  because  of  the  way  the  chords  algorithm  works. 
Thus,  C,  must  be  a  C-polygon  that  determines  a,-, 
and  Ct  must  be  a  C-polygon  that  determines  /3,-. 
Hence  s  must  lie  on  a,-  and  t  must  lie  on  /3i  thus 
concluding  the  proof.  □ 

The  above  lemma  suggests  the  following  skeleton 
for  our  algorithm.  For  every  i  =  con¬ 

struct  the  envelopes  a,-  and  Pi,  and  then  compute 
the  shortest  line  segment  joining  a  point  on  ai  and 
a  point  on  Pi.  The  shortest  of  these  is  the  shortest 
weakly-visible  segment. 

Note  that  since  both  ai  and  Pi  are  convex  polygo¬ 
nal  chains,  computing  the  shortest  line  segment  con¬ 
necting  them  can  be  computed  in  time  0(|Q,j  +  |/3i|), 
where  ja,'!  and  \Pi  \  are  the  lengths  of  the  two  chains. 
However,  in  general  there  may  be  considerable  over¬ 
lap  between  a,-  and  Oi+i,  as  well  as  between  Pi  and 
Pi+i-  For  the  i-th  iteration,  instead  of  simply  find¬ 
ing  the  shortest  line  segment  that  joins  ai  and  pi, 
the  algorithm  only  scans  the  portion  of  a;  that  is  not 
part  ofc^t+i  (and  the  portion  of  Pi  that  is  not  part  of 
Pi+i),  and  finds  the  shortest  segment  between  those 
parts.  The  assumption  is  that  the  rest  of  the  por¬ 
tions  of  the  two  polygonal  chains  will  be  scanned  as 
part  of  a  later  iteration.  Repetitious  scanning  of  the 
polygonal  chains  is  thus  prevented  by  delaying  the 
scanning  of  overlapping  portions  as  much  as  possi¬ 
ble. 

In  section  5.1,  we  precisely  characterize  how  ai 
changes  to  become  ai+i,  and  correspondingly  how 
Pi  changes  to  become  Pi+\.  In  section  5.2,  we  de¬ 
scribe  a  data  structure  that  stores  a;,  i  =  1, . . .  ,k, 
and  another  identical  structure  that  stores  Pi,i  = 
l,...,k.  The  planarity  of  these  data  structures  is 
sufficient  proof  that  the  size  of  the  union  of  ai  and 
the  size  of  the  union  of  Pi  for  i  =  1, . .  .,k  is  0(n). 

The  problem  with  the  skeleton  algorithm  de¬ 
scribed  above  is  that  the  shortest  line  segment  join¬ 
ing  Oi  and  Pi  for  some  i  may  not  lie  entirely  within 
P.  This  happens  because  even  though  the  line  seg¬ 
ment  when  extended  may  hit  Ai,  it  might  not  hit  Bi 
because  of  obstruction  from  the  rest  of  P,  i.e.,  the 
extended  line  is  not  a  weakly-visible  chord.  In  this 
case,  if  there  is  a  weakly-visible  chord  connecting 
Ai  and  Bi,  then  the  shortest  weakly- visible  segment 
joining  ai  and  Pi  would  touch  a  vertex  of  the  P. 
This  suggests  that  our  algorithm  needs  to  deal  with 
two  main  cases.  The  first  case  is  when  the  short¬ 
est  illuminating  segment  does  not  touch  a  vertex  of 
P  except  at  its  endpoints;  the  second  case  is  when 


it  touches  a  vertex  of  P  in  its  interior.  If  the  first 
case  occurs,  the  algorithm  briefly  described  earlier 
will  output  the  shortest  illuminating  segment.  The 
details  of  this  case  are  described  in  the  next  section. 
The  second  case  is  handled  separately  in  section  6. 
The  algorithm  for  the  second  case  is  a  modification 
of  our  earlier  algorithm  for  computing  all  weakly- 
visible  chords  of  a  polygon  [DHN2].  If  a  weakly- 
visible  segment  touches  a  vertex  of  P  it  is  referred 
to  as  a  non-tang ential  weakly-visible  segment,  oth¬ 
erwise  it  is  referred  to  as  a  tangential  weakly-visible 
segment. 

By  putting  all  the  pieces  together,  we  show  a 
linear-time  algorithm  to  obtain  the  shortest  non- 
tangential  weakly-visible  segment,  and  a  linear¬ 
time  algorithm  to  compute  the  shortest  tangential 
weakly-visible  segment.  The  shortest  of  the  two 
segments  is  the  shortest  weakly-visible  segment  in 
a  polygon,  thus  giving  us  the  desired  algorithm. 

5  Case  1:  Non-tangential 

weakly-visible  segment 

As  mentioned  earlier,  this  ca.se  corresponds  to  the 
situation  when  the  shortest  weakly-visible  segment 
does  not  touch  any  vertex  of  the  polygon  except  pos¬ 
sibly  at  its  endpoints.  For  each  i  =  1,...,A:,  let 
SNi  be  the  shortest  non-tangential  weakly- visible 
segment  that  joins  ai  and  Pi  with  at  least  one  end¬ 
point  on  ai  —  Qi  +  i  or  Pi  —  Pi  +  i.  It  can  be  shown 
that  the  shortest  of  the  segments  SNi ,  i  =  1, . . . ,  k 
must  be  the  shortest  non-tangential  weakly-visible 
segment  that  joins  ai  and  Pi,  i  =  1, . .  . ,  k. 

5.1  Structure  of  a,  and  Pi 

As  mentioned  earlier  both  ai  and  Pi  are  the  bound¬ 
aries  (internal  to  P)  of  the  intersection  of  a  set  of 
C-polygons.  Hence  it  is  clear  that  both  of  them  are 
convex  polygonal  chains.  It  may  be  possible  that 
a,'  =  ai,  =  ai„  =  ■■■  —  This  simply  means 

that  no  component  starts  or  ends  on  the  portion  of 
P  covered  by  Ai,,  A,-^, .  .  . , 

We  now  describe  the  stnictura!  differences  be¬ 
tween  ai  and  +  i  (in  ca.se  they  do  differ),  and  the 
corresponding  differences  between  Pi  and  Pi  +  i.  The 
main  purpose  of  studying  this  structure  is  to  iden¬ 
tify  the  polygonal  chains  a,  —  Qi  +  i  and  /?,  —  P,  +  i 
so  that  they  can  be  processed  in  the  i-th  iteration. 
Clearly,  if  ai  =  Qi  +  i,  then  no  processing  is  required 
in  iteration  i. 

Assume  that  a;  yf  Qi+i-  From  [DHN2j  we  know 
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that  Ai  and  A'+i  are  disjoint  line  segments.  There 
are  various  events  that  trigger  the  chords  algorithm 
to  go  from  iteration  i  to  iteration  i  +  1,  thus  out- 
putting  pairs  and  pairs  Bi+i).  An 

event  occurs  if  a  component  starts  or  ends  between 
Ai  or  Ai+i.  The  other  possible  events  have  to 
do  with  changes  in  the  points  of  tangency  for  the 
boundaries  of  the  weakly-visible  chords.  The  chain 
ai  is  different  from  Qi+i  only  when  a  component 
starts  or  ends  between  A,-  and  Aj+i.  For  the  next 
three  paragraphs  we  will  assume  that  the  counter¬ 
clockwise  end  for  any  polygonal  chain  is  the  front 
end,  while  the  clockwise  end  is  the  tail  end. 

If  a  component  c  starts  between  Ai  and  the 

changes  from  a;  to  are  as  shown  in  Fig.  3(b). 
Note  that  the  C-polygon  corresponding  to  compo¬ 
nent  c  lies  to  the  left  of  the  line  and  that  the  compo¬ 
nent  c  consists  of  the  polygonal  chain  PcwiP2,<l2)- 
Ai  lies  to  the  right  of  P2,  while  Ai+i  lies  to  the  left 
of  p2-  Oi  consists  of  the  chain  from  at  to  S2  to  aj, 
while  Oj+i  consists  of  the  chain  from  P2  to  S2  to  aj, 
i.e.,  a  portion  of  the  tail  of  ai  gets  replaced  by  a  por¬ 
tion  of  the  ray  shot  corresponding  to  the  component 
c.  At  the  same  time,  as  shown  in  Fig.  3(b)  /3<  has 
a  portion  of  its  front  replaced  by  a  new  polygonal 
chain.  Pi  consists  of  the  chain  from  bt  to  t2  to  q2, 
while  /3i+i  consists  of  the  chain  from  bt  to  t2  to  6/. 
In  other  words,  CAi  shrinks  at  its  tail  end,  and  CBi 
grows  at  its  front  end,  while  both  their  boundaries 
remain  convex.  Note  that  q,-  —  Qj+i  comprises  of 
the  polygonal  chain  from  at  to  52,  while  0i  —  Pi+i 
comprises  of  the  segment  from  92  to  t2. 

Note  that  in  the  Figs.  3(a)  and  (b),  the  region 
CAi  n  CAi+i  (as  well  as  the  region  CBi  n  CBi+i) 
have  been  shown  as  a  filled  region.  The  area  oc¬ 
cupied  by  CAi+i  (but  not  by  CAi)  is  indicated  as 
a  dot-filled  region,  while  the  area  occupied  by  CAi 
and  CBi  is  left  blank. 

By  a  similar  argument,  if  a  component  c  ends 
between  Ai  and  Ai^i,  the  portion  of  the  ray  shot 
corresponding  to  c  at  the  front  (right  end  or  the 
counterclockwise  end)  of  ai  gets  replaced  by  a  new 
polygonal  chain,  causing  CAi  to  grow  in  the  front. 
As  shown  in  figure  2(a),  Pi  ha.s  a  portion  of  its  tail 
(right  end  or  counterclockwise  end)  replaced  by  a 
portion  of  the  ray  shot  corresponding  to  c,  thus  caus¬ 
ing  CBi  to  shrink  at  its  tail  end.  In  this  case  note 
that  ai  —  Qt-i-i  comprises  of  the  segment  from  pi  to 
Si,  while  Pi  —  Pi+i  comprises  of  the  chain  from  bt  to 
ti. 

The  above  description  describes  the  changes  that 
take  place  to  the  a  and  0  chains  while  moving  from 
the  i-th  iteration  to  the  {i  -1-  l)-st  iteration. 


(a) 


<1/  Pi 


(b) 

aj  P2 


CA,  -  CAi^i 


C/ti  +  i  -  CAi 


CA,  n  i 


Figure  3:  Changes  in  the  structure  of  at  and  /?, 
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5.2  Data  structure  for  storing  the  a 
and  (3  chains 

The  union  of  all  the  chains  ai  (/3i)  form  a  tree.  We 
claim  here  without  elaborating  that  both  the  sets 
of  chains  a;  and  Pi  can  be  stored  in  simple  linear¬ 
sized  data  structures  that  are  similar  to  the  persis¬ 
tent  search  trees  of  Sarnak  and  Tarjan  [ST].  The 
data  structure  will  be  detailed  in  a  full  version  of 
this  paper. 

5.3  Computing  SNi 

As  described  in  the  previous  section,  the  algorithm 
goes  through  k  iterations.  In  the  i-th  iteration,  the 
chains  Oj  —  Qi+i  and  Pi  —  Pi+i  are  identified,  and 
the  shortest  segment  that  joins  o:,-  and  Pi  with  one 
endpoint  on  ai  —  Q,-+i  or  Pi  —Pi+i  is  computed.  Let 
this  segment  be  SNi  ~ 

Given  any  two  convex  polygonal  chains  a  and  P, 
there  is  a  simple  sweep  algorithm  to  find  the  shortest 
line  segment  that  joins  the  two  chains.  In  this  case, 
a  and  p  are  two  convex  chains  that  form  part  of  the 
boundary  of  two  disjoint  convex  polygons.  The  al¬ 
gorithm  involves  sweeping  the  two  chains,  one  from 
its  clockwise  end  and  in  counterclockwise  order,  the 
other  from  its  counterclockwise  end  in  clockwise  or¬ 
der.  Informally  speaking,  the  sweep  algorithm  works 
because  of  three  simple  facts:  (1)  for  a  fixed  point 
a  £  a,  its  distance  to  visible  points  b  £  P  is  uni- 
modal,  (2)  as  point  a  moves  monotonically  on  a,  its 
closest  point  on  P  moves  monotonically  on  /3,  (3) 
for  points  a  G  Q,  its  shortest  distance  to  p  (i.e., 
the  distance  to  its  closest  point  on  P)  is  unimodal. 
Intuitively  speaking,  fact  (3)  states  that  the  local 
minimum  is  also  the  global  minimum. 

In  the  i-th  iteration,  it  is  possible  that  the  seg¬ 
ment  spi  may  not  lie  entirely  within  P.  To  identify 
this  situation,  we  exploit  the  fact  that  given  a  point 
X  on  P,  the  chords  algorithm  has  already  identified 
which  directions  from  x  give  rise  to  weakly-visible 
chords.  Hence  to  check  whether  SNi  lies  in  P,  the 
algorithm  computes  the  endpoint  of  the  chord  gen¬ 
erated  when  the  line  segment  is  extended.  Call  this 
point  Pi.  Using  the  output  of  the  chords  algorithm 
our  algorithm  checks  whether  the  chord  in  the  di¬ 
rection  piSi  is  a  weakly-visible  chord.  It  should  be 
pointed  out  that  it  is  possible  that  pi  may  not  lie  on 
Ai,  but  on  some  other  segment  Aj .  To  identify  Aj, 
the  algorithm  traverses  from  Ai  to  Aj  along  P.  It 
is  non-trivial  to  show  that  this  portion  of  P  is  not 
traversed  again  for  this  purpose  (and  hence  does  not 
contradict  the  claimed  0(n)  time  complexity).  The 
intuition  behind  the  claim  is  that  if  pi  lies  on  Aj 


then  SNi  is  also  the  shortest  segment  between  Oj 
and  Pj  as  well  as  between  q;  and  Pi  for  all  values  of 
I  between  i  and  j.  On  the  other  hand,  if  the  chord 
is  not  weakly  visible,  then  the  segment  Sii,  is  ig¬ 
nored,  and  will  be  handled  by  the  second  phase  of 
the  algorithm  (corresponding  to  Case  2), 

Another  subtle  complication  is  introduced  by  the 
possibility  that  SNi  nray  have  one  endpoint  on 
/3i— ySi+i  and  another  endpoint  on  (instead 

of  ai  —  This  could  happen  if  the  sweep  algo¬ 

rithm  (described  at  the  start  of  this  subsection)  for 
finding  the  shortest  line  segment  joining  two  convex 
polygonal  chains  reaches  the  end  of  one  of  the  chains 
without  hitting  a  local  minimum.  For  example,  as¬ 
sume  that  the  end  of  q,-  —  Q,'+i  is  reached  before 
reaching  the  end  of  Pi  —  Pi+i  and  before  a  minimum 
was  encountered.  In  this  case,  our  algorithm  con¬ 
tinues  sweeping  on  Pi  —  Pi.^.i,  while  continuing  the 
sweep  on  Our  algorithm  needs  to  be  mod¬ 

ified  to  ensure  that  this  portion  of  n  Oi  +  i  is  not 
swept  again  in  iteration  f-f  1  (or  later).  In  this  ca.se, 
it  can  be  shown  that  SNj  cannot  have  an  endpoint 
on  this  portion  of  q,-  n  ai  +  i  and  hence  need  not  be 
considered  in  any  later  iteration.  The  relevant  por¬ 
tion  of  ai  n  is  marked  visited  so  that  a  sweep 
in  a  later  iteration  can  skip  over  this  portion  of  the 
chain.  The  entire  arguments  in  this  paragraph  could 
have  been  carried  out  with  a  replaced  by  P  and  vice 
versa. 

Once  a  local  minimum  is  found  for  the  i-th  itera¬ 
tion,  the  algorithm  also  verifies  if  it  is  a  global  min¬ 
imum  for  the  shortest  segment  between  q,  and  Pi. 
If  the  point  on  Oi  —  a.+i  or  Pi  —  A+i  is  not  the  end¬ 
point  of  that  subchain,  then  the  global  minimum  for 
the  shortest  segment  between  q,-  and  /?,■  must  have 
been  reached.  Otherwise,  a  simple  test  can  check 
whether  the  global  minimum  has  been  reached  or 
not.  If  it  is  not  a  global  minimum,  then  SNi  can  be 
ignored  since  the  shortest  segment  between  q,-  and 
Pi  connects  points  that  are  not  on  ai  —  Oi+i  as  well 
as  Pi  —  Pi^i-i.  Since  such  a  segment  would  connect 
a, -4.1  and  Pi.^i  it  will  be  encountered  in  a  later  iter¬ 
ation.  The  algorithm  with  the  minor  modifications 
mentioned  above  is  guaranteed  to  sweep  every  por¬ 
tion  of  the  a  and  P  chains  exactly  once  and  hence 
achieves  the  claimed  linear-time  complexity. 

6  Case  2:  Shortest  tangential 
weakly-visible  segment 

This  case  occurs  when  the  interior  of  the  shortest 
weakly-visible  segment  in  the  polygon  touches  a  ver- 
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tex  of  the' polygon.  However,  in  this  case,  the  corre¬ 
sponding  weakly-visible  chord  obtained  by  extend¬ 
ing  the  segment  is  also  a  tangential  chord,  i.e.,  it 
touches  a  vertex  of  the  polygon  in  its  interior.  The 
crucial  point  to  observe  is  that  these  are  exactly  the 
weakly-visible  chords  that  are  output  by  the  linear- 
time  chords  algorithm  [DHN2].  A  suitable  modifica¬ 
tion  of  the  chords  algorithm  can  output  all  tangen¬ 
tial  weakly-visible  segments,  of  which  the  shortest 
can  be  computed. 


The  chords  algorithm  uses  the  following  strategy. 
It  traverses  along  the  polygon  in  a  counterclockwise 
direction  with  a  point  x.  When  x  is  on  Ai,  the  points 
y{x]  and  z{x)  corresponding  to  the  other  endpoints 
of  the  two  tangential  chords  from  x  are  computed. 
The  points  y{x)  and  z{x)  move  monotonically  on 
P;  so  do  the  points  of  tangency  for  the  tangential 
chords,  namely  s(x)  and  t{x).  As  x  moves  on  Ai, 
there  are  several  possible  events  that  can  take  place, 
which  would  change  the  description  of  the  tangents, 
the  point  y{x)  (or  2;(x))  could  move  to  a  vertex  of 
P;  the  point  s(x)  (or  t{x))  could  move  to  a  vertex 
of  P.  These  events  cause  a  recomputation  of  the 
equations  of  the  tangential  chords  as  a  function  of  x. 
In  [DHN2]  we  show  that  the  number  of  these  events 
are  0(n),  thus  giving  us  a  linear-time  algorithm. 


The  modification  for  computing  the  tangential 
weakly-visible  segments  is  as  follows.  During  iter¬ 
ation  i,  the  chains  a,  and  Pi  are  computed.  When 
the  point  x  is  on  Ai,  the  points  of  intersection  of  the 
tangential  chords  with  a;  and  Pi  are  also  maintained 
(call  them  cii(x),  a2(x),  6i(x),  62(®))-  The  segment 
from  oi(x)  to  6i(x)  and  the  segment  from  02(x) 
to  b2{x)  are  the  two  tangential  weakly-visible  seg¬ 
ments  with  respect  to  x.  The  situation  is  described 
in  Fig.  4.  There  are,  however,  an  additional  number 
of  events  that  could  cause  a  change  in  the  descrip¬ 
tion  of  the  tangential  weakly- visible  segments:  the 
points  oi(x)  or  a2(x)  {bi{x)  or  62(®))  could  move  to 
a  vertex  of  a,-  {Pi).  This  would  cause  additional  re¬ 
computations  of  the  equations  as  well  as  the  lengths 
of  the  tangential  segments.  The  crucial  point  is  that 
in  between  events,  the  length  of  the  tangential  seg¬ 
ments  can  be  computed  in  terms  of  x,  from  which 
the  minimum  can  be  computed  for  that  interval  in 
constant  time.  It  can  also  be  shown  that  the  total 
number  of  events  that  will  be  encountered  is  0{n) 
and  that  oi(x),  6i(x),  a2(x),  62(x)  move  monotoni¬ 
cally  on  the  a  and  P  chains. 


Figure  4:  Case  2;  Determining  tangential  shortest 
weakly-visible  segments 


7  All  minimal  weakly-visible 
segments  algorithm 

One  of  the  by-products  of  our  algorithm  is  a  linear¬ 
time  algorithm  to  generate  all  minimal  weakly- 
visible  segments  of  a  polygon.  This  algorithm  is 
a  modification  of  the  algorithm  described  in  sec¬ 
tion  6  for  computing  the  shortest  tangential  weakly- 
visible  segment.  It  outputs  a  set  of  pairs  {Ui,  Vp,  i  — 
1, .  . . ,  m.  Here  Ui  and  Vi  are  subchains  of  the  polyg¬ 
onal  chains  a  and  P,  m  =  0{n),  and  any  segment 
joining  points  u  ^  Ui  and  i;  G  y  is  s.  minimal 
weakly-visible  segment.  One  note  of  caution  is  that 
Ui  and  Vi  have  left  and  right  endpoints  that  are 
linear  functions  of  a  parameter  x  in  a  spirit  simi¬ 
lar  to  that  of  the  endpoints  of  the  chain  Bi  that 
is  output  by  the  chords  algorithm.  For  a  point 
X  on  Ai,  the  polygonal  chains  ai  and  Pi  can  be 
computed  along  with  the  points  ai(x),a2(x)  G  cci 
and  6i(x),  62(E)  G  Pi-  The  output  of  the  algorithm 
consists  of  {Ui,  Vj)  =  ((^i(e)i  ®2(e))i  (62(2),  6i(x))). 
The  discussion  at  the  end  of  section  6  can  also  be 
used  to  show  that  the  number  of  these  pairs  pro¬ 
duced  is  m.  —  0{ti).  Lemma.  1  can  be  used  to  show 
that  these  segments  are  minimal  in  the  sense  that 
any  subsegment  of  these  segments  is  not  weakly  vis¬ 
ible. 
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8  Conclusion  and  open  prob¬ 
lems 

We  show  optimal  linear-time  algorithms  to  compute 
the  shortest  weakly-visible  segment  and  all  minimal 
weakly-visible  segments  in  a  given  simple  polygon. 
Some  interesting  open  questions  are: 

•  Can  the  exhaustive  sweeping  techniques  from 
this  paper  be  used  to  solve  other  weak  visi¬ 
bility  problems  efficiently?  For  example,  are 
there  linear-time  algorithms  for  the  all-pairs 
version  of  any  of  the  2-guard  walk  problems  (see 
[DHNl])? 

•  Can  the  algorithm  from  this  paper  be  designed 
without  using  the  triangulation  or  the  shortest 
path  algorithm  as  a  subroutine? 

•  Ntafos  [Nt]  introduced  the  notion  of  d~visibility, 
where  an  observer’s  visibility  is  limited  to  dis¬ 
tance  d.  Can  the  shortest  illuminating  segment 
be  computed  efficiently  under  d-visibility? 
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Abstract 

In  this  paper  we  investigate  the  problem  of  morphing 
(i.e.  continuously  deforming)  one  simple  polygon  into 
another.  We  assume  that  our  two  initial  polygons  have 
the  same  number  of  sides  n,  and  that  corresponding 
sides  are  parallel.  We  show  that  a  morph  is  always 
possible  by  a  varying  simple  interpolating  polygon  also 
of  n  sides  parallel  to  those  of  the  two  original  ones.  If 
we  consider  a  uniform  scaling  or  translation  of  part  of 
the  polygon  as  an  atomic  morphing  step,  then  we  show 
that  such  steps  are  sufficient  for  the  morph. 

1  Introduction 

In  computer  graphics  a  recent  area  of  interest  has  been 
morphing,  i.e.  continuously  transforming  one  shape  into 
another.  Morphing  algorithms  have  numerous  uses  in 
shape  interpolation,  animation,  video  compression,  as 
well  as  obvious  entertainment  value.  Some  of  the  rel¬ 
evant  graphics  papers  are  [KR,  SG,  SGWM].  In  gen¬ 
eral  there  are  numerous  ways  to  interpolate  between 
two  shapes  and  little  has  been  said  about  criteria  for 
comparing  the  quality  of  different  morphs  and  notions 
of  optimality.  A  common  problem  with  many  morph¬ 
ing  algorithms  is  that  although  locally  the  geometry  of 
the  interpolated  shape  remains  faithful  to  the  original 
shapes,  global  properties  such  as  symmetry,  simplicity 
(in  the  topological  sense),  etc.,  can  be  easily  violated. 

In  fact,  the  very  existence  of  morphs  that  can  maintain 
high-level  global  properties  of  the  interpolated  shapes  is 
often  far  from  obvious. 

In  this  paper  we  address  a  morphing  problem  raised 
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by  John  F.  Hughes  of  Brown  University  in  the  Pixels- 
and-Predicates  Workshop  organized  by  David  Salesin 
and  Jack  Snoeyink  at  the  University  of  British  Columbia 
last  spring.  The  problem  is  set  in  two  dimensions;  we 
are  given  two  simple  polygons  P  and  Q.  Assume  that 
both  P  and  Q  are  counterclockwise  oriented.  The  poly¬ 
gons  P  and  Q  are  parallel,  where  this  is  taken  to  mean 
that  P  and  Q  have  the  same  number  of  sides  and  their 
sides  can  be  put  in  one  to  one  correspondence  so  that 
corresponding  sides  are  parallel  and  oriented  the  same 
way.  This  is  equivalent  to  saying  that  that  the  two  poly¬ 
gons  have  the  same  sequence  of  angles.  The  morphing 
problem  is  to  interpolate  between  P  and  Q  via  a  con¬ 
tinuously  changing  polygon  R{t),  where  t  denotes  time. 
We  set  i?(0)  =  P  and  R{1)  =  Q.  We  require  that  R{t) 
remain  parallel  to  P  and  Q  at  all  times,  and  that  it  also 
remain  simple.  See  Figure  1  below. 


P  Q 

Figure  1 :  Two  parallel  polygons 

We  provide  a  discrete  solution  to  Hughes’s  problem. 
We  take  as  our  basic  operation  the  uniform  scaling 
and/or  translation  of  a  contiguous  part  of  a  polygon 
parallel  to  itself.  We  call  such  a  step  a  parallel  move  of 
the  corresponding  polygonal  chain.  The  affected  edges 
in  a  parallel  move  expand  or  contract  so  as  to  keep  the 
two  endpoints  of  the  chain  on  the  lines  supporting  the 
two  adjacent  edges  to  the  chain.  Also,  no  parallel  move 
is  allowed  that  would  cause  any  moving  edge  or  vertex 
to  pass  over  another  vertex  or  edge  of  the  polygon.  It  is 
clear  that  any  number  of  parallel  moves  keep  a  simple 
polygon  parallel  to  itself  and  simple.  Our  main  result  is 
that,  if  n  denotes  the  number  of  sides  of  P  and  Q,  then 
0(n^U+£)  parallel  moves  always  suffice  to  morph  P  to 
Q.  We  have  an  algorithm  of  the  same  complexity  that 


computes  such  a  sequence  of  moves. 

The  problem  of  constructing  a  simple  polygon  given 
a  sequence  of  angles  has  a  substantial  history  in  com¬ 
putational  geometry.  Culberson  and  Rawlins  [CR]  first 
raised  this  question  and  gave  an  algorithm  that  con¬ 
structs  such  a  polygon  in  time  0{nD),  where  the  as¬ 
sumption  is  that  all  angles  are  rational  multiples  of  tt 
by  a  fraction  whose  denominator  is  bounded  by  D.  Vi- 
jayan  and  Wigderson  improved  this  result  by  construct¬ 
ing  such  a  polygon  in  linear  time  with  no  restriction 
on  the  angles  being  rational  (reported  in  Vijayan  [V]). 
The  result  of  the  present  paper  paper  shows  that  all  sim¬ 
ple  polygons  realizing  the  same  sequence  of  angles  are 
homotopic  to  each  other  within  this  class  of  polygons. 
This  in  itself  is  an  interesting  result  which,  according 
to  Hughes,  had  been  in  the  folklore  but  never  formally 
proven.  Some  of  the  trickiness  involved  can  be  seen  in 
the  two  polygons  of  Figure  2.  These  are  oppositely  spi¬ 
raling  polygons,  but  they  are  still  parallel  and  therefore 
morphable. 


Figure  2:  These  oppositely  spiraling  polygons  are  still 
morphable 

The  main  idea  of  this  paper  is  to  use  parallel  moves 
to  transform  each  polygon  to  a  reduced  form  in  which 
the  side  lengths  are  of  widely  different  scales.  Once 
both  polygons  are  in  reduced  form,  we  can  use  a  se¬ 
quence  of  elementary  local  transformations  to  morph 
one  reduced  form  to  the  other.  In  the  reduced  form, 
small-scale  structures  do  not  interfere  with  larger-scale 
parallel  moves — when  a  large-scale  edge  translates,  any 
smaller-scale  structures  adjacent  to  its  endpoints  trans¬ 
late  along  as  if  they  were  part  of  the  endpoints.  We 
establish  a  correspondence  between  these  transforma¬ 
tions  in  reduced  polygons  and  (classical)  rotations  in 
binary  trees.  The  catch,  however,  is  that  our  trees  have 
weights  on  the  leaves  and  induced  weights  on  the  in¬ 
ternal  nodes.  These  weights  correspond  roughly  to  the 
polygon  angles.  Our  mapping  from  polygons  to  trees  is 
such  that  all  weights  of  leaves  or  internal  nodes  must  be 
in  the  range  (— 1,-t-l).  Although  the  problem  of  using 
single  rotations  to  transform  one  binary  tree  to  another 
is  well-studied  and  is  known  to  be  doable  in  a  linear 
number  of  rotations,  these  methods  are  not  applicable 
to  our  situation,  as  not  all  rotations  maintain  the  weight 
condition  on  the  trees.  The  combinatorial  essence  of  our 


morphing  algorithm  becomes  then  a  theorem  about  ro¬ 
tating  one  weighted  binary  tree  into  another  while  sat¬ 
isfying  the  weight  conditions.  This  in  itself  is  quite  an 
interesting  combinatorial  problem. 

We  note  that  for  rectilinear  polygons  the  problem  is 
easier,  and  a  solution  is  already  known  [T].  We  also 
remark  that  our  transformation  to  a  reduced  form  is 
unattractive  from  a  practical  point  of  view,  as  it  may 
change  wildly  the  scale  of  different  features  of  the  orig¬ 
inal  polygons.  We  discuss  at  the  end  of  the  paper  some 
concrete  and  interesting  algorithmic  issues  that  arise  in 
keeping  small  the  overall  distortion  of  the  original  poly¬ 
gons  during  the  morph  process. 

2  The  reduced  form  of  a  polygon 

Our  morphing  algorithm  works  by  transforming  each  of 
the  two  polygons  P  and  Q  into  a  reduced  form.  We 
first  transform  P  to  its  reduced  form,  then  morph  the 
reduced  form  of  P  to  that  of  Q,  and  then  undo  the 
transformations  that  reduced  Q.  We  will  be  able  to 
map  the  morphing  of  the  reduced  forms  to  the  purely 
combinatorial  tree  rotating  problem  mentioned  above. 

The  reduced  form  of  polygon  P  is  a  polygon  parallel  to 
P  with  a  particular  hierarchical  structure.  The  reduced 
form  of  P  can  be  obtained  from  P  by  0{n)  elementary 
parallel  moves.  The  reduced  form  is  not  unique,  but 
instead  depends  on  the  order  in  which  the  parallel  moves 
are  performed. 


Figure  3:  The  two  kinds  of  parallel  moves 

We  allow  two  types  of  parallel  moves,  defined  as  fol¬ 
lows:  (1)  Suppose  there  exists  a  circle  that  intersects 
exactly  two  edges  of  the  polygon.  Then  we  may  shrink 
or  expand  the  structure  inside  the  circle  uniformly,  so 
long  as  the  intersections  of  the  polygon  with  the  cir¬ 
cle  do  not  change.  (2)  Suppose  there  exist  two  disjoint 
circles,  each  intersecting  an  edge  e  of  the  polygon  and 
one  other  edge.  Then  we  may  translate  e  and  move  the 
two  circles  and  their  contents  uniformly  along  with  e, 
such  that  each  circle’s  intersections  with  the  polygon 
are  stationary  in  the  reference  frame  of  the  circle.  In 
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both  cases  we  enforce  the  condition  that  the  polygon 
always  intersects  each  circle  in  exactly  two  fixed  places. 
See  Figure  3. 

The  reduced  form  of  P  is  based  on  a  process  of  elim¬ 
inating  P's  edges  one  by  one.  Consider  an  edge  e  of 
polygon  P  whose  endpoints  u  and  v  are  shared  with 
adjacent  edges  e„  and  6^.  If  we  translate  e  parallel  to 
itself  while  keeping  the  rest  of  the  polygon  fixed,  u  and 
V  move  along  the  lines  supporting  e„  and  e„,  length¬ 
ening  or  shortening  those  edges  as  well  as  e  itself.  We 
can  translate  e  so  that  at  least  one  of  e,  e„,  and  gets 
shorter.  One  of  the  edges  will  eventually  be  reduced  to 
zero  length,  if  e  does  not  hit  another  part  of  P  first.  See 
Figure  4. 

U 


Figure  4:  Translating  an  edge 

We  show  in  Lemma  1  below  that  every  polygon  can 
be  morphed  with  0(1)  edge  translations  (amortized)  so 
that  at  least  one  edge  is  reduced  to  zero  length,  while 
keeping  the  polygon  simple.  In  this  case  we  can  it¬ 
eratively  reduce  the  polygon  to  a  triangle:  at  each  of 
n  —  3  steps,  we  eliminate  one  edge  of  P  by  translation. 
For  certain  degenerate  polygons,  it  is  possible  that  the 
elimination  process  will  terminate  with  a  parallelogram 
instead  of  a  triangle.  This  case  does  not  pose  any  addi¬ 
tional  difficulties  for  our  algorithm,  and  so  we  will  ignore 
this  possibility  from  here  on. 

The  reduced  form  of  a  polygon  models  the  edge  elim¬ 
ination  process  in  its  structure.  To  compute  it,  we  run 
the  edge  elimination  process  as  above,  except  that  in¬ 
stead  of  reducing  edges  to  zero  length,  we  reduce  them 
to  infinitesimal  length.  Suppose  that  the  edge  elimina¬ 
tion  process  eliminates  an  edge  e,  coalescing  its  vertices 
u  and  V  to  become  the  common  endpoint  of  adjacent 
edges  Cu  and  e„.  In  the  reduced  form  of  the  polygon, 
vertices  u  and  v  are  kept  separate,  joined  by  an  infinites¬ 
imal  edge  e.  We  imagine  drawing  an  infinitesimal  disk 
that  contains  u  and  v,  centered  on  the  intersection  of 
the  lines  supporting  Bu  and  Ci,.  (If  the  supporting  lines 
are  parallel,  we  drop  the  centering  requirement.)  In  the 
remainder  of  the  elimination  process,  the  disk  translates 
along  with  the  intersection  of  the  supporting  lines,  act¬ 
ing  like  a  finite-size  vertex.  Because  the  disk  and  its 
contents  are  infinitesimal,  we  are  sure  that  no  trans¬ 
lating  edge  touches  the  disk  unless  it  also  touches  the 
corresponding  vertex  in  the  original  edge  elimination 
process.  The  infinitesimal  edge  is  hidden  inside  the  disk 


and  does  not  participate  directly  in  the  remainder  of  the 
elimination  process.  See  Figure  5. 


Figure  5:  A  reduced  form 

When  the  edge  elimination  process  terminates,  the 
reduced  form  of  the  polygon  consists  of  a  triangle  with 
some  microstructure  at  its  vertices.  Each  vertex  is  con¬ 
tained  in  a  disk,  and  the  three  disks  are  disjoint.  If  we 
look  inside  a  disk,  we  see  two  infinite  edges — the  ones 
that  cross  the  disk  boundary — and  a  single  finite  edge 
joining  them.  The  vertices  where  the  finite  edge  joins 
the  infinite  edges  have  microstructure  themselves:  they 
are  contained  inside  smaller  disks  that  are  disjoint  from 
each  other  and  completely  inside  the  surrounding  disk; 
each  smaller  disk  contains  recursively  similar  structures. 

If  we  ignore  the  disks  of  the  reduced  form  and  look 
only  at  the  edges,  we  see  that  the  reduced  form  of  a 
polygon  P  is  a  special  hierarchical  morph  of  P:  the 
reduced  form  is  parallel  to  P,  and  it  is  obtained  from  P 
by  parallel  moves  that  preserve  simplicity. 

The  following  lemma,  due  to  Emo  Welzl,  shows  that 
the  edge  elimination  process  applies  to  any  polygon. 
Combining  this  with  the  discussion  above,  we  can  com¬ 
pute  a  reduced  form  for  any  polygon. 

Lemma  1  ([W]).  Given  a  simple  polygon  P  with  n 
vertices,  we  can  reduce  it  to  a  triangle  by  a  sequence 
of  0(n)  edge  translations,  each  of  which  preserves  sim¬ 
plicity  and  n  -  3  of  which  shorten  some  edge  to  zero 
length. 

Proof:  The  algorithm  repeatedly  translates  edges  to¬ 
ward  the  interior  of  the  polygon.  It  maintains  a  set  of 
frozen  edges  that  have  already  been  translated  and  may 
not  be  translated  again.  It  works  in  a  subpolygon  of  P, 
and  maintains  the  invariant  that  at  most  two  (adjacent) 
edges  of  the  current  subpolygon  are  frozen. 

The  generic  step  of  the  algorithm  picks  an  arbitrary 
unfrozen  edge  e  of  the  current  polygon  (initially  P)  and 
translates  it  toward  the  interior  of  P.  As  the  edge  trans¬ 
lates  parallel  to  itself,  its  endpoints  move  along  the  lines 
supporting  the  two  edges  adjacent  to  e,  and  those  two 
edges  shorten  or  lengthen.  The  translation  stops  when 
one  of  two  things  happens:  (1)  e  or  one  of  its  two  neigh¬ 
boring  edges  is  shortened  to  zero  length,  or  (2)  e  collides 
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with  some  vertex  v  of  P,  or  a  translating  endpoint  v  of 
e  collides  with  some  other  edge  of  P.  See  Figure  6a. 


Figure  6:  Eliminating  edges  to  find  a  reduced  form 

In  case  (1)  we  have  eliminated  an  edge.  In  case  (2), 
the  collision  between  an  edge  and  the  vertex  v  has  parti¬ 
tioned  the  polygon  in  two,  and  we  recursively  apply  the 
same  procedure  on  one  of  the  two  pieces.  Specifically, 
we  translate  e  away  from  the  polygon  interior  slightly, 
so  the  polygon  is  not  completely  partitioned  by  the  colli¬ 
sion  at  V  (the  amount  to  back  away  is  upper-bounded  by 
a  constant  that  can  be  computed  from  the  geometry  of 
the  polygon),  and  freeze  all  three  edges  participating  in 
the  collision:  the  two  edges  incident  to  v,  and  the  edge  v 
hit.  Because  at  most  two  adjacent  edges  of  the  current 
polygon  were  frozen,  the  same  is  true  of  at  least  one  of 
the  two  subpolygons  created  by  the  near-partition  at  v. 
We  recurse  on  the  subpolygon  with  fewer  frozen  edges. 
Each  of  the  subpolygons  is  at  least  a  triangle,  so  each 
also  has  fewer  edges  than  the  current  polygon. 

The  recursion  bottoms  out,  in  the  worst  case,  when 
the  current  polygon  is  a  triangle  with  two  frozen  edges 
(Figure  6b).  Translating  the  third  edge  inward  elimi¬ 
nates  one  of  the  frozen  edges.  When  a  frozen  edge  is 
eliminated,  it  contracts  into  the  vertex  v  that  caused 
its  freezing.  We  unfreeze  the  other  two  edges  that  were 
frozen  because  of  v.  This  restores  the  invariant:  the 
collision  at  v  originally  created  two  subpolygons  Pi  and 
P2,  each  with  at  least  three  edges,  with  one  edge  belong¬ 
ing  to  both  Pi  and  P2.  The  edge  elimination  reduces 
the  total  number  of  edges  in  Pi  and  P2  by  one,  to  a 
minimum  of  four.  The  new  subpolygon  is  the  union  of 
Pi  and  P2;  it  has  at  least  four  edges,  at  most  two  of 
which  are  frozen. 

The  size  of  the  current  subpolygon  decreases  at  each 
recursive  call,  and  the  algorithm  eliminates  an  edge 
when  the  current  subpolygon  is  a  triangle.  Thus  the 
algorithm  successfully  eliminates  all  the  edges  of  P  (or 
reduces  P  to  a  triangle,  if  we  stop  just  before  the  last 
elimination) .  Edges  are  unfrozen  only  when  we  move  up 
one  recursive  level.  Each  unfreezing  can  be  charged  to 
the  edge  elimination  that  occurred  just  before  it.  The 
total  number  of  edge  translations  is  bounded  by  n  plus 
the  number  of  unfreezings  (at  most  2n)  and  hence  is 
0(n).  □ 

The  preceding  algorithm  is  less  than  ideal,  because  it 
uses  more  than  n  —  3  edge  translations  to  reduce  P  to  a 


triangle.  We  conjecture  that  there  is  always  some  edge 
that  can  be  translated  to  eliminate  itself  or  one  of  its 
neighbors  without  violating  simplicity.  This  would  give 
the  ideal  algorithm  we  want,  but  we  have  so  far  been 
unable  to  prove  the  conjecture. 

The  key  feature  of  the  reduced  form  is  the  hierarchy 
of  edge  eliminations,  as  the  following  lemma  shows. 

Lemma  2.  If  two  reduced  forms  are  parallel  polygons 
and  have  the  same  hierarchical  structure,  then  one  can 
be  morphed  to  the  other  using  a  linear  number  of  edge 
translations. 

Proof:  Let  P  and  Q  be  the  two  reduced  forms.  The 
algorithm  follows  the  hierarchy  of  the  reduced  form.  We 
repeatedly  translate  edges  of  P  to  be  collinear  with  the 
corresponding  edges  of  Q.  When  we  are  done,  the  two 
polygons  are  identical.  The  first  step  translates  the  top- 
level  triangle  edges  of  P  to  be  collinear  with  those  of  Q, 
moving  the  microstructure  disks  along  with  the  inter¬ 
sections  of  the  edges.  Within  each  microstructure  disk, 
we  translate  the  finite  edge  of  P  to  be  collinear  with 
the  corresponding  edge  of  Q.  (The  subdisks  translate 
along  with  the  intersection  points  of  the  finite  and  infi¬ 
nite  edges.)  We  then  recursively  make  the  two  polygons 
identical  inside  each  subdisk.  □ 

3  The  tree  representation  of  a 
reduced  form 

The  hierarchical  nature  of  the  reduced  form  can  be  nat¬ 
urally  represented  by  a  tree  structure.  The  infinitesimal 
disks  of  the  reduced  form  correspond  to  nodes  in  a  bi¬ 
nary  tree;  a  disk  and  the  two  subdisks  it  contains  corre¬ 
spond  to  a  node  and  its  children.  Each  of  the  three  top- 
level  disks  of  the  reduced  form  corresponds  to  a  tree. 
Logically  these  three  trees  should  be  joined  in  a  ring; 
however,  to  maintain  the  convenience  of  a  consistent 
binary  tree  representation,  we  omit  the  ring  edges  and 
work  with  the  trees  separately.  Each  tree  represents  the 
reduced  form  of  a  simple  bi-infinite  chain  (one  whose 
initial  and  final  edges  are  rays) . 

In  this  section  and  Section  4  we  work  only  with  chains 
and  their  corresponding  trees;  we  show  how  to  morph 
two  parallel  chains  to  each  other.  In  Section  5  we  ad¬ 
dress  the  original  problem  of  morphing  polygons. 

Each  node  of  a  tree  has  an  angular  weight  associated 
with  it,  namely  the  turn  angle  at  the  corresponding  disk, 
i.e.,  the  difference  between  the  direction  angles  of  the 
incoming  and  outgoing  edges  of  the  disk  (we  assume  the 
polygon  is  oriented  counterclockwise).  For  convenience 
we  normalize  the  weight  of  each  vertex  of  P  by  dividing 
it  by  TT,  so  the  weight  is  in  the  range  (-1,  -fl);  the  sum 
of  the  weights  over  all  the  vertices  is  therefore  2.  The 
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weight  at  each  disk  of  the  reduced  form  is  the  sum  of 
the  weights  of  all  the  polygon  vertices  inside  it. 

Because  the  edges  incident  to  each  vertex  of  P  and 
each  disk  of  the  reduced  form  make  an  angle  strictly 
between  — tt  and  tt,  the  weight  of  each  node  in  the  bi¬ 
nary  tree  is  in  the  open  range  (-1,-|-1).  We  say  that 
a  node  whose  weight  is  in  this  range  is  valid.  If  all  the 
weights  in  a  tree  are  valid,  we  say  the  whole  tree  is  valid. 
The  binary  tree  associated  with  the  reduced  form  of  a 
bi-infinite  chain  is  always  valid.  The  following  lemma 
shows  that  the  converse  is  also  true. 

Lemma  3.  Any  valid  tree  corresponds  to  the  reduced 
form  of  some  bi-infinite  chain. 

Proof:  The  proof  is  by  recursive  expansion.  Let  r  be 
the  root  of  the  tree.  The  weight  of  r  corresponds  to  the 
total  turn  of  the  chain.  We  draw  the  infinite  chain  edges 
and  draw  a  circle  C  centered  on  the  intersection  of  their 
supporting  lines.  Let  a  and  f3  be  the  weights  of  the  chil¬ 
dren  of  r.  Within  the  circle  we  draw  an  edge  that  makes 
angles  of  a  and  f3  with  the  two  infinite  edges,  and  two 
smaller  circles  inside  C  centered  on  the  endpoints  of  the 
new  edge.  Applying  this  process  recursively  with  each 
subtree  and  its  corresponding  circle  produces  a  reduced 
form  for  the  tree.  (At  each  step  the  drawing  inside  the 
innermost  circle  is  provisional,  to  be  replaced  when  the 
angle  inside  the  circle  is  expanded.)  Q 

In  the  remainder  of  this  section  we  reduce  the  problem 
of  morphing  between  two  parallel  bi-infinite  chains  to  a 
problem  of  transforming  one  valid  tree  into  another  by 
rotations.  Because  we  want  the  tree  transformation  to 
correspond  to  polygon  morphing,  we  insist  that  the  tree 
be  valid  at  all  times.  In  particular,  the  transformation 
may  use  only  valid  rotations,  ones  whose  initial  and  final 
states  are  both  valid.  For  example,  in  Figure  7,  if  A,  B, 
and  C  represent  node  weights,  it  must  be  the  case  that 
each  of  A,  B,C,  A-i-  B,  B  +  C,  and  A-h  B  +  C  lies  in 
the  range  (—1, 1). 


Figure  7:  A  tree  rotation  and  the  corresponding  morph 
of  a  reduced  form 

Let  Ti  and  T2  be  two  valid  trees  with  the  same  se¬ 
quence  of  weights  at  the  leaves.  Section  4  shows  how  to 


transform  Ti  to  T2  using  only  valid  rotations.  We  now 
show  how  to  interpret  a  tree  rotation  as  a  morphing  op¬ 
eration  that  transforms  one  reduced  form  into  another. 
Let  A,  B,  and  C  be  the  tree  nodes  involved  in  the  ro¬ 
tation,  grouped  {{A  B)  C)  before  and  (A  {B  C))  after 
the  rotation.  The  tree  structure  ((A  B)  C)  corresponds 
to  a  disk  of  the  reduced  form  in  which  the  edges  before 
A  and  after  C  are  infinite,  the  edge  BC  is  finite,  and 
the  edge  AS  is  infinitesimal,  contained  in  a  subdisk. 
Of  course.  A,  B,  and  C  are  vertices  only  at  this  level 
of  the  reduced  form;  they  may  have  microstructure  of 
their  own.  See  Figure  7.  We  want  to  morph  AB,  BC, 
and  the  two  infinite  edges  such  that  AB  becomes  finite 
and  BC  infinitesimal. 

Lemma  4.  Let  A,  B,  and  C  be  the  turn  angles  of  a 
four-edge  bi-infinite  chain,  and  suppose  the  angles  A  -I- 
B,  B-hC,  and  A -h  B  -h  C  are  all  valid.  Then  either  of 
the  edges  AB  and  BC  can  he  reduced  to  zero  length  by 
translating  the  infinite  edge  adjacent  to  it. 


Figure  8:  Each  finite  edge  may  be  eliminated  by  trans¬ 
lating  an  infinite  edge 

Proof:  Let  Ca  he  the  infinite  edge  incident  to  A,  and 
define  en  to  be  the  ray  parallel  to  whose  origin  is 
B.  Let  ec  be  the  infinite  edge  incident  to  C.  Edge  AB 
can  be  shortened  to  zero  length  by  translating  ca  if  and 
only  if  BC  and  ec  ho  not  enter  the  infinite  trapezoid 
bounded  by  ca,  AB,  and  cb-  If  edge  BC  enters  the 
trapezoid,  then  angle  A  -I-  B  is  invalid;  if  the  infinite 
edge  ec  enters  the  trapezoid,  then  angle  A  -I-  B  -t-  C  is 
invalid.  By  assumption,  both  angles  are  valid,  so  AB 
can  be  shortened.  A  similar  argument  for  BC  finishes 
the  proof. 

The  following  lemma  shows  that  a  reduced  form  al¬ 
lows  us  enough  flexibility  to  apply  Lemma  4. 

Lemma  5.  In  a  reduced  form,  the  disk  sizes  can  be 
chosen  so  that  the  infinite  edges  incident  to  any  disk 
are  free  to  translate  a  distance  at  least  equal  to  the 
diameter  of  the  disk. 

Proof:  The  proof  is  top-down.  The  base  case  is  easily 
established;  the  top-level  infinite  edges  of  a  chain  are 
unconstrained.  (In  the  top-level  triangle  of  a  polygo¬ 
nal  reduced  form,  keeping  the  disk  radius  at  most  one- 
fourth  of  the  minimum  edge  length  of  the  triangle  estab¬ 
lishes  the  base  case.)  Within  each  disk  B  of  a  reduced 


form,  there  are  two  infinite  edges  /  and  g,  joined  by  a 
finite  edge  e.  The  endpoints  of  e  are  contained  in  two 
subdisks  concealing  microstructure.  In  the  inductive 
step,  we  assume  that  /  and  g  are  able  to  translate  at 
least  the  diameter  of  so  they  can  certainly  translate 
at  least  5  =  the  maximum  diameter  of  a  subdisk.  Edge 
e  appears  infinite  from  the  vantage  point  of  each  of  its 
disks.  When  we  translate  e  relative  to  the  disk  defined 
by  (e,  /),  the  other  disk  translates  uniformly  along  with 
the  intersection  of  the  lines  supporting  e  and  g.  There¬ 
fore,  we  choose  5  small  enough  that  translating  e  by  h 
relative  to  one  subdisk  does  not  move  the  other  subdisk 
out  of  V.  □ 

To  morph  ((A  B)  C)  to  (yl  {B  C)),  we  erase  the 
disk  around  AB,  then  translate  the  infinite  edges  to 
lengthen  AB  and  shorten  BC,  leaving  vertex  B  (and  its 
microstructure)  fixed.  Once  AB  is  of  finite  length  and 
BC  is  infinitesimally  short,  we  draw  a  new  disk  around 
BC,  leaving  A  at  the  top  level.  During  these  transla¬ 
tions  and  regroupings,  we  may  scale  the  microstructure 
at  A,  B,  and  C  uniformly  if  necessary.  (Unless  the  infi¬ 
nite  edges  are  parallel,  we  can  actually  do  the  rotation 
without  translating  the  infinite  edges  at  all.  If  they 
are  parallel  and  we  do  have  to  move  them,  we  make 
sure  the  movement  is  small  enough  that  the  condition 
of  Lemma  5  is  maintained  at  the  other  ends  of  those 
edges.) 

Because  a  valid  rotation  on  trees  can  be  mapped  di¬ 
rectly  to  a  constant-complexity  morph  of  one  reduced 
form  to  another,  any  algorithm  for  transforming  trees 
via  valid  rotations  corresponds  directly  to  an  algorithm 
for  morphing  the  reduced  forms  of  two  parallel  bi¬ 
infinite  chains.  Theorem  7  thus  implies  the  following: 

Theorem  6.  Given  two  parallel  bi-inGnite  chains  P 
and  Q  of  n  sides,  we  can  continuously  deform  P  to  Q 
by  parallel  moves  while  maintaining  simplicity.  The  de¬ 
formation  uses  0(n^/^+')  parallel  moves. 

4  Tree  reconciliation  via  rota¬ 
tions 

In  this  section  we  address  the  purely  combinatorial 
problem  of  transforming  one  binary  tree  into  another  via 
a  sequence  of  (single)  rotations.  This  is  a  well-studied 
problem  [CW,  STT];  it  is  known  that  0(n)  rotations 
are  necessary  and  sufficient  to  rotate  one  tree  into  an¬ 
other  in  the  worst  case  (the  best-known  upper  bound  is 
2n  -  6).  But  in  our  setting  there  is  additional  structure 
to  cope  with.  Our  trees  have  weights  at  their  leaves. 
The  internal  nodes  also  have  induced  weights,  namely 
the  sum  of  the  leaf  weights  in  all  leaves  belonging  to 
the  subtree  rooted  at  that  internal  node.  These  weights 
restrict  the  possible  trees  we  can  have,  as  well  as  the 
allowed  rotations  between  them. 


As  mentioned  earlier,  a  weighted  tree  is  called  valid 
if  the  weights  of  all  its  nodes  (leaves,  as  well  as  internal 
nodes)  are  reals  in  the  range  (— 1,-t-l)  (open  interval). 
A  rotation  is  called  valid  if  it  transforms  a  valid  tree  into 
another  valid  tree.  It  is  clear  that  rotations  preserve  the 
sequence  of  leaf  weights,  but  change  the  grouping  struc¬ 
ture  on  these  weights  imposed  by  the  tree.  If  Ti  and 
T2  are  two  valid  binary  trees  on  n  leaves  with  the  same 
sequence  of  leaf  weights  Wi,W2,.. .  ,Wn,  then  the  result 
mentioned  in  the  previous  paragraph  asserts  that  one 
can  always  go  from  one  tree  to  the  other  via  a  sequence 
of  rotations.  Arbitrary  rotations,  however,  can  destroy 
validity  by  generating  internal  nodes  whose  weights  are 
outside  the  range  (—1,  -pi).  So  the  natural  question  that 
arises  is  whether  it  is  always  possible  to  go  from  Ti  to 
T2  via  a  sequence  of  valid  rotations  and,  if  so,  what  is 
the  maximum  required  length  of  such  a  sequence. 

We  show  that  the  answer  to  this  question  is  affirma¬ 
tive,  but  we  do  not  know  if  our  upper  bound  is  best 
possible  (in  fact,  we  strongly  suspect  that  a  nearly  lin¬ 
ear  upper  bound  holds). 

Theorem  7 .  If  Tj  and  T2  are  any  two  valid  binary  trees 
on  the  same  sequence  of  leaf  weights  W\,W2,  ■  ■  ■  ,Wn, 
then  it  is  always  possible  to  go  from  Ti  to  T2  by  a  se¬ 
quence  of  0(n^/^'*'^)  valid  rotations. 

The  rest  of  this  section  proves  the  theorem.  Our  basic 
strategy  is  go  from  Ti  to  T2  as  follows:  take  any  pair 
of  leaves  in  T2  that  are  siblings  and  perform  valid  rota¬ 
tions  on  Ti  (if  necessary)  to  make  them  siblings  in  Ti 
as  well  (we  have  to  argue  that  this  is  always  possible). 
In  each  tree  contract  that  pair  of  leaves  into  their  par¬ 
ent,  making  the  parent  a  new  leaf  and  giving  it  a  weight 
that  is  the  sum  of  the  two  previous  leaf  weights.  We  are 
now  left  with  two  valid  trees  on  n  -  1  weighted  leaves, 
and  we  repeat  the  whole  process  until  both  trees  have 
been  reduced  to  a  single  node.  It  is  clear  that  in  this 
process  we  generate  a  sequence  of  valid  rotations  that 
transforms  Ti  into  T2.  This  simplistic  scheme  has  to  be 
refined  by  adding  a  certain  “tree  renormalization”  step 
at  key  points,  in  order  to  speed  up  the  process. 

Let  us  then  focus  first  on  the  problem  of  taking  a 
pair  of  adjacent  but  non-sibling  leaves  w  and  y  in  Ti 
and  performing  valid  rotations  to  make  them  siblings. 
See  Figure  9.  Let  x  be  the  least  common  ancestor  of  w 
and  y,  and  let  v  and  z  be  the  left  and  right  children  of  x 
respectively.  A  key  concept  for  our  analysis  is  that  of  a 
spine  in  a  binary  tree.  A  spine  is  a  maximal  sequence  of 
nodes  that  form  a  path  in  the  tree  all  of  whose  links  go 
in  the  same  direction  (all  left  or  all  right).  For  example, 
in  Figure  9,  the  paths  [u..i(;]  and  [z..y]  are  spines.  We 
base  our  procedure  for  making  w  and  y  siblings  on  a 
sequence  of  spine- contraction  steps,  as  in  the  lemma 
and  corollary  below. 
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Figure  9:  Two  adjacent  non-sibling  leaves 

Lemma  8.  Among  any  four  consecutive  internal  nodes 
along  a  spine  in  a  valid  tree,  at  least  one  can  be  rotated 
out  of  the  spine  by  a  valid  single  rotation. 

Proof:  Without  loss  of  generality  assume  that  the 

spine  is  a  left  spine,  like  [z..y]  in  Figure  9.  Let  A,  B,  C, 
and  D  be  the  right  children  of  four  consecutive  nodes 
on  the  spine,  and  let  X  be  yl’s  sibling,  as  in  Figure  10a. 
By  abuse  of  notation  let  A,  B,  C,  D,  and  X  also  denote 
the  weights  of  the  nodes.  If  any  consecutive  pair  of  right 
children,  say  {A,B),  has  a  valid  sum  (—1  <  A  +  B  < 
1),  then  a  valid  rotation  can  be  applied  at  the  edge 
joining  the  parents  of  A  and  B.  This  makes  A  and  B 
into  siblings  whose  grandparent  lies  on  the  spine,  and 
removes  one  edge  from  the  spine.  See  Figure  10b. 


Figure  10:  A  rotation  may  be  performed  on  any  spine 
with  more  than  four  nodes 

If  no  consecutive  pair  has  a  valid  sum,  then  \A  +  B\  > 
1  and  \C+D\  >  1.  All  four  of  A,  B,  C,  and  D  must  have 
the  same  sign,  so  \A+B+C+D\  =  \A+B\  +  \C+D\  >  2. 
The  weight  at  the  parent  of  D  isw  =  X  +  A  +  B  +  C  +  D. 
Because  |Ai|  <  1,  we  have  |?n|  >  1,  so  the  initial  tree  was 
invalid,  a  contradiction.  □ 

If  we  partition  a  spine  into  consecutive  groups  of  four 
nodes  each  and  apply  the  valid  rotation  guaranteed  by 
the  previous  lemma  to  each  group  independently,  we  can 
reduce  the  length  of  our  spine  by  1/4.  By  iterating  this 
process  until  the  previous  lemma  is  no  longer  applicable, 
we  obtain  the  following  spine-contraction  corollary.  We 
note  that  when  no  rotation  can  be  applied  along  a  spine, 
all  the  weights  of  the  spine  nodes  must  be  of  the  same 
sign. 

Corollary  9.  Given  a  spine  of  length  k,  we  can  reduce 
it  to  length  at  most  4  by  k  —  4  valid  rotations.  Fur¬ 
thermore,  when  this  procedure  terminates,  the  original 


children  of  the  spine  will  have  been  regrouped  into  trees 
attached  to  the  contracted  spine,  each  of  height  at  most 
O(logfc). 

If  we  use  a  spine  contraction  on  the  spines  [u..w]  and 
[z..y],  we  can  make  the  least  common  ancestor  x  of 
leaves  w  and  y  be  at  most  5  levels  above  the  two  leaves. 
We  can  then  finish  as  follows. 

Lemma  10.  If  the  weights  of  two  adjacent  leaves  w  and 
y  have  a  valid  sum,  then  w  and  y  can  be  made  siblings 
by  at  most  eight  valid  single  rotations  once  the  spines 
[v..w]  and  [z..y]  have  been  contracted. 

Proof:  After  the  spine-contraction  process,  we  can 

assume  that  no  rotation  involving  only  spine  [u..w;]  or 
spine  [z..y\  is  valid.  Let  Zi,...,4  denote  the  weights 
of  the  left  children  of  spine  [v..w],  in  left-to-right  order, 
and  similarly  let  ri , . . . ,  rj  denote  the  weights  of  the 
right  children  of  [z..y].  Observe  that  the  Vs  all  have 
the  same  sign;  the  same  is  true  for  the  r’s.  By  abuse 
of  notation,  let  w  and  y  also  denote  the  weights  of  the 
corresponding  nodes.  Denote  the  weight  at  a:  by  a;  = 
li  -b  ■  ■  ■  +  Ik  +  w  +  y  +  ri  +  ■  ■  ■  +  Vj.  We  know  \x\  <  1. 

If  h  has  the  same  sign  as  a;,  say  positive,  then  x>  x  — 
l\  >  a:-l  >  -1,  so  a;-li  is  valid.  Similarly,  if  sign(rj)  = 
sign(a:),  then  x  —  rj  is  valid.  If  sign(li)  =  sign(r’j) 
sign(a;),  then  sign(a;)  =  sign(rt)  -|-  y),  say  positive.  Then 
X  <  x  —  li  <  w  -by  <  1  and  x  —  li  is  valid  (so  is  a;  -  fj). 

Without  loss  of  generality  assume  x  —  li  is  valid.  Per¬ 
form  a  rotation  at  edge  (u,  a;),  moving  the  right  child  of 
V  to  be  the  left  sibling  of  z.  The  new  parent  of  z  and 
the  moved  node  has  weight  x  —  h,  which  is  valid.  All 
the  other  modified  nodes  have  weights  that  were  present 
before  the  rotation,  so  the  rotation  is  valid.  It  shortens 
by  one  the  length  of  the  path  between  w  and  y.  After 
the  rotation,  we  can  still  apply  the  argument  above  to 
show  that  at  least  one  of  the  edges  incident  to  the  lowest 
common  ancestor  of  w  and  y  can  be  rotated.  Perform¬ 
ing  at  most  eight  rotations,  we  shorten  the  path  length 
to  two,  making  w  and  y  siblings.  □ 

The  overall  rotation  cost  of  making  leaves  w  and  y 
siblings  is  proportional  to  the  sum  of  the  original  lengths 
of  the  two  spines  [u..w;]  and  [z..y].  This  is  2  less  than  the 
tree  distance  of  w  and  y.  Let  us  call  the  latter  quantity 
simply  the  distance  in  Ti  between  w  and  y. 

Our  discussion  so  far,  coupled  with  our  overall  tree 
morphing  strategy  mentioned  earlier,  provides  a  proof 
that  Ti  can  be  rotated  into  T2.  In  fact,  we  also  get  an 
0{n^)  upper  bound  on  the  number  of  rotations  needed, 
as  the  distance  between  any  two  adjacent  leaves  is  0{n), 
and  the  number  of  sibling-formation  operations  is  at 
most  n  —  1. 

In  order  to  improve  this  bound,  we  must  analyze  more 
carefully  the  distances  between  adjacent  leaves  that  are 
being  made  siblings.  In  particular,  the  difficulty  is  that 
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rotations  that  reduce  one  spine  length  can  increase  other 
spine  lengths — actually  it  is  very  easy  to  check  that  any 
single  rotation  will  decrease  the  distance  between  two 
pairs  of  leaves  by  1,  and  it  will  increase  the  distance 
between  two  other  pairs  by  1.  We  can,  however,  argue 
that  there  is  a  sequence  of  spine  contractions  over  a 
whole  tree  that  reduces  the  maximum  spine  length. 

Lemma  11.  Any  valid  tree  on  n  leaves  with  maxi¬ 
mum  spine  length  k  can  be  rotated  using  0{n)  rotations 
into  another  valid  tree  whose  maximum  spine  length  is 
0(logk). 

Proof:  Apply  Corollary  9  on  the  left  and  right  spines 
of  the  original  tree,  reducing  them  to  constant  length. 
Each  subtree  that  was  originally  a  child  of  the  left  or 
right  spine  is  now  at  depth  O(logfc).  Let  T  be  such 
a  subtree.  Without  loss  of  generality,  assume  that  the 
parent  of  T  is  to  its  left.  Apply  Corollary  9  on  the  right 
spine  of  T,  reducing  it  to  length  at  most  four.  (This 
process  does  not  modify  any  ancestors  of  T.)  Now  the 
length  of  the  right  spine  that  includes  T’s  right  spine 
(including  ancestors)  is  0(log /c).  Recursively  apply  this 
procedure  to  the  original  child  trees  of  T’s  right  spine, 
until  every  node  in  the  tree  has  participated  in  one 
spine-reduction  operation. 

Spine-reduction  does  not  affect  the  internal  structure 
of  the  children  of  the  spine;  the  cost  of  spine-reduction 
is  proportional  to  the  length  of  the  spine.  Each  spine- 
reduction  operation  is  applied  to  a  spine  that  has  not 
previously  been  operated  upon.  Thus  the  total  work  is 
0(n).  By  construction,  each  spine  in  the  final  tree  con¬ 
sists  of  at  most  four  nodes  that  result  from  the  reduction 
of  that  spine,  plus  0(log  k)  ancestors  that  were  rotation 
nodes  in  a  spine  reduction  higher  up  in  the  tree.  □ 

Corollary  12.  Any  valid  tree  on  n  leaves  can  be  ro¬ 
tated  using  0(n  log*  n)  rotations  into  another  valid  tree 
whose  maximum  spine  length  is  0(1). 

Proof:  Recall  that  log*  n  =  min(A:  |  log^*^^  n  <  1), 

where  log^*^  n  is  the  fc-fold  composition  log(log  •  •  •  (n)). 
By  applying  Lemma  11  0(log*n)  times,  we  reduce  the 
maximum  spine  length  to  0(1).  □ 

We  call  the  process  of  transforming  a  tree  into  one 
whose  maximum  spine  length  is  constant  a  normaliza¬ 
tion.  We  intersperse  normalizations  with  the  leaf  pair¬ 
ing  and  contraction  process  so  as  to  keep  leaf  distances 
small.  But  we  need  to  develop  one  more  idea  first,  the 
intuition  being  that  during  the  pairing  and  contraction 
process  some  leaf  pairs  may  have  a  large  distance,  but 
not  many  may  do  so.  After  all,  in  a  single  tree,  the  av¬ 
erage  distance  between  adjacent  leaves  is  constant-;-but 
of  course  our  rotations  change  the  tree  all  the  time. 

We  saw  earlier  that  rotations  redistribute  spine 
length.  In  fact.  Corollary  9  states  that  the  rotations 


that  make  siblings  of  a  pair  of  leaves  that  were  originally 
at  distance  k  can  increase  the  distance  of  any  other  ad¬ 
jacent  pair  of  leaves  by  at  most  0(log  k).  Let  us  consider 
the  following  game:  we  have  n  bins,  each  containing  at 
most  n  tokens.  The  bins  model  the  gaps  between  suc¬ 
cessive  leaves,  and  the  tokens  account  for  the  distances 
between  successive  pairs.  We  play  as  follows: 

•  At  each  turn,  we  remove  all  k  tokens  from  one  bin 
and  distribute  them  to  k  other  bins.  At  most  one 
new  token  gets  added  to  each  bin  at  each  turn. 

•  We  play  for  n  turns;  the  goal  is  to  maximize  the 
number  of  tokens  moved. 

In  the  real  tree  situation,  contracting  a  pair  of  leaves 
at  distance  k  may  increase  the  distance  of  some  other 
pair  not  by  1  but  by  0{\ogk).  But  let  us  focus  on  the 
simpler  game  situation  for  now. 

Lemma  13.  If  originally  each  bin  contains  0(1)  to¬ 
kens,  then  after  turns,  the  maximum  number  of 
tokens  moved  is  Q{n). 

Proof:  We  first  show  that  during  any  run  of  turns 
in  the  first  turns,  at  most  tokens  are  moved. 
The  run  moves  tokens  from  a  set  B  of  bins  into 
other  bins.  There  are  at  most  tokens  in  B  at  the 
start  of  the  run  (because  we  are  in  the  first  turns) 
and  at  most  tokens  get  added  into  B  during  the 
run  (because  each  bin  in  B  receives  at  most  tokens 
in  turns).  Partitioning  the  first  turns  into 
runs  of  turns  apiece  completes  the  upper  bound. 

To  prove  the  lower  bound,  let  A  and  B  be  two  disjoint 
sets  of  bins.  In  the  first  turns,  put  at  least 

tokens  in  each  bin  of  A.  Divide  the  second 
turns  into  phases  of  turns  apiece.  In  each 

phase,  move  all  the  tokens  from  A  to  B  or  vice  versa, 
putting  in  each  bin.  □ 

In  the  slightly  more  elaborate  tree  setting  in  which  a 
single  contraction  of  length  k  may  increase  a  particular 
distance  by  0(logA;),  the  bound  on  the  total  work  done 
after  contractions  becomes  0(n^+'^). 

We  can  now  complete  the  proof  of  the  main  theorem 
of  this  section.  After  every  contraction  steps,  we 
renormalize  the  tree.  Thus  the  cost  of  the  rotations  for 
the  leaf  contractions  balances  the  cost  of  the  rotations 
for  renormalization  and  both  are  bounded  by 
Thus  the  theorem  is  proved. 

5  Morphing  polygons 

In  this  section  we  use  the  tools  of  the  preceding  sections 
to  show  how  to  morph  one  polygon  to  another  parallel 
polygon.  This  is  more  difficult  than  morphing  bi-infinite 
chains  (Sections  3  and  4),  because  we  cannot  guarantee 
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that  the  reduced  forms  of  two  parallel  polygons  will  have 
the  same  top-level  triangle.  We  show  below  that  each 
morphing  operation  that  involves  two  of  the  top-level 
trees  can  be  transformed  with  0(1)  parallel  moves  to 
involve  a  single  tree. 

Let  Ti  =  (ri,T(,r(')  be  the  set  of  three  trees  associ¬ 
ated  with  the  top-level  triangle  of  the  reduced  form  of 
polygon  P,  and  let  T2  =  {T2,T2,  T^')  be  the  correspond¬ 
ing  set  of  trees  for  Q.  In  the  tree  model,  our  algorithm 
for  morphing  P  to  Q  is  the  same  as  that  of  Section  4: 
we  pick  a  pair  of  leaves  that  are  siblings  in  one  of  the 
trees  of  T2,  rotate  them  to  be  siblings  in  one  of  the  trees 
of  71,  and  then  collapse  the  pair  into  their  parent  in 
both  Ti  and  71,  thereby  reducing  the  size  of  the  prob¬ 
lem.  This  works  fine  if  the  two  leaves  belong  to  the 
same  tree  in  71 :  we  simply  apply  the  algorithm  of  Sec¬ 
tion  4  to  that  tree.  If  the  two  leaves  belong  to  different 
trees,  however,  we  must  morph  the  polygon  to  bring  the 
two  leaves  into  the  same  tree  before  we  can  apply  the 
algorithm  of  Section  4. 

Let  X  and  y  be  two  sibling  leaves  in  71,  and  with¬ 
out  loss  of  generality  suppose  that  x  and  y  are  the  last 
and  first  leaves  in  Ti  and  T{,  respectively.  We  begin 
the  morph  by  shortening  the  spines  containing  x  and 
y  as  much  as  possible  (Corollary  9).  Then  the  right 
spine  of  Ti  has  at  most  four  left  children  whose  weights 
li,.. Ik  all  have  the  same  sign.  The  left  spine  of  T{ 
has  at  most  four  right  children  with  same-sign  weights 
ri , . . . ,  Tj .  The  weight  of  Ti  is  h  +  ■■■  +  Ik  +  and 
the  weight  of  is  y  -I-  ri  -h  •  •  •  -t-  Vj .  Because  Ti  and 
TI  are  both  top-level  trees,  their  weights  are  both  pos¬ 
itive,  and  their  sum  is  between  1  and  2.  Leaves  x  and 
y  are  siblings  in  71,  so  their  combined  weight  is  valid: 
|x  -f  y|  <  1.  The  following  lemma  is  our  basic  tool  for 
separating  a  top-level  tree  into  two: 

Lemma  14.  Let  T  be  a  valid  tree,  and  suppose  the  left 
(right)  spine  ofT  has  0(1)  children,  all  of  whose  weights 
are  positive.  Let  e  be  a  spine  edge  such  that  the  total 
leaf  weight  to  the  left  of  e  is  valid,  as  is  the  total  weight 
to  the  right.  Then  the  polygonal  chain  corresponding 
to  T  can  be  morphed  with  0(1)  parallel  moves  to  sep¬ 
arate  it  into  two  linearly  separated  sub  chains  joined  by 
a  single  edge,  corresponding  to  splitting  T  at  e. 

Proof:  The  children  of  the  spine  above  e  are  all  pos¬ 
itive.  Let  Te  be  the  subtree  of  T  rooted  at  the  bottom 
of  e.  Te  corresponds  to  a  microstructure  disk  Vg  in  the 
reduced  form.  If  we  erase  all  the  microstructure  disks 
that  contain  Vg ,  we  obtain  a  mostly-convex  chain  with 


Figure  11:  Splitting  a  chain  into  two  separated  chains 

Consider  the  case  in  which  the  sequence  of  weights 
li, . . .  ,lk,x,y,ri, . . .  ,rj  can  be  split  into  two  subse¬ 
quences,  each  with  a  valid  sum,  without  separating  x 
and  y.  Without  loss  of  generality  assume  the  split  is  to 
the  right  of  some  The  sum  li  +  ■  ■  ■  -f  k  is  positive, 

and  the  sum  W  =  h+i  -1 - hx  must  be  valid.  Thus  by 

Lemma  14  the  chain  corresponding  to  Ti  can  be  split  in 
two  by  stretching  the  edge  e  corresponding  to  the  split 
point  of  the  sequence.  We  perform  a  parallel  move, 
stretching  e  and  bringing  the  subchain  corresponding 
to  W  closer  to  the  chain  (that  is,  microstructure)  for 
T{ .  Let  /  be  the  edge  of  the  top-level  triangle  joining 
Ti  to  T{.  If  W  is  positive,  we  translate  /  away  from  the 
triangle  interior,  so  that  it  gets  shorter  (Figure  12a).  If 
W  is  negative,  we  hold  /  stationary  and  translate  e  out¬ 
ward,  again  shortening  /  (Figure  12b).  In  both  cases 
the  parallel  move  does  not  encroach  upon  the  triangle’s 
third  vertex.  The  move  shortens  /  enough  that  we  can 
draw  a  microstructure  disk  around  it:  the  triangle  has 
been  changed  so  x  and  y  belong  to  the  same  tree,  and 
the  algorithm  of  Sections  3  and  4  applies. 


/ 


(b) 


disks  at  its  vertices  (the  only  possible  nonconvexity  is  at 
Vg).  See  Figure  11.  We  coalesce  all  the  disks  except  Vg 
together  by  parallel  moves  (because  T  is  valid,  none  of 
the  requisite  edge-shortenings  causes  a  collision  between 
chain  edges.)  This  essentially  leaves  a  two- vertex  valid 
chain,  whose  finite  edge  can  be  stretched  arbitrarily.  □ 


Figure  12:  Modifying  the  top-level  triangle  in  one  step 

Lemma  15.  The  sequence  li, . . .  ,lk,x,y,ri, . . .  ,rj  can 
be  split  into  at  most  three  subsequences,  each  with  a 
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valid,  positive  sum,  with  x  and  y  in  the  same  subse¬ 
quence. 

Proof:  Consider  the  modified  sequence  li, . . .  ,lk,{x-{- 
y),ri,. . .  ,rj.  Without  loss  of  generality  assume  the  Ts 
are  positive.  Let  V  be  the  longest  prefix  of  the  modified 
sequence  whose  sum  is  valid.  The  sum  of  V  is  positive 
(the  sum  up  to  y  is  >  —1,  and  if  that  sum  is  negative, 
the  6’s  must  be  positive). 

If  the  suffix  after  V  is  valid,  and  hence  positive,  we  are 
done.  If  not,  we  split  the  suffix  after  its  first  element. 
This  element  is  positive  and  valid.  The  sum  of  the  last 
part  of  the  suffix  is  also  positive  (else  the  whole  suffix 
would  be  valid)  and  valid  (else  the  total  sequence  sum 
would  be  >  2).  □ 

If  the  weight  sequence  cannot  be  split  into  two  valid 
pieces  without  separating  x  and  y,  we  split  it  in  three 
pieces,  as  in  Lemma  15.  We  stretch  the  edge  corre¬ 
sponding  to  one  of  the  two  splits,  turning  the  triangle 
into  a  convex  quadrilateral  with  edge  f  —  Ty  separated 
from  the  other  two  top-level  vertices.  See  Figure  13. 
(The  quadrilateral  must  be  convex,  else  the  weight  se¬ 
quence  could  be  split  into  two  valid  pieces.)  By  trans¬ 
lating  the  quadrilateral  edge  opposite  /  outward,  we 
shrink  it  enough  to  draw  a  microstructure  disk  around 
it.  (Translating  outward  shrinks  the  edge  because  the 
top-level  turn  at  the  endpoints  of  /  sums  to  more  than 
>  1.)  Now  we  stretch  the  edge  corresponding  to  the 
other  split.  As  in  Figure  12a,  we  translate  /  outward, 
shortening  it  until  it  can  be  contained  in  a  microstruc¬ 
ture  disk.  Now  X  and  y  belong  to  the  same  tree,  and 
the  previous  algorithm  applies. 


/ 


Figure  13:  Modifying  the  top-level  triangle  in  two  steps 

Except  for  the  preliminary  spine-shortening,  we  use 
only  0(1)  parallel  moves  to  make  x  and  y  into  siblings 
in  the  same  tree.  The  game  model  of  Lemma  13  still 
applies  to  morphing  polygons:  bins  still  correspond  to 
gaps  between  leaves,  but  now  adjacent  leaves  may  be 
in  different  trees.  Thus  the  number  of  parallel  moves 
needed  to  morph  polygons  is  asymptotically  the  same 
as  the  number  needed  for  bi-infinite  chains: 


Theorem  16.  Given  two  parallel  simple  polygons  P 
and  Q  of  n  sides,  we  can  continuously  deform  P  to  Q 
by  parallel  moves  while  maintaining  simplicity.  The  de¬ 
formation  uses  0(n'^0+£)  parallel  moves. 

6  Open  questions 

The  problem  discussed  in  this  paper  and  our  solution  to 
it  raise  many  fascinating  questions. 

The  two  specific  questions  left  open  by  our  approach 
are  (1)  to  show  that  in  any  polygon  an  edge  can  always 
be  eliminated  by  an  edge  translation,  and  (2)  to  get  a 
tight  bound  on  the  number  of  parallel  moves  needed  for 
the  morph.  As  mentioned  earlier,  we  strongly  suspect 
that  this  number  is  nearly  linear. 

More  generally,  it  would  be  interesting  to  introduce 
some  measure  of  polygon  distortion  and  look  at  edge 
translation  algorithms  that  minimize  this  measure.  Our 
algorithm  is  very  bad  this  way,  for  even  if  P  and  Q 
are  very  similar-looking  polygons,  their  intermediate 
morphs  can  visually  look  quite  different. 
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Abstract 

We  provide  a  new  technique  for  deriving  optimal  sized 
polygonal  schema  for  triangulated  compact  2-manifolds 
without  boundary  in  0{n)  time,  where  n  is  the  size  of 
the  given  triangulation  T.  We  first  derive  a  polygonal 
schema  P  embedded  in  T  using  Seifert-Van  Kampen’s 
theorem.  A  reduced  polygonal  schema  Q  of  optimal  size 
is  computed  from  P,  where  a  surjective  mapping  from 
the  vertices  of  P  is  retained  to  the  vertices  of  Q.  This 
helps  detecting  null-homotopic  (contractable  to  a  point) 
cycles.  Given  a  cycle  of  length  k  we  determine  if  it  is 
null-homotopic  in  0{n  +  gk)  time  where  g  is  the  genus 
of  the  given  2-manifold.  The  actual  contraction  for  a 
null-homotopic  cycle  can  be  computed  in  0{gkn)  time 
and  space.  This  is  an  improvement  of  a  factor  of  g  over 
the  previous  best-known  algorithms  for  these  problems. 

1  Introduction 

In  recent  years  a  new  focus  has  developed  in  studying 
the  algorithmic  aspects  of  topology  [2,  4,  6,  7,  9,  10], 
a  well-developed  branch  of  mathematics.  This  emer¬ 
gent  field  has  been  called  “Computational  Topology” 
[6,  10].  It  is  generally  recognized  that  there  exists  a  Vcist 
repository  of  topological  problems  which  have  not  been 
studied  extensively  from  an  algorithmic  point  of  view. 
This  paper  deals  with  the  problem  of  computing  polyg¬ 
onal  schema,  an  efficient  representation  of  2-manifolds, 
from  their  triangulations.  An  elegant  solution  to  this 
problem  provides  an  improved  algorithm  for  detecting 
null-homotopic  cycles  on  compact  2-manifolds. 
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A  2-manifold  M  can  be  represented  by  a  polygon 
P  (polygonal  schema)  with  even  number  of  edges  such 
that  when  edges  are  appropriately  identified,  we  get 
back  M.  Given  a  triangulation  T  of  A4,  Vegter  and 
Yap  [10]  designed  an  algorithm  that  derives  a  polygo¬ 
nal  schema  (canonical)  of  optimal  size  whose  edges  be¬ 
long  to  the  edges  of  a  refined  triangulation  of  T.  This 
method  subdivides  the  edges  of  T  into  0(g)  subedges 
and  increases  the  size  of  the  triangulation  to  0(gn). 
A  lower  bound  of  it(gn)  is  known  for  deriving  such  a 
canonical  polygonal  schema  for  2-manifolds.  In  that  re¬ 
spect  the  algorithm  of  Vegter  and  Yap  is  optimal.  How¬ 
ever,  in  applications  we  may  not  need  such  polygonal 
schema.  As  shown  in  this  paper,  we  can  derive  a  polyg¬ 
onal  schema  of  optimal  size  in  0(n)  time  from  T  with¬ 
out  refining  it.  This  polygonal  schema  is  not  embedded 
in  a  refinement  of  T  but  retain  a  surjective  mapping 
from  the  vertices  of  another  polygonal  schema  embed¬ 
ded  in  T  to  its  vertices.  This  fact  is  used  to  detect 
null-homotopic  cycles  on  M.  Our  method  of  comput¬ 
ing  the  polygonal  schema  is  also  simpler  than  Vegter 
and  Yap’s  method.  We  believe  that  this  method  will 
find  more  applications  in  other  algorithms. 

Given  a  cycle  C  on  a  2-manifold,  we  say  that  C  is 
null-homotopic  if  it  can  be  contracted  to  a  single  point. 
Detecting  the  null-homotopy  of  a  given  cycle  in  a  topo¬ 
logical  space  is  a  century-old  problem,  known  as  the 
contractabitity  problem  in  topology  [8].  A  related  prob¬ 
lem  involves  determining  if  two  given  cycles  can  be  con¬ 
tinuously  deformed  to  one  another.  It  turns  out  that  a 
solution  to  the  contractability  problem  also  provides  a 
solution  to  this  problem. 

The  contractability  problem  for  2-manifolds  was  first 
solved  from  a  mathematical  point  of  view  during  the 
1880s.  It  was  established  that  a  cycle  is  null-homotopic 
if  and  only  if  its  corresponding  curve  on  the  univer¬ 
sal  covering  space  is  closed.  Universal  covering  space 
is  a  collection  of  polygons  (polygonal  schema)  appro¬ 
priately  attached  to  give  a  tessellation  of  the  plane.  In 
[6],  Schipper  used  this  result  to  give  an  0(g‘^k  -f  gn) 
time  and  space  algorithm  to  detect  the  null-homotopy 
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of  a  given  cycle.  Here  k  is  the  length  of  the  given  cycle. 
In  his  algorithm  Schipper  derives  the  canonical  polygo¬ 
nal  schema  using  the  algorithm  of  Vegter  and  Yap  [10]. 
Using  our  polygonal  schema  computation  we  design  an 
algorithm  that  runs  in  0(n  +  gk)  time  and  space.  If  the 
given  cycle  is  null-homotopic,  the  actual  contraction  can 
be  computed  in  0{gnk)  time.  This  is  an  improvement 
of  a  factor  of  g  over  Schipper’s  algorithm.  We  should 
mention  that  Schipper’s  algorithm  does  not  remain  lin¬ 
ear  in  k  and  takes  0{n  -|-  k^)  time  when  5  =  1  for  torus 
(orientable)  and  jr  =  2  for  klein  bottle  (non-orientable). 
We  treat  these  cases  separately  and  give  Q(n  +  k)  op¬ 
timal  algorithms  for  them. 

We  describe  necessary  concepts  in  homotopy  and  fun¬ 
damental  groups  of  topological  spaces  in  Section  2.  In 
the  third  section  we  explain  the  polygonal  representa¬ 
tion  (polygonal  schema)  of  2-manifoIds.  In  Section  4 
we  describe  a  technique  to  derive  a  reduced  polygonal 
schema  from  the  given  triangulation  of  a  2-manifold  us¬ 
ing  Seifert- Van  Kampen’s  theorem.  In  the  fifth  section 
we  detail  the  algorithm  for  null-homotopy  detection, 
and  we  conclude  in  Section  6. 

2  Homotopy  and  Fundamental 
Group 

Let  f  •.  T\  —*  T2  and  g  :  Ti  — >  T2  be  two  maps  be¬ 
tween  the  topological  spaces  Ti  and  Ta.  These  two 
maps  are  called  homotopic  if  there  is  a  continuous  func¬ 
tion  h  :  [0, 1]  X  Ti  — ►  r2  such  that  h(0,  x)  =  f[x)  and 
h{l,x)  =  ^(2:).  We  can  interpret  h  as  a  deformation 
process  that  transforms  /  to  (7  in  a  continuous  manner. 
We  are  interested  in  the  case  when  Ti  =  S^,  the  unit 
circle.  Then,  /  and  g  are  closed  curves,  also  called  cy¬ 
cles.  Let  Cl,  C2  be  two  cycles  that  begin  at  a  common 
point,  say  p.  The  product  operation,  “  ■  is  defined 
on  them  as  C3  =  Ci  •  C2,  where  C3  is  the  cycle  begin¬ 
ning  at  p,  then  going  around  Ci  followed  by  a  traversal 
around  C2  which  finishes  at  p.  The  inverse  C~^  of  a  cy¬ 
cle  C  is  the  cycle  lying  on  top  of  C,  but  with  opposite 
orientation. 

Given  a  fixed  origin  p  for  the  cycles  on  T2,  we  call 
C,  C  equivalent  if  there  is  a  homotopy  between  them 
which  keeps  p  fixed.  The  equivalence  class  of  C  is  de¬ 
noted  by  [Cj.  The  product  operation  “  •  ”  extends  nat¬ 
urally  to  equivalence  classes  as  [Ci]  •  [C2]  =  [Gi  •  C2]. 

These  equivalence  classes  form  a  group  under  the 
product  operation.  The  identity  element  1  is  repre¬ 
sented  by  the  equivalence  class  of  the  point  cycle  p, 
and  [C]  •  [G-i]  =  [C  ■  G"!]  =  1  giving  [G]"'  =  [G"!]. 
This  group,  denoted  7ri(72),  is  called  the  fundamental 
group  of  the  topological  space  T2.  It  turns  out  that  this 
group  is  independent  of  the  choice  of  the  origin  p  and 


is  an  invariant  property  of  the  underlying  space. 

Fact  2.1  A  cycle  C  is  contractable  or  null-homotopic 
if  and  only  if[C]  =  1  in  ttj  [8]. 

We  use  the  following  concepts  of  group  theory  in  the 
next  sections.  Let  G  be  a  group  with  the  product  opera¬ 
tion  .  A  set  of  elements  X  =  {51, 5^2,  ■■■}  of  G  is  called 
a  generator  set  if  any  non-identity  g  E  G  can  be  written 
^  g  =  g[  ■  92--9k  for  some  k>l  and  g[  E  {ffi,  52,  •••}• 
In  other  words  all  elements  of  G  are  generated  by  X. 
A  word  is  a  concatenation  of  elements  of  G  under  the 
product  operation.  For  example  w  =  91.92-93  =  919293 
is  a  word.  A  relation  r  is  a  word  which  is  set  to  1.  For 
example  r  =  g\g293  is  a  relation  if  gig293  =  1.  Rela¬ 
tions  of  the  form  99~^,9~^9  are  called  trivial  relations. 

The  structure  T  =<  91, 92...  ■  ni,r2,...  >  is  called 
a  group  presentation  of  G  if  {91,92,  ■■■}  is  a  gener¬ 
ator  set  that  generates  G  with  respect  to  the  rela¬ 
tions  {ri,r2,...}.  We  also  write  G  =<  9i,92,---  '■ 
ri,r2...  >.  Two  relations  are  equivalent  if  one  can  be 
derived  from  the  other.  A  group  is  freely  generated 
by  A  =  {gi,g2,  •••}  if  there  are  no  nontrivial  relations. 
In  that  case  the  group  has  a  presentation  of  the  form 
<  9u92,  ■■■  ■■  —  >. 

3  Polygonal  Schema 

A  2-manifold  is  a  topological  space  where  each  point  has 
a  neighborhood  homeomorphic  to  an  open  disk.  By  this 
definition,  we  are  considering  only  2-manifolds  without 
boundary.  A  2-manifold  can  be  infinite  or  finite.  More¬ 
over,  it  can  be  closed  or  open  depending  on  whether 
or  not  its  closure  coincides  with  itself.  A  closed  and 
finite  2-manifold  is  also  called  a  compact  2-manifold.  A 
sphere  and  a  klein  bottle  are  two  examples  of  compact 
2-manifolds.  A  2-manifold  is  called  orientable  if  it  has 
two  distinct  sides.  Otherwise,  it  is  non-orientable.  For 
details  see  [8]. 

A  2-manifold  is  triangulable  in  the  sense  that  it  can 
be  represented  as  the  union  of  a  set  of  triangles,  edges, 
and  vertices  satisfying  the  following  properties.  Each 
pair  of  triangles  either  share  a  single  vertex  or  a  single 
edge,  or  are  completely  disjoint.  Also,  the  triangles 
incident  on  a  vertex  can  be  ordered  circularly  so  that 
two  triangles  share  a  common  edge  if  and  only  if  they 
are  adjacent  in  this  ordering.  In  this  paper  we  consider 
only  compact  2-manifolds  without  boundary. 

Any  orientable  or  non-orientable  2-manifold  M  can 
be  represented  by  a  simple  polygon  P  with  an  even 
number  of  edges  on  bd{Py  which  is  also  called  a  polyg¬ 
onal  schema  of  M.  Each  edge  of  P  has  a  signed  label 
such  that  each  unsigned  label  occurs  twice.  See  [3,  8] 

^bd{P)  represents  the  boundary  of  the  polygon  P. 
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Figure  3.1:  A  polygonal  schema  for  double  torus 


for  details.  Two  edges  with  the  same  unsigned  labels 
are  called  partnered  edges.  Partnered  edges  can  have 
labels  with  the  same  or  opposite  signs.  Two  partnered 
edges  with  the  labels  +a:  and  -x  represent  the  same 
edge  on  M  but  are  oppositely  directed  on  P.  Figure 
3.1  shows  a  polygonal  schema  for  double  torus  {g  =  2). 
We  use  x~^  to  denote  the  complement  of  the  label  x. 
To  reconstruct  a  surface  homeomorphic  to  M  from  its 
polygonal  representation,  the  oriented  edges  with  the 
same  labels  are  identified  together  in  such  a  way  that 
their  orientations  match.  For  simplicity,  we  say  that 
M  is  obtained  from  P  by  identifying  partnered  edges 
appropriately. 

An  orientable  2-manifold  M  with  genus  j  >  0  can 
be  represented  canonically  using  a  4g-gon  where  all  4g 
vertices  represent  the  same  vertex  on  Ai.  The  size  of 
this  polygon  is  optimal  since  no  fc-gon  with  k  <  4g  can 
represent  M.  The  labels  on  the  edges  around  the  poly¬ 
gon  are  of  the  form: 

XiyiX-^yi^X2y2X2^y2^-3:gygX~^yJ^. 

Similarly,  a  non-orientable  2-manifold  with  genus 
y  >  1  can  be  represented  canonically  using  an  opti¬ 
mal  sized  2^-gon  where  the  labels  on  the  edges  around 
the  polygon  are  of  the  form: 

X \X\X2^2’ g ' 

For  g  =  0,  the  orientable  2-manifold  is  the  sphere 
which  can  be  represented  canonically  by  two  directed 
edges  xx~^.  Similarly  for  g  =  1,  the  non-orientable 
2-manifold  is  the  projective  plane  which  can  be  rep¬ 
resented  by  two  directed  edges  xx.  If  we  identify  the 
partnered  edges  of  the  canonical  polygon  appropriately, 
they  form  a  set  of  curves  glued  at  a  single  point  on  the 
2-manifold.  These  curves  are  called  canonical  genera¬ 
tors. 

Let  T  be  a  triangulation  of  a  2-manifold  M  with  n 
edges.  A  path  on  T  is  a  sequence  of  alternating  vertices 
and  edges,  VieiV2e2---^k-i'Vk,  where  the  edge  e;  has  ver¬ 
tices  u,-,uipi  as  endpoints.  A  cycle  is  a  path,  which  is 


closed. 

The  following  lemma  provides  a  method  that  flattens 
out  T  to  a  triangulation  T'  of  a  planar  polygon. 


Lemma  3.1  A  polygonal  schema  P  with  triangulation 
T'  can  he  constructed  from  T  where  there  is  a  one-to- 
one  correspondence  between  triangles  ofT'  and  T. 

Proof.  We  construct  a  sequence  of  closed  disks 
Di,D2,...,Dn  in  a  plane  incrementally  such  that  P  = 
Dn  at  the  end.  Initially,  Di  =  a  triangle  in  the 
plane  that  corresponds  to  an  arbitrarily  chosen  triangle 
(Ti  on  T.  Let  Di  =  a'l  Ucr2...U(7-  after  the  ith  step.  At 
the  i  \  th  step  we  choose  a  triangle  ai+i  on  T  which 
has  the  following  properties:  (i)  no  triangle  correspond¬ 
ing  to  (Ti+i  has  been  included  in  D,-,  and  (ii)  a  triangle 
aj  adjacent  to  cr.+i  by  an  edge  has  a  corresponding  tri¬ 
angle  in  Di-  These  two  conditions  imply  that  there  is 
an  edge  e  =  aj  f)  cr,-+i  such  that  its  corresponding  edge 
e'  on  Di  appears  on  bd[Di).  We  attach  a  triangle 
to  bd{Di)  such  that  n6d(A)  =  e'.  This  gives  the 
new  disk  A+i  =  A'  U  It  is  clear  that  if  Di  is  a 

closed  disk,  so  is  A+i-  Finally,  when  we  exhaust  all 
triangles  on  Af,  we  have  with  the  triangulation  V 
that  has  the  following  properties,  (i)  Each  triangle  in  T' 
corresponds  to  a  single  triangle  in  T  and  vice  versa,  (ii) 
Each  edge  e'  on  6d(£)„)  has  a  partnered  edge  e"  such 
that  they  both  correspond  to  a  single  edge  e  on  T.  This 
is  because  each  edge  e  on  T  has  two  incident  triangles 
and  the  edges  on  6d(Z?„)  have  a  single  triangle  incident 
on  them,  (hi)  When  partnered  edges  of  bd{Dn)  are 
identified,  we  get  back  T.  This  is  because,  by  our  con¬ 
struction,  the  two  triangles  in  T'  incident  on  partnered 
edges  e',e"  correspond  to  the  two  triangles  incident  on 
e  in  T.  Incidence  relations  of  all  other  edges  inside  T' 
are  isomorphic  to  their  corresponding  edges  on  T.  So, 
Dn  =  P,  3.  polygonal  schema  for  M  with  a  one-to-one 
correspondence  between  triangles  in  T  and  T'. 
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Figure  4.1;  The  spaces  D,  G  and  G' 


4  Computing  an  optimal  polyg¬ 
onal  schema 

The  size  (number  of  edges  on  the  boundary)  of  the 
polygonal  schema  as  deduced  in  the  last  section  can  be 
n(n),  which  is  too  large  for  our  purpose.  Here,  we  show 
a  technique  to  derive  a  polygonal  schema  Q  of  optimal 
size  from  P  that  retains  enough  information  to  preserve 
a  surjective  mapping  from  vertices  of  P  to  Q.  Due  to 
this  mapping,  we  would  be  able  to  trace  a  path  on  Q 
that  corresponds  to  a  path  on  P  and  hence  a  path  on 
T.  We  already  observed  that  the  triangulation  T'  of  P 
becomes  isomorphic  to  the  triangulation  T  oi  M  when 
partnered  edges  are  identified.  From  now  on  we  will  not 
distinguish  between  the  triangles,  edges  of  T  and  T';  we 
refer  to  them  with  the  same  name.  With  this  set  up, 
two  partnered  edges  on  bd{P)  have  the  same  name. 

Let  the  1-complex  (graph)  formed  by  the  identified 
edges  of  bd(P)  be  G.  The  space  D  =  M.  —  G  is  an  open 
disk  since  P  —  bd{P)  is  an  open  disk.  Let  us  attach  a 
thin  layer  of  an  open  set  on  two  sides  of  the  edges  of 
G  to  make  it  open,  and  denote  this  space  G';  see  figure 
4.1.  We  can  express  7ri(A4  =  G'UD)  in  terms  of  7ri(G') 
and  t^i{D)  using  the  following  theorem: 

Seifert- Van  Kampen’s  theorem:  Suppose  5  is  a  space 
which  can  be  expressed  as  the  union  of  path-connected 
open  sets  A,  B  such  that  A  fl  j3  is  path  connected  and 
such  that  tvi{A)  and  rri(5)  have  respective  representa¬ 
tions 

7ri(^)  =<  Cl,  02, ...  :  ri, ...,  r„  > 

7ri(5)  =<  61,62,  ••• :  si,...,Sg  > 

while  'Ki{A  D  B)  is  finitely  generated. 

Then  7ri(S')  =<  Oi,  02, ...,  61, 62....  : 

>  where  Ui,Vi  are 
the  expressions  for  the  same  generator  oi  -iri{A  D  B) 
in  terms  of  the  generators  of  Tti{A)  and  ■ki{B),  respec¬ 
tively. 

Since  any  cycle  (curve)  on  G'  can  be  continuously 
deformed  to  the  edges  of  G,  we  have  7ri(G)  =  7ri(G'). 


Let  a  set  of  generators  of  7ri(G)  be  7  with  the  set  of 
relations  pi.  The  fundamental  group  of  D  is  the  trivial 
group  {1}  since  D  is  an  open  disk.  Let  the  set  of  rela¬ 
tions  obtained  for  the  generators  of  7ri(DnG')  in  terms 
of  the  generators  of  7ri(D)  and  7ri(G')  be  p2.  The  spaces 
A4,  G'  and  D  satisfy  all  conditions  of  Seifert- Van  Kam¬ 
pen’s  theorem.  Then  according  to  this  theorem  7ri(Af) 
has  a  presentation  <  j  ■  pi,  P2  >■ 

A  presentation  of  ni{G)  can  be  obtained  as  follows. 
Let  y  be  a  spanning  tree  of  G,  and  p  be  a  vertex  of 
y.  Each  edge  e  of  G  defines  a  cycle  c(e)  =  wew'  orig¬ 
inating  at  p  where  w,uj'  are  the  paths  from  p  to  the 
end  points  of  e  along  the  edges  of  Y .  To  simplify  nota¬ 
tions  we  write  e  for  the  cycle  c(e).  Any  cycle  e  where 
e  is  an  edge  of  Y  is  contractable  to  p  and  hence  be¬ 
longs  to  the  identity  of  7ri(G).  Let  B  =  {6; ,  62, 6^} 
be  the  set  of  edges  of  G  that  do  not  belong  to  y.  A 
presentation  of  7ri(G)  is  <  bi,b2,...,bt  :  —  >  imply¬ 
ing  Pi  =  {}.  Consider  the  sequence  of  edges  around 
(clockwise)  bd{P).  Some  of  these  edges  belong  to  Y 
which  are  set  to  1  in  7ri(G).  In  terms  of  the  generators 
of  7ri(G),  let  the  sequence  of  edges  around  bd{P)  form 
the  word  6ill...62ll...63ll...62£ll...l,  where  6(s  repre¬ 
sent  the  edges  in  B  or  their  inverses.  This  word  is 
equivalent  to  r  =  The  generator  for  the  space 

DnG'  is  b\b2...b2i  in  terms  of  the  generators  of  7ri(G') 
and  is  1  in  terms  of  the  generator  of  D.  So  we  have 
P2  =  b\b'2...b'2i^  —  1.  Hence  a  presentation  of  7ri(A4)  is 
<  61 , 62, ...,  bi  :  p2  >. 

Lemma  4.1  £  =  2g  if  M  is  orientable  and  =  p  if  M 
is  non-orientable. 

Proof.  Consider  the  polygon  P  with  all  edges  in  Y 
contracted  to  a  single  point.  This  results  in  a  polygon 
Q  with  the  edges  b'ib'2...b'2(_  labeled  clockwise  around  it 
(figure  4.2).  The  polygon  <5  is  a  2Agon  with  partnered 
edges.  Further  we  can  assume  that  all  vertices  of  Q  have 
the  same  label,  i.e.,  when  partnered  edges  are  identified, 
all  vertices  are  identified  to  a  single  point.  Then  Q  rep¬ 
resents  a  polygonal  schema  for  a  manifold  M'  home- 
omorphic  to  M.  This  is  because  (i)  When  partnered 
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edges  are  identified  Q  form  a  2-manifold  M'  whose 
fundamental  group  is  same  as  ‘Ki{M)  (apply  Seifert- 
Van  Kampen’s  theorem)  and  (ii)  any  two  compact  2- 
manifolds  are  homeomorphic  if  and  only  if  their  funda¬ 
mental  groups  are  isomorphic.  Euler’s  characteristic  of 
M'  is  2  —  f  since  we  get  one  vertex,  I  edges  and  one 
face  when  the  edges  of  Q  are  identified.  It  is  known 
that  the  Euler’s  characteristic  of  a  surface  M  of  genus 
g  \s2  —  2g  ii  M  is  orientable  and  is  2  —  if  -M  is  non- 
orientable.  Since  M  and  M'  are  homeomorphic,  their 
Euler’s  characteristics  must  be  equal,  proving  ^  =  2^  if 
M  is  orientable  and  ^  =  y  if  tW  is  non-orientable. 

4.1  The  Algorithm 

We  construct  the  polygonal  schema  P  as  described  in 
lemma  3.1.  If  we  assume  that  the  triangulation  T  is 
represented  with  a  data  structure  that  allows  us  to  ac¬ 
cess  the  triangles  incident  on  an  edge  and  the  edges  of 
a  triangle  in  constant  time,  then  the  construction  of  P 
takes  0(1)  time  per  triangle  and  0{n)  in  total.  In  case 
T  is  represented  with  only  a  set  of  triangles  such  that 
the  edges  of  a  triangle  are  accessed  in  constant  time  but 
not  the  triangles  incident  on  an  edge,  we  spend  0(n)  ex¬ 
tra  time  to  preprocess  this  data  structure.  If  the  edges 
are  given  with  integer  names,  we  index  the  set  of  edges 
of  the  given  triangles  in  an  array  and  match  them  to 
detect  the  incident  triangles  for  each  edge.  If  the  edges 
are  given  as  pair  of  vertices,  we  sort  them  through  radix 
sort  in  linear  time  and  then  match  them. 

While  constructing  P  we  maintain  pointers  from  the 
edges  and  triangles  of  T  to  the  edges  and  triangles  of 
P.  Also,  the  edges  of  the  1-complex  G  are  detected 
during  this  step.  A  spanning  tree  Y  of  G  is  computed 
in  0{n)  time.  The  edges  not  in  Y  are  detected  and  the 
corresponding  edges  on  bd{P)  are  marked  with  the  label 
other  than  1.  All  other  edges  on  bd{P)  are  marked  1. 
All  these  take  0{n)  time. 

The  reduced  polygon  Q  of  size  2£  is  constructed  from 
P  as  follows.  Let  the  sequence  of  edges  around  bd{P) 
(clockwise)  be  b\b'2...b'2i  ignoring  the  edges  marked  1. 
We  form  the  polygon  Q  to  have  2f  edges  labeled 
around  it.  We  maintain  pointers  from  the  ver¬ 
tices  of  P  to  the  vertices  of  Q  as  follows.  Let  b'^bj^i  be 
any  two  consecutive  edges  in  the  sequence  (circular) 
b\b'2...b'2i  and  viV2...v,  be  the  vertices  of  bd{P)  between 
b'i  and  b\j^i  where  vi  is  an  endpoint  of  and  v,  is  an 
endpoint  of  All  these  vertices  point  to  the  same 

vertex  v  between  b[  and  in  Q.  See  figure  4.2.  The 
polygon  Q  can  be  thought  of  as  the  polygon  P  with 
all  edges  in  the  spanning  tree  Y  shrunk  to  a  single  ver¬ 
tex.  By  lemma  4.1  Q  has  optimal  size.  The  point¬ 
ers  from  P  io  Q  realizes  a  surjective  mapping  between 
their  vertices.  Combining  all  costs  together  Q  can  be 


constructed  from  T  in  0(n)  time. 

Theorem  4.2  Let  M  be  any  compact  2-manifold  of 
genus  g  with  triangulaiion  T  of  size  n.  A  polygonal 
schema  Q  of  optimal  size  can  be  constructed  in  0{n) 
time,  where  a  surjective  mapping  is  retained  from  the 
vertices  of  a  polygonal  schema  embedded  in  T  to  the 
vertices  of  Q. 

5  Detecting  Null-Homotopy 

Traversing  a  cycle  C  on  T  is  equivalent  to  traversing 
an  ordered  set  of  paths  U  =  {ui,  U2, ...,  Ur}  that  have 
end  points  on  bd{P)  if  C  intersects  G.  Each  of  these 
paths  Ui  is  homotopic  to  a  path  that  has  edges  only 
on  hd{P)  and  have  the  same  endpoints  as  u,.  Note  that 
no  such  path  exists  when  C  does  not  intersect  G.  In 
that  case  C  is  trivially  null-homotopic.  We  consider 
the  nontrivial  case  when  C  intersects  the  edges  of  G. 

When  edges  on  bd[P)  are  identified,  the  sequence 
u[u2,...,Ur  form  a  cycle  C'  on  T,  which  is  homotopic 
to  C.  Each  path  can  be  expressed  as  a  word  Wi  in 
terms  of  the  generators  bi,b2,--.,bi  by  listing  the  se¬ 
quence  of  edges  on  it  that  are  not  marked  1  on  bd{P). 
This  is  actually  done  on  bd{Q)  to  avoid  unnecessary  vis¬ 
its  of  edges  that  are  marked  1  on  bd{P).  We  also  write 
Wi  ^  Ui  if  the  word  Wi  corresponds  to  the  path  u,-.  The 
word  w  =  wxW2--.Wr  represents  the  cycle  C  in  7ri(A4). 
We  determine  if  ry  =  1  to  detect  if  G  is  null-homotopic 
(Fact  2.1). 

The  sequence  of  paths  {uiU2...Ur]  with  endpoints  on 
bd{P)  are  detected  through  pointers  as  we  traverse  C 
on  T.  The  homotopic  path  uj  of  Ui  is  identified  by  the 
endpoints  Vi,V2  of  uj.  We  do  not  traverse  the  path  u- 
on  bd{P)  which  can  have  U{n)  length.  To  form  the  word 
Wi  ~  Ui  we  detect  the  corresponding  vertices  of  ui ,  V2  on 
bd{Q)  through  pointers  from  P  to  Q.  We  can  consider 
the  labeled  edges  between  these  vertices  on  bd{Q)  in 
any  direction  (clockwise  or  counter-clockwise)  to  form 
the  word  Wi.  This  takes  0(|wi|)  time.  Thus  the  word 
w  for  G  is  constructed  in  0(|ty|)  time.  The  length  of  w 
can  be  at  most  0{gk)  since  each  word  Wi  can  have  at 
most  2£  —  0{g)  length  and  there  can  be  r  =  0{k)  such 
words  in  the  worst  case. 

To  determine  if  C  is  null-homotopic,  we  check  if  u;  = 
1.  The  polygon  Q  can  be  thought  of  as  a  polygonal 
schema  for  M,  where  2£  vertices  represent  the  same 
point  on  M.  So,  w  can  be  thought  of  as  a  word  formed 
by  the  concatenation  of  edges  taken  from  a  canonical 
set  of  generators  of  M-  Detecting  u;  =  1  for  such  a 
word  w  is  known  as  the  word  problem.  Schipper  gave  an 
algorithm  for  this  problem  which  runs  in  ©(s'lial)  time. 
Using  this  algorithm  we  can  detect  if  u;  =  1  in  0{g‘^k) 
time;  however,  we  can  do  better.  If  u;  =  wiW2...Wr 
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where  Wi  ~  u,-,  we  can  detect  if  to  =  1  in  0(gr)  =  0{gk) 
time. 

Our  method  is  similar  to  the  method  of  Schipper  [6]. 
Let  U{M.)  denote  the  universal  covering  space  for  M. 
We  can  construct  a  universal  covering  space  U{M.)  of 
M.  as  follows:  Let  Pm  be  a  d^-gon  (if  M  is  orientable) 
or  a  2(?-gon  (if  M.  is  non-orientable)  such  that  when  its 
edges  are  identified  we  get  a  set  of  generators  for  Ad 
meeting  at  a  single  point.  By  taking  infinitely  many 
copies  of  Pm  and  gluing  them  together  along  the  iden¬ 
tified  edges,  one  gets  a  tessellation  of  the  plane  with 
either  4^-gons  or  25-gons,  depending  on  whether  M  is 
orientable  or  not.  Since  the  polygon  Q  can  be  thought 
of  as  a  polygonal  schema  for  Ad  where  all  2t  vertices 
represent  the  same  point  on  Ad,  we  can  take  Pm  —  Q 
to  construct  U{M).  The  following  fact  is  well-known 
[8]. 

Fact  5.1:  Any  curve  C  on  M  can  be  mapped  to  a 
unique  path  u  (lifted  path)  in  Z/(Ad)  such  that  C  is 
null-homotopic  if  and  only  if  u  is  closed. 

We  construct  a  part  of  U{M)  in  an  incremental  fash¬ 
ion  which  is  sufficient  to  detect  if  the  lifted  path  u  of  w 
is  closed.  If  u  is  closed,  this  method  provides  a  closed 
disk  whose  boundary  is  u.  For  each  re,-,  the  edge  with 
which  it  starts  is  called  its  first  edge,  and  the  edge  with 
which  it  terminates  is  called  its  last  edge.  All  other 
edges  are  called  middle  edges.  The  first  edge  and  the 
last  edge  of  a  word  Wj  are  same  if  |tr:,|  =  1.  We  update 
the  constructed  structure  when  we  traverse  any  of  the 
first  and  last  edges.  Since  all  middle  edges  in  between  a 
first  edge  and  a  last  edge  are  visited  on  a  single  polygon 
(current  polygon),  we  do  not  need  to  update  the  struc¬ 
ture  during  their  visits.  Because  of  its  similarity  to  [6] 
and  shortage  of  space,  we  omit  the  details  of  updation 
here.  An  improved  version  of  this  procedure  is  reported 
in  [1]. 

An  amortized  analysis  shows  that  each  first  and  last 
edge  take  0(g)  amortized  processing  time.  Since  there 
are  0(k)  such  edges,  0(gk)  time  is  spent  on  them  al¬ 


together.  Each  middle  edge  is  traversed  in  0(1)  time 
taking  0(gk)  time  altogether  since  there  are  0(gk)  such 
edges.  Thus  the  total  time  spent  on  detecting  the 
closedness  of  the  lifted  path  for  w  is  0(gk).  Unfor¬ 
tunately,  when  g  <  1  (^  <  2  if  Ad  is  non-orientable) 
the  above  amortized  analysis  fails.  Indeed,  for  3  =  1 
(5  =  2  if  Ad  is  non-orientable),  the  above  procedure 
takes  Q(k^)  time  in  total.  We  use  different  procedures 
for  these  cases  as  detailed  below. 

5.1  Special  Cases 

Any  cycle  on  a  sphere  (g  =  0)  is  contractable  to  a  sin¬ 
gle  point.  So,  the  contractability  problem  is  trivial  for 
spheres.  The  fundamental  group  of  any  torus  (g  =  1) 
has  the  presentation  <  a,b  :  aba~^b~^  >  which  means 
ab  —  ba.  Thus  any  word  w  on  the  canonical  generators 
can  be  expressed  as  =  a'^6".  This  implies  u;  =  1  if 
and  only  if  m  =  0  and  n  =  0.  Let  an  edge  e  on  Q  (n 
rectangle)  be  traversed  Cj  times  in  the  clockwise  direc¬ 
tion  and  C2  times  in  the  counter-clockwise  direction  on 
the  paths/words  wi,'W2,  ...,Wr.  Let  c'  =  Cj  —  C2.  We 
have  m  =  0  and  n  =  0  if  and  only  if  c®  -1-c^  =0  for  each 
pair  of  partnered  edges  e,e'.  Detecting  this  fact  takes 
|wi|)  =  0(gk)  =  0(k)  time.  With  the  0(n) 
preprocessing  time  we  have  an  0(n  +  k)  time  algorithm 
for  torus. 

Similar  to  the  orientable  2-manifolds,  the  cases  when 
jr  =  1  (projective  planes)  and  g  =  2  (klein  bottle)  for 
non-orientable  2-manifolds  are  treated  separately.  Al¬ 
though  the  case  for  51  =  1  can  be  solved  trivially,  the 
case  for  g  ■=  2  (polygon  Q  is  rectangle)  cannot  be  solved 
by  the  method  used  for  torus.  Instead  we  use  a  different 
method  for  klein  bottles. 

For  klein  bottles,  the  polygon  Q  is  a  rectangle  and  the 
sequence  of  edges  around  bd(Q)  reads  either  abab^^  or 
aabb;  see  Figure  5.1.  The  universal  covering  space  with 
these  two  patterns  are  shown  in  Figure  5.1.  We  observe 
that  vertices  in  the  universal  covering  space  have  two 
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Figure  5.1:  Universal  covering  spaces  for  klein  bottles 


different  configurations  in  both  cases.  We  can  deter¬ 
mine  in  an  0(1)  step  the  edges  that  take  us  left,  right, 
up  and  down  for  each  configuration.  For  example,  from 
configuration  Fi  (Figure  5.1)  we  go  left,  right,  up  and 
down  by  the  edges  b~^,  b,  a~^  and  a  respectively.  We 
can  also  decide  in  0(1)  time  which  configuration  we  end 
up  with  when  we  move  from  a  particular  configuration 
through  a  particular  edge.  For  example,  we  go  to  Fj 
from  F{  through  the  edge  b~^  (Figure  5.1).  Once  we 
have  this  information  available  through  a  preprocessing 
step,  we  can  visit  the  edges  of  w  on  bd{Q)  and  for  each 
such  edge  we  know  the  direction  of  our  movement  in 
the  universal  covering  space  without  constructing  it  ex¬ 
plicitly.  The  path  on  the  universal  covering  space  will 
be  closed  if  and  only  if  we  move  an  equal  number  of 
times  left  as  right  and  an  equal  number  of  times  up  as 
down.  This  procedure  takes  0(|w|)  =  0{gk)  =  0(k) 
time.  With  preprocessing  times  added,  we  have  an  op¬ 
timal  0(n  +  k)  time  and  space  algorithm. 

Combining  the  costs  of  all  steps  for  all  cases  we  have 
the  following  theorem. 

Theorem  5.1  Let  Ad  be  any  compact  2-manifold  with¬ 
out  boundary  of  genus  y  >  0  with  a  triangulation  T  of 
size  n.  We  can  detect  if  a  given  cycle  C  of  length  k  is 
null-homotopic  in  0{n  -)-  gk)  time  and  space. 

5.2  Related  Problems 

Detecting  homotoplc  cycles:  Two  cycles  Ci,C2  of 
lengths  ki,  k^  respectively  are  homotopic  if  and  only  if 
[Ci  •  =  1.  Let  wi,W2  be  the  words  constructed 

corresponding  to  Ci  and  respectively.  We  have 
[Ci  •  =  1  if  and  only  if  wi.w^^  =  1.  This  can  be 

detected  in  0{g{ki  +  ^2))  time  giving  an  0{n  +  g{ki  + 
^2))  time  algorithm  for  this  problem. 

Computing  actual  contractions:  The  path  u  cor¬ 
responding  to  w  on  the  partially  constructed  universal 
space  is  closed  if  and  only  if  the  original  cycle  C  is 
null-homotopic.  In  case  u  is  closed  we  can  compute  the 
actual  contraction  as  follows.  Recall  that  the  closed 
disk  whose  boundary  is  u  is  present  in  the  constructed 


structure.  First,  the  cycle  C  is  deformed  to  the  cycle 
C  =  u[u2...u'j.  on  G.  This  is  achieved  by  deforming 
ui,U2,  ..-tUr  to  u'l,  U2, ...,  u(.  respectively  over  the  trian¬ 
gles  of  T  in  Oikn)  time.  Next  we  contract  all  the  edges 
of  G  that  are  present  in  the  spanning  tree  Y  in  0(n) 
time.  This  may  leave  some  of  the  triangles  of  T  (equiv¬ 
alently  T'  of  P)  to  be  contracted  to  a  single  edge  loop 
or  a  double  edge  loop.  These  two  steps  deform  the  cy¬ 
cle  G  to  u,  which  is  contracted  over  the  polygons  in  the 
constructed  structure.  The  deformed  triangulation  T' 
is  superimposed  on  each  such  polygon  Q.  A  contraction 
of  u  over  the  deformed  triangulations  of  the  polygons 
completes  the  contraction  of  C.  The  constructed  par¬ 
tial  structure  can  have  at  most  0{gk)  polygons  since  it 
is  constructed  in  0{gk)  time.  This  implies  that  u  can 
be  contracted  over  0{gk)  triangulations  of  size  0{n) 
each.  This  produces  an  0{gkn)  time  and  space  actual 
contraction  for  C. 

Remark:  Recently  using  our  polygonal  schema  compu¬ 
tation  and  a  clever  modification  of  Schipper’s  universal 
covering  space  construction  with  the  help  of  sophisti¬ 
cated  data  structures,  we  have  designed  an  improved 
algorithm  for  the  null  homotopy  problem  [1].  This  algo¬ 
rithm  runs  in  0{n+k^/\ogg)  time  and  in  Q{n+k)  space. 
The  actual  contraction  can  be  computed  in  Q{nk)  time 
and  space. 

6  Conclusions 

We  have  presented  a  new  technique  to  derive  a  reduced 
polygonal  schema  from  a  triangulation  of  a  given  2- 
manifold.  This  has  produced  an  improved  algorithm  for 
detecting  null-homotopic  cycles  on  compact  2-manifolds 
without  boundary.  We  believe  that  this  technique 
will  find  further  applications  in  other  algorithms  on 
2-manifolds.  The  method  can  possibly  be  adapted 
to  solve  the  contractability  problem  on  compact  2- 
manifolds  with  boundary.  An  open  question  is:  can  the 
recent  time  bound  0{n  -h  k^/\ogg)  achieved  for  null- 
homotopy  problem  be  further  improved  to  0(n  -t-  k)l 
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An  extension  of  the  null-homotopy  problem  is  to  con¬ 
sider  it  on  arbitrary  complexes.  Fundamental  group 
of  any  d-dimensional  {d  >  1)  complex  is  determined 
only  by  its  2  dimensional  subcomplex.  So,  to  solve  the 
contractability  problem  we  can  concentrate  on  only  2- 
complexes.  However,  it  is  known  that  the  contractabil¬ 
ity  problem  is  unsolvable  for  arbitrary  4-manifolds  [8]. 
Hence  the  contractability  problem  is  unsolvable  for  ar¬ 
bitrary  2-complexes.  It  is  an  interesting  and  probably 
a  difBcult  question  to  characterize  the  2-complexes  for 
which  the  contractability  problem  is  solvable.  Perhaps 
a  more  practical  problem  would  be  to  consider  the  2- 
complexes  that  are  embeddable  in  three  dimensions. 
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Abstract 

Given  a  subspace  X  C  R'*  and  a  finite  set  S  C  R'*,  we  in¬ 
troduce  the  Delaunay  simplicial  complex,  V\,  restricted 
by  X.  Its  simplices  are  spanned  by  subsets  T  C  S  for 
which  the  common  intersection  of  Voronoi  cells  meets  X 
in  a  non-empty  set.  By  the  nerve  theorem,  \}V\  and  X 
are  homotopy  equivalent  if  all  such  sets  are  contractible. 
This  paper  shows  that  \JV\  and  X  are  homeomorphic 
if  the  sets  can  be  further  subdivided  in  a  certain  way  so 
they  form  a  regular  CW  complex. 

1  Introduction 

This  paper  studies  the  problem  of  constructing  simpli¬ 
cial  complexes  that  represent  or  approximate  a  geomet¬ 
ric  object  in  some  finite-dimensional  Euclidean  space, 
R'*.  We  refer  to  the  geometric  object  as  a  topological 
space  or  subspace  of  R”^.  This  problem  arises  in  geo¬ 
metric  modeling  and  finite  element  analysis,  and  it  is 
a  special  case  of  the  grid  generation  problem  [21].  It 
is  special  because  we  only  consider  grids  or  complexes 
made  up  of  simplices.  The  problem  can  be  divided  into 
two  questions: 

How  do  we  choose  the  points  or  vertices  of  the 
grid? 

How  do  we  connect  the  vertices  using  edges, 
triangles,  and  higher-dimensional  simplices? 
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In  this  paper  we  concentrate  on  the  second  question. 
In  particular,  given  a  subspace  and  a  finite  point  set  in 
R*^,  we  give  an  unambiguous  rule  for  constructing  a  sim¬ 
plicial  complex  representing  the  subspace.  Topological 
properties  of  this  simplicial  complex,  such  as  whether 
its  domain  is  homotopy  equivalent  or  homeomorphic  to 
the  subspace,  can  be  studied  based  on  local  interactions 
between  the  subspace  and  the  Voronoi  neighborhoods 
of  the  points.  This  leads  us  back  to  the  first  question: 
additional  points  can  be  chosen  so  they  improve  the 
local  interaction  patterns.  How  this  can  be  done  in  a 
concrete,  possibly  three-dimensional  setting  ought  to  be 
the  subject  of  future  investigations. 

Simplicial  complexes  and  triangulations.  An 
affinely  independent  point  set  T  C  R**  defines  the  sim¬ 
plex  (Tt  =  convT.  Its  dimension  is  k  =  dimcrx  = 
card T—  1,  and  it  is  also  referred  to  as  a  k-simplex.  The 
points  of  T  are  the  vertices  of  <tt.  A  simplicial  com¬ 
plex,  /C,  is  a  finite  collection  of  simplices  that  satisfies 
the  following  two  properties:  if  cry  G  1C  and  U  C  T 
then  au  E  1C,  and  if  tTr ,  cry  €.  K.  then  (Ty  n  cry  =  (XTnV  ■ 
The  first  property  implies  0  G  /C,  and  the  first  and 
second  properties  combined  imply  ctt  Ci  ay  E  1C.  The 
vertex  set  of  fC  is  vert/C  =  [jaTefc'^’  dimension 
is  dim/C  =  max^gK  dimer,  and  the  underlying  space  is 
subcomplex  of  /C  is  a  simplicial  com¬ 
plex  C  C  1C. 

A  particular  simplicial  complex  defined  by  a  non¬ 
degenerate  finite  set  5  C  R'*  is  the  Delaunay  simplicial 
complex,  V  =  Vs  [6].  It  consists  of  all  simplices  (Tt, 
T  C  S,  for  which  there  exists  an  open  ball,  B,  with 
5  n  clR  =  T  and  5  0  5  =  0.  Given  5,  V  is  unique, 
dimP  =  minjd,  card 5  —  1},  and  [JX>  =  conv5.  In 
computational  geometry,  V  is  usually  referred  to  as  the 
Delaunay  ‘triangulation’  of  5  [7,  18].  To  avoid  confu¬ 
sion  with  the  topology  notion  of  a  triangulation,  which 
is  adopted  in  this  paper,  we  choose  to  call  V  a  simpli¬ 
cial  complex.  Following  the  tradition  in  combinatorial 
topology,  a  triangulation  of  a  topological  space  X  is  a 
simplicial  complex  1C  together  with  a  homeomorphism 
between  X  and  |J/C  [17,  19].  If  there  exists  a  simplicial 
complex  K.  such  that  [J/C  is  homeomorphic  to  X,  then 
X  is  triangulable. 
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Outline.  Our  approach  to  constructing  a  simplicial 
complex  that  represents  a  topological  space  X  C  is 
based  on  a  finite  set  S  CU‘^  and  the  Delaunay  simplicial 
complex  of  this  set,  V  =  Vs-  Section  2  introduces  the 
concept  of  a  Delaunay  simplicial  complex  Vx  restricted 
by  X,  which  is  a  sub  complex  of  V.  This  concept  is  a 
common  generalization  of  ideas  developed  by  Martinetz 
and  Schulten  [15],  Chew  [4],  and  Edelsbrunner  [8].  Vx 
is  defined  for  every  X  C  and  every  non-degenerate 
finite  S'  C  R^.  If  S  satisfies  the  assumptions  of  the 
nerve  theorem,  see  section  2,  then  [JX>x  and  X  are  ho- 
motopy  equivalent.  Section  3  presents  some  topological 
concepts  and  discusses  the  meaning  of  non-degeneracy 
in  detail.  Sections  4  and  5  study  conditions  on  S  that 
guarantee  y}Vx  be  homeomorphic  to  X.  An  explicit 
construction  of  a  homeomorphism  is  also  given.  Sec¬ 
tion  6  mentions  directions  for  further  research. 

2  Restricted  Delaunay  Simpli¬ 
cial  Complexes 

Coverings  and  nerves.  Let  X  C  R'*  be  a  topological 
space  and  5  C  R‘*  a  finite  point  set.  We  assume  non¬ 
degenerate  position  of  the  points  in  5,  which  generically 
means  that  anything  vanishing  under  a  slight  perturba¬ 
tion  of  the  points  is  precluded.  For  example,  we  require 
that  T  C  5  be  affinely  independent  if  cardT  <  d  -t-  1, 
and  that  no  d  +  2  points  of  S  be  cospherical.  Further 
particulars  of  this  assumption  will  be  discussed  in  sec¬ 
tion  3.  The  Voronoi  cell  of  p  £  S  is 

Fp  =  {a;  G  R'^  I  |pa;|  <  \qx\,  q  G  S'}, 

where  \yz\  denotes  the  Euclidean  distance  between 
points  y,z  £  R'^.  The  collection  of  Voronoi  cells  is 
V  =  Vs  =  {Vp  1  p  G  S}  [22].  The  Voronoi  cell  restricted 
to  X  of  p  G  S  is  Vp^x  =  X  n  V},,  and  the  collection  of 
restricted  Voronoi  cells  is  Vx  =  Vs,x  =  |  P  £  S). 

For  a  subset  T  C  S  we  have  corresponding  subsets  Vy  = 
{Vp  I  p  G  T}  C  V  and  Vy.x  =  {fp.x  I  P  £  ?’}  C  Vx. 
We  will  consider  their  common  intersections, 
n  =  X  n  P  Vy. 

A  covering  of  X  is  a  collection  C  of  subsets  of  X  so 
that  X  =  (JC.  It  is  a  closed  (open)  covering  if  each 
set  in  C  is  closed  (open),  and  it  is  a  finite  covering  if 
C  is  finite.  For  a  subset  D  C  C  consider  the  common 
intersection,  (ID.  The  nerve  of  a  finite  covering  C  is 

nerve  C  =  {DCC|[^D7^  0}. 

We  remark  that  the  nerve  can  be  defined  for  a  finite 
covering  of  any  abstract  set,  not  just  for  subsets  of  R'*. 
A  geometric  realization  of  nerve  C  is  a  simplicial  com¬ 
plex,  K,  together  with  a  bijection  (3  between  C  and  the 


vertex  set  of  A,  so  that  D  G  nerve  C  iff  the  simplex 
spanned  by  ^(D)  is  in  K,. 

Observe  that  the  collection  of  Voronoi  cells  restricted 
to  X,  Vx,  is  a  finite  closed  covering  of  X.  The  Delaunay 
simplicial  complex  restricted  by  X,  Vx  =  Vs,x,  is  the 
geometric  realization  of  nerve  Vx  defined  by  /?(Vp)  =  p, 
for  all  p  £  S.  That  is,  Vx  —  {ctt  \  T  C  S,  p  Vt,x  7^ 
0}.  Note  that  Vx  is  a  subcomplex  of  the  Delaunay 
simplicial  complex  V  =  V^d  of  S.  See  figure  2.Tfor  an 
example.  The  nerve  theorem  of  combinatorial  topology 


Figure  2.1:  (a)  The  Voronoi  cells  often  points  decompose 
three  spaces,  Xi,  X2,  and  X3.  (b)  The  three  correspond¬ 
ing  Delaunay  simplicial  complexes  consist  of  an  edge,  a 
cycle  of  three  edges,  and  two  triangles  sharing  an  edge, 
respectively. 

[2,  13,  23]  sheds  some  light  on  the  relationship  between 
X,  Vx,  and  Vx-  See  section  3  for  a  formal  definition  of 
homotopy  equivalence  and  contractibility. 

Theorem  (nerve).  Let  C  be  a  finite  closed  covering 
of  a  triangulable  space  X  C  R*^  so  that  for  every 
D  £  C,  P  D  is  either  empty  or  contractible.  Let  1C 
be  a  geometric  realization  of  nerve  C.  Then  X  and 
[J/C  are  homotopy  equivalent. 

In  particular,  if  p  D  is  empty  or  contractible  for  every 
D  C  Vx  then  X  and  IJ'Dx  are  homotopy  equivalent. 
Related  earlier  work.  Martinetz  and  Schulten  [15] 
study  neural  nets  modeling  a  topological  space  X.  In  ge¬ 
ometric  language,  a  neural  net  consists  of  finitely  many 
points  and  edges  between  them.  The  algorithm  in  [15] 
constructs  the  net  by  choosing  a  finite  set  5  C  X  and  se¬ 
lecting  edges  based  on  points  sampled  from  X  found  in 
Voronoi  cells  associated  with  point  pairs.  A  cell  is  asso¬ 
ciated  with  every  pair  {p,  q}  C  S  for  which  Vj,  n  V}  7^  0, 
see  [7,  18].  This  cell  can  be  interpreted  as  a  “thick¬ 
ened”  version  of  the  face  common  to  Vp  and  Vq,  and 
a  point  of  X  sampled  in  this  cell  is  taken  as  evidence 
that  X  n  fj,  n  V}  7^  0.  Martinetz  and  Schulten  call  the 
resulting  net  the  induced  Delaunay  triangulation  of  S 
and  X;  in  the  limit  it  is  the  same  as  the  edge-skeleton 
of  Vx- 
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Chew  [4]  introduces  a  method  for  constructing  a  sim- 
plicial  complex  approximating  a  (two-dimensional)  sur¬ 
face  in  R^.  Let  X  be  such  a  surface  and  5  C  X  a  finite 
point  set.  Three  points  p,q,r  €  S  span  a  triangle  in 
the  approximating  complex  if  they  lie  on  the  bound¬ 
ary  of  an  open  ball  B  C  with  center  in  X  so  that 
S  D  B  =  0.  Assuming  non-degeneracy,  such  a  ball  B 
exists  iff  X  n  V"p  n  Vg  n  K  #  0-  Chew’s  method  can  thus 
be  seen  as  a  special  case  of  our  definition  of  restricted 
Delaunay  simplicial  complexes. 

Finally,  Edelsbrunner  [8]  defines  the  dual  complex  of 
a  union  of  balls  in  Consider  the  case  where  all  balls 
are  equally  large.  Let  S  C  R'^  be  a  finite  point  set,  and 
define  X  =  {ar  G  R"^  |  minpgs  \xp\  <  p},  for  some  fixed 
positive  p  G  R.  The  Voronoi  cells  of  S  decompose  X  into 
closed  convex  regions,  and  the  dual  complex  is  defined 
as  the  nerve  of  these  regions,  geometrically  realized  by 
the  map  /?(V"p,x)  =  P,  for  all  p  £  S.  We  see  that  it 
is  the  same  as  the  Delaunay  simplicial  complex,  Vx, 
restricted  by  X.  The  common  intersection  of  any  subset 
of  these  regions  is  convex  and  therefore  contractible,  so 
the  nerve  theorem  implies  that  the  underlying  space  of 
the  dual  complex  is  homotopy  equivalent  to  X. 

3  Some  Topological  Concepts 

Neighborhoods,  homotopies,  homeomorphisms, 
and  manifolds.  An  open  ball  in  R*^  is  a  set  B  = 
B{x,p)  =  {y  G  R'*  1  \yA  <  P)  fo”^  X  G  R'* 

and  some  positive  p  G  R;  x  is  the  center  and  p  the  ra¬ 
dius  of  B.  For  Y  C  X,  a  neighborhood  of  Y  in  X  is  an 
open  subset  of  X  that  contains  Y . 

Let  X  and  Y  be  two  topological  spaces.  Two  maps 
f^g-X^Yare  homoiopic  if  there  is  a  continuous  map 
/i  ;  X  X  [0, 1]  Y  with  h{x,  0)  =  /(x)  and  h{x,  1)  =  g{x) 
for  all  X  G  X.  The  two  spaces,  X  and  Y,  are  homotopy 
equivalent  if  there  are  continuous  maps  /  ;  X  — >  Y  and 
g  Y  X  so  that  g  o  f  is  homotopic  to  the  identity 
map  in  X  and  /  o  p  is  homotopic  to  the  identity  map 
in  Y.  X  is  contractible  if  it  is  homotopy  equivalent  to  a 
point. 

Topological  spaces  X  and  Y  are  homeomorphic,  writ¬ 
ten  X  pa  Y,  if  there  is  a  bijective  map  ip  ;  X  — *■  Y  so  that 
p  and  are  continuous,  p  is  a  homeomorphism  be¬ 
tween  X  and  Y,  and  X  and  Y  are  homeomorphs  of  each 
other.  Homeomorphs  of  open,  half-open,  and  closed 
balls  of  various  dimensions  play  an  important  role  in 
the  forthcoming  discussions.  For  k  >  0,  let  o  be  the 
origin  of  R*’  and  define 

H*’  =  {x  =  (6,.--,6)eR'' 16  >0}, 

=  {x  G  R*  1  Ixo]  <  1},  and 

=  {xGRMkol  =  l}- 


For  convenience,  we  define  R*  —  H*  —  B*’  —  S*’  —  0  if 
ik  <  0.  An  open  k-ball  is  a  homeomorph  of  R*’,  a  half¬ 
open  k-ball  is  a  homeomorph  of  H*,  a  closed  k-ball  is 
a  homeomorph  of  B*’,  and  a  (k  -  l)-sphere  is  a  homeo¬ 
morph  of  5*“^  For  ik  >  1  these  are  disjoint  classes  of 
spaces,  that  is,  open  balls,  half-open  balls,  closed  balls, 
and  spheres  are  pairwise  non-homeomorphic.  This  is 
not  true  for  ifc  =  0:  open,  half-open,  and  closed  0-balls 
are  points,  and  a  0-sphere  is  a  pair  of  points. 

Our  first  theorem  is  about  topological  spaces  that  are 
manifolds,  with  or  without  boundary.  X  C  R'^  is  a  k- 
manifold  without  boundary  if  each  x  G  X  has  an  open 
ik-ball  as  a  neighborhood  in  X.  X  C  R*^  is  a  k-manifold 
with  boundary  if  each  x  G  X  has  an  open  or  half-open 
jfe-ball  as  a  neighborhood  in  X,  and  there  is  at  least  one 
X  G  X  that  has  no  open  fc-ball  as  a  neighborhood.  The 
set  of  points  without  open  fc-ball  neighborhood  forms 
the  boundary,  bdX,  of  X.  Note  that  the  boundary  of  a 
half-open  ik-ball  is  an  open  {k  —  l)-ball,  which  is  there¬ 
fore  without  boundary.  From  this  it  follows  that  the 
boundary  of  a  A;-manifold  with  boundary  is  a  (^  —  1)- 
manifold  without  boundary.  The  interior  of  a  manifold 
X  is  intX  =  X  -  bdX;  it  is  the  set  of  points  with  open 
ik-ball  neighborhoods.  Note  that  our  definition  distin¬ 
guishes  between  manifolds  with  and  without  boundary, 
which  is  somewhat  non-standard  as  the  set  of  manifolds 
without  boundary  is  usually  considered  a  subset  of  the 
set  of  manifolds  with  boundary.  A  manifold  X  is  com¬ 
pact  if  every  open  covering  of  X  has  a  finite  sub  covering, 
or  equivalently,  if  it  is  closed  and  bounded.  A  manifold 
Y  C  X  is  a  submanifold  of  X. 

Non- degeneracy.  In  section  4,  we  are  interested  in 
the  intersection  between  manifolds  and  affine  flats.  An 
Amanifold  T  C  R"^  is  an  i-flat  if  it  is  the  affine  hull  of 
t-\-l  points,  or  equivalently,  it  is  the  intersection  of  d-i 
hyperplanes,  or  linear  functionals.  Let  X  C  R'^  be  an 
m-manifold.  Intuitively,  a  point  x  G  R*^  has  d  degrees  of 
freedom,  and  it  looses  d-moi  them  if  it  is  constrained 
to  lie  in  X.  Similarly,  x  looses  d  —  i  degrees  if  it  is 
constrained  to  lie  in  F.  So  if  x  G  X  D  F  then  x  should 
have  lost  2d-m-£  degrees  of  freedom.  Hence,  X  n  F 
should  be  empty  if  d— (2d  — m  — 6  =  m  +  £  —  d  <  0.  In 
general,  we  expect  x  to  have  m+£—d  degrees  of  freedom 
and  XnF  to  be  an  (m-f  £-d)-manifold.  Algorithmically, 
such  a  non-degeneracy  assumption  can  be  simulated  by 
conceptual  perturbation  techniques,  see  e.g.  [9,  24]. 

This  intuitive  argument  can  be  formalized  for  smooth 
or  piecewise  smooth  manifolds.  We  need  some  def¬ 
initions  from  differential  topology.  For  an  open  set 
X  C  R'",  a  map  /  ;  X  R"  is  smooth  if  it  has  contin¬ 
uous  partial  derivatives  of  all  orders  and  at  all  points 
of  X.  For  arbitrary  X  C  R"*,  /  is  smooth  if  for  all 
X  G  X  there  exists  an  open  ball  B  =  B{x,e)  C  R”  and 
a  smooth  map  g  B  — >■  R"  so  that  g  equals  /  on  X  fl  5. 
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Topological  spaces  X  C  R™  and  Y  C  R”  are  diffeomor- 
phic  if  there  exists  a  homeomorphism  y?  :  X  — >■  Y  so 
that  Ip  and  are  smooth.  An  m-manifold  X  with 
or  without  boundary  is  smooth  if  each  x  &  X  has  a 
neighborhood  diffeomorphic  to  R*^  or 

Now,  let  X  be  a  smooth  m-manifold,  and  let  /  :  X  — > 
R  be  a  smooth  map.  Then  t/  €  R  is  a  regular  value  of  / 
if  for  every  x  E  f~^  (y)  some  partial  derivative  of  /  at  a; 
is  non-zero;  otherwise,  y  is  a  critical  value  of  /.  By  the 
preimage  theorem  in  differential  topology,  the  preimage 
of  any  regular  value  is  a  smooth  submanifold  of  X  with 
dimension  m—  1,  and  by  Sard’s  theorem,  the  set  of  crit¬ 
ical  values  has  measure  0  in  R,  see  e.g.  [10,  11].  This  im¬ 
plies  that  with  probability  1,  the  intersection  between  a 
smooth  m-manifold  X  and  a  hyperplane  with  prescribed 
normal  direction  is  a  smooth  (m  —  l)-manifold.  Hence, 
with  probability  1,  the  intersection  between  X  and  an 
Aflat  F  with  prescribed  normal  (d  —  £)-flat  is  a  smooth 
(m  +  £  —  d)-manifold.  By  non-degenerate  position  of  F 
we  mean  that  this  is  indeed  the  case,  and  it  is  reason¬ 
able  to  assume  non-degenerate  position  because  F  just 
needs  to  avoid  a  measure  zero  set  in 

One  of  the  conditions  necessary  for  our  results  is 
the  non-degeneracy  of  the  intersection  between  Voronoi 
cells  and  the  manifold.  We  thus  extend  the  above  no¬ 
tions  to  Voronoi  cells  and  their  intersections.  An  in¬ 
tersection  of  Voronoi  cells  is  the  common  intersection 
of  finitely  many  closed  half-spaces,  and  thus  a  convex 
polyhedron.  Let  P  be  a  convex  polyhedron  and  let  X  be 
a  manifold  without  boundary.  We  say  that  P  intersects 
X  generically  ifXnP  =  0orXnP  has  the  right  dimen¬ 
sion  and  X  n  int  P  =  int  (X  0  P).  If  X  is  a  manifold  with 
boundary,  then  P  intersects  X  generically  if  P  intersects 
intX  and  bdX  generically.  By  non-degenerate  position 
of  P  we  mean  that  P  intersects  X  generically.  Again, 
this  is  a  reasonable  assumption  to  make. 

4  Triangulating  Compact  Mani¬ 
folds 

We  are  now  ready  to  state  conditions  under  which  the 
restricted  Delaunay  simplicial  complex  is  a  triangula¬ 
tion  of  X.  These  conditions  will  be  applied  only  when 
X  is  a  compact  manifold.  To  avoid  any  confusion,  we 
note  that  our  results  do  not  settle  the  open  question 
whether  or  not  all  compact  manifolds  are  triangulable. 
The  closed  ball  property.  Throughout  this  section 
we  assume  non-degenerate  position  of  flats  and  discrete 
point  sets.  Let  m  >  0  and  let  X  C  R'^  be  a  compact 
m-manifold  with  or  without  boundary.  Let  5  C  R'^ 
be  a  finite  point  set.  We  say  that  S  has  the  generic 
intersection  property  for  X  if  for  every  subset  T  C  S, 
P|Vt  intersects  X  generically.  We  say  that  S  has  the 


closed  ball  property  for  X  if  for  every  £  <  m  and  every 
subset  T  C  S  with  card  T  =  m  +  I  —  £,  the  following 
two  conditions  hold; 

(Bl)  n  Vt,x  is  either  empty  or  a  closed  Aball,  and 

(B2)  n  ^T.bdx  is  either  empty  or  a  closed  {£  —  l)-ball. 

In  section  3,  we  argued  that  it  is  reasonable  to  ex¬ 
pect  X  n  P|Vt  be  an  Amanifold,  if  X  is  a  smooth  m- 
manifold.  This  is  reflected  in  condition  (Bl).  A  similar 
non-degenerate  position  assumption  is  implied  by  (B2) 
for  the  boundary  of  X.  We  will  see  that  the  closed  ball 
property  guarantees  that  dimension  is  preserved.  As  an 
example  consider  Xi  and  2?Xi  in  figure  2.1.  Condition 
(B2)  is  violated  by  the  common  edge  of  the  two  Voronoi 
cells  intersecting  Xi.  Indeed,  because  the 

dimension  of  Xi  is  two  and  that  of  [jX>Xi  is  one. 

Two  technical  lemmas.  Before  proving  the  firts  the¬ 
orem  we  establish  two  facts  about  the  closed  ball  prop¬ 
erty.  The  first  says  that  the  closed  ball  property  pre¬ 
serves  dimension  locally,  and  the  second  is  a  statement 
about  the  way  Voronoi  cells  intersect  a  compact  mani¬ 
fold  and  its  boundary.  We  work  with  arbitrary,  that  is, 
possibly  non-smooth  m-manifolds  since  we  do  not  need 
any  smoothness  properties  for  our  proofs.  A  simplex 
cct  G  'Dx  is  a  principal  simplex  if  there  is  no  proper  su¬ 
perset  U  DT  with  CTu  £  Vx  ■  The  first  fact  is  formalized 
in  the  following  lemma. 

Lemma  4.1  (preservation  of  dimension).  Let  m  >  1,  let 
X  C  R'^  be  a  compact  m-manifold  with  or  without 
boundary,  and  let  5  C  R'*  be  a  non-degenerate  fi¬ 
nite  set  of  points  that  has  the  generic  intersection 
property  for  X.  If  S  has  the  closed  ball  property 
for  X  then  every  principal  simplex  of  Vx  is  an  m- 
simplex. 

Proof.  Let  cry  £  Vx  be  a  principal  simplex,  and  define 
F'  =  f]\/T  and  F  =  n^T.x-  By  condition  (Bl),  F  = 
X n P'  is  a  closed  Aball,  with  £  =  m+l  —  card  T.  Since 
cry  £  Vx,  we  have  F  ^  (!),  which  implies  £  >  0  and 
therefore  card  T  <  m-fl.  If  card  T  =  m-1-1  then  cry  is  an 
m-simplex  and  we  are  done.  So  suppose  cardT  <  m-|-l. 
Since  cry  is  a  principal  simplex,  it  follows  that  F  C 
intP',  for  otherwise  there  is  a  proper  face  G'  —  f]\/u 
of  T',  so  T  C  U,  with  X  fl  G"  yf  0.  If  follows  that 
(^u  G  Fx,  which  contradicts  the  principality  of  cry. 

Finally,  we  show  that  F  C  int  F'  also  leads  to  a  con¬ 
tradiction.  As  mentioned  above,  T  is  a  closed  Aball 
and  hence  hdF  is  an  (^-  l)-sphere.  This  {£-  l)-sphere 
is  contained  in  intP',  so  it  must  lie  in  bdX.  Indeed, 
bdP  =  bdXn  PL  This  contradicts  condition  (B2), 
which  requires  bdX  D  P'  be  a  closed  {£  —  l)-ball.  0 

For  the  next  lemma  we  need  a  classic  result  on  sub¬ 
divisions  of  a  certain  type  of  complex.  A  closed  ball  is 
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Figure  4.1  (c)  shows  the  harycentric  subdivision  of  the 
simplicial  complex  in  4.1  (b).  Note  that  sd/C  can  be 


called  a  cell,  or  a  k-cell  if  its  dimension  is  k.  A  finite 
collection  of  non-empty  cells,  Tl,  is  a  regular  CW  com¬ 
plex  if  the  cells  have  pairwise  disjoint  interiors,  and  the 
boundary  of  each  cell  is  the  union  of  other  cells  in  TZ. 
A  subset  of  is  a  chain  if  its  elements  can  be  ordered 
so  that  each  contains  its  predecessors  and  is  contained 
in  its  successors.  Let  C-ji  be  the  set  of  chains  in  TZ  and 
note  that  nerve  C-ji  is  well  defined.  The  result  men¬ 
tioned  is  that  any  geometric  realization  of  nerve  Cn  is 
homeomorphic  to  \JTZ.  It  can  be  found  in  [5,  chapter 
V]  and  [14,  chapter  III]  and  also  in  [3]  where  it  is  ap¬ 
plied  to  manifolds  subdivided  by  the  cells  of  a  regular 
CW  complex.  Another  classic  result  needed  is  the  weak 
Schonflies  theorem,  see  e.g.  [20,  chapter  3],  which  im¬ 
plies  that  if  A  is  a  piecewise  linear  fc-sphere  and  B  C  A 
is  a  piecewise  linear  closed  fc-ball  then  A  —  int  B  is  also 
a  closed  fc-ball. 

Lemma  4.2  (complimentary  closed  ball  property).  Let 
m,  X,  and  5  be  as  in  lemma  4.1.  Let  T  C  5  be  so 
that  G  =  n^T.bdX  ^  0;  and  define  F  = 
and  i=m-  cardT.  If  S  has  the  closed  ball  prop¬ 
erty  for  X  then  hdF  -  intG  is  a  closed  ^-ball. 

Proof.  By  condition  (B2),  P  is  a  closed  -f  l)-ball, 
and  thus  an  (.£-l-l)-manifold  with  boundary.  Let  TZ  con¬ 
sist  of  all  sets  f]  V|7,x  and  f]  \/u,hd  \  over  all  17  C  S  with 
T  CU.  Assuming  S  has  the  closed  ball  property  for  X, 
these  sets  are  cells,  so  is  a  regular  CW  complex  and 
F  =  [jTZ.  Let  /C  be  a  geometric  realization  of  nerve  Cn 
and  let  y?  :  P  — »■  U  ^  ^  homeomorphism;  it  exists  be¬ 
cause  of  the  homeomorphism  result  mentioned  above. 
Note  that  y?(bd  P)  =  bd  [J  /C  is  the  underlying  space  of 
a  subcomplex  of  1C,  and  similarly,  <p{G)  =  [JG  for  a 
subcomplex  T  C  /C.  By  construction,  bd[J/C  is  a  piece- 
wise  linear  ^-sphere  and  [J£  C  bd[J^  is  a  piecewise 
linear  closed  Gball.  The  weak  Schonflies  theorem  im¬ 
plies  that  bd  U  /C  -  int  U  £  is  a  closed  Aball.  Since  is  a 
homeomorphism,  y?“^(bd (J /C  —  int [J £)  =  bbT  — intG 
is  also  a  closed  Gball,  as  claimed.  0 

Theorem  for  manifolds.  We  need  a  few  additional 
definitions.  The  harycentric  coordinates  of  a  point 
X  with  respect  to  a  simplex  ctt,  T  =  {tiQ,  •  ■  • , 
are  real  numbers  so  that  =  1  and 

~  unique  and  non-negative  if 

X  £  (Tt-  Let  K-  and  £  be  two  simplicial  complexes,  and 
let  /  :  vert  K,  -*  vert  £  take  the  vertices  of  any  simplex 
in  1C  to  the  vertices  of  a  simplex  in  £.  The  simplicial 
map  implied  by  /  is  :  [J  /C  — »■  [J  £,  which  maps  a  point 
X  e  (7t,T  =  {vo, .  to  g(x)  =  We 

will  use  the  fact  that  if  /  is  a  bijection  then  gr  is  a  home¬ 
omorphism.  The  barycenter  of  ct  is  bj  =  X^j_o 
and  the  harycentric  subdivision  of  1C  is 

sd/C  =  {conv  {br  ]  T  G  G}  ]  G  G  Cx:}. 


X 

(a)  (b)  (c)  (d) 

Figure  4.1:  (a)  The  space  X  is  a  closed  2-ball  in  the  form 
of  a  boomerang.  It  is  covered  by  the  restricted  Voronoi 
cells  of  six  points,  which  define  a  regular  CW  complex  TZ 
with  [J72.  =  X.  (b)  The  Delaunay  simplicial  complex,  X>x, 
consists  of  four  triangles  connecting  six  points,  (c)  The 
harycentric  subdivision,  sdX>Xt  has  the  same  underlying 
space  as  'Dx.  (d)  A  geometric  realization  of  nerve 
consists  of  a  simplicially  homeomorphic  copy  of  sd  T>x  (not 
shaded),  surrounded  by  a  collar  of  triangles  (shaded). 

constructed  inductively  by  connecting  b^  to  all  sim- 
plices  subdividing  the  proper  faces  of  cry.  The  star  of 
a  vertex  nGAiisstn  =  {crG/CluGcr}. 

We  will  use  these  tools  to  show  our  first  result  stated 
in  theorem  4.3  below.  The  proof  constructs  a  home¬ 
omorphism  between  X  and  [jPx  one  step  at  a  time. 
In  this  process,  the  pasting  lemma  of  point  set  topol¬ 
ogy  [16,  19]  is  employed.  It  can  be  stated  as  follows. 
If  y  ;  _v  y  and  j  B  -+  Y  are  continuous  maps 
that  agree  on  ACiB  and  A,  B  are  closed  in  A  U  B,  then 
h  :  A  U  B  ^  Y,  which  agrees  with  f  on  A  and  with  g 
on  B  is  continuous. 

Theorem  4.3  Let  X  C  be  a  compact  manifold, 
with  or  without  boundary,  and  let  S  C  R"^  be  a 
non-degenerate  finite  point  set  that  has  the  generic 
intersection  property  for  X.  If  5  has  the  closed  ball 
property  for  X  then  ^ 

Proof.  For  each  i,  define  Vi  =  {QVt.x  i  cardT  = 
m  +  1  -  i}  and  note  that  because  of  the  closed  ball 
property  all  elements  in  V,-  are  closed  i-balls.  We  in¬ 
ductively  construct  simplicial  complexes  JCi  and  home- 
omorphisms  (pi  :  [JV,-  — »■  (J/Cj,  so  that  /Ci_i  C  /Cj  and 
ipi  agrees  with  y?s_i  on  [JVi_i.  When  we  arrive  at 
K.  =  Km  and  (p  =  ipm  show  there  is  a  simplicial 
homeomorphism  between  K  and  sd  Vx  ■  The  result  fol¬ 
lows  because  X  =  IJVm  «  (J/C  Rs  [JsdX>x  =  \jTC>x- 
To  start  the  induction,  let  each  T  C  S  correspond 
to  a  point  vt  in  R®.  Let  e  be  large  enough  and 
choose  the  points  in  R*"  so  that  any  collection  of  sim- 
plices  of  dimension  up  to  m  spanned  by  these  points 
satisfy  the  properties  of  a  simplicial  complex.  Define 


^0  =  {vt  I  ^  C  5,  cardT  =  m+1,  fjW.x  7^  0}-  At 
this  stage  the  homeomorphism  i^o  :  Vo  ^  ICq  defined  by 
V^o(nVT,x)  =  vt  is  just  a  bijection  between  two  finite 
point  sets. 

Suppose  0  <  i  <  m  —  1  and  Kj  and  ipj  :  U  Vj  — ^ 
\JK.j  are  constructed.  Let  i  =  j  +  I  and  initialize 
Ki  =  JCj .  Let  T  C  S  with  card  T  =  m  +  1  —  i  so 
that  F  =  pjVr.x  7^  0  and  vt  is  not  yet  in  Ki.  Define 
G  =  fjVj.bdX-  We  add  simplices  to  Ki  that  will  allow 
us  to  extend  the  homeomorphism  so  it  includes  F  €  V,- . 
Specifically,  consider  all  sets  U  C  S,  with  T  C  U  and 
nV[/,x  7^  0-  The  corresponding  simplices  au  belong  to 
Kj  and  are  contained  in  (pj  (bd  F  —  int  G).  Add  all  sim¬ 
plices  a-uu{vT]  to  Ki-  To  extend  the  homeomorphism 
we  distinguish  two  cases.  Assume  first  that  G  =  0,  and 
therefore  F  C  int  X.  Since  pj  is  a  homeomorphism  and 
F  is  a  closed  i-ball,  bd  F  and  r?j(bd  F)  are  both  (i  -  1)- 
spheres.  It  follows  that  (JstriT  is  a  closed  i-ball,  and  a 
homeomorphism  px  :  T  IJ  st  uy  that  agrees  with  (pj 
on  bdf  can  be  constructed.  Now  assume  G  ^  By 
lemma  4.2,  bd  F  -  int  G  and  therefore  pj  (bd  F  —  int  G) 
are  closed  (i  —  l)-balls.  Furthermore,  bd  G  is  an  {i  -  2)- 
sphere.  Let  C  C  Ki  consist  of  the  simplices 
with  au  C  9Jj(bdG).  (JT  is  a  closed  (f  —  l)-ball,  and 
a  homeomorphism  p'r^,  :  G  — *■  (J  £  that  agrees  with  pj 
on  bdG  can  be  established.  Now  we  are  in  the  same 
situation  as  in  the  first  case,  and  a  homeomorphism 
Pt  :  F  — »■  (J  st  ■wt  that  agrees  with  p'j.  on  G  and  with 
Pj  on  bd  F  — int  G  can  be  constructed.  After  adding  all 
F  =  pl  Vt,x  with  cardT  =  m-f  1  —  i  in  this  fashion,  we 
get  Pi  by  combining  all  px  using  the  pasting  lemma. 

Observe  that  K  and  sdFx  contain  a  vertex  for  each 
T  C  S  with  PiVt.x  7^  0,  so  vertsdFx  =  {by  |  vx  G 
vertF}.  It  follows  that  /  ;  vert/C  — >■  vertsdFx  de¬ 
fined  by  f(vx)  =  by  is  a  bijection.  By  construction 
of  Ai,  if  a  collection  of  vertices  belong  to  a  common 
simplex  in  K,  then  their  images  belong  to  a  common 
simplex  in  sdVx-  It  follows  that  the  simplicial  map 
g  .[JK  implied  by  /  is  a  homeomorphism. 

Therefore,  Q/C  IJsdFx  =  I'be  assertion 

of  the  theorem  follows.  PI 

Remark.  As  illustrated  in  figure  4.1,  the  Delaunay 
simplicial  complex  restricted  by  X  is  related  to  the  com¬ 
plex  obtained  from  the  chains  of  the  regular  CW  com¬ 
plex  defined  by  the  Voronoi  cells.  Besides  being  smaller, 
an  advantage  of  the  Delaunay  simplicial  complex  is  that 
it  naturally  imbeds  in  the  same  space  that  contains  X 
and  S. 

Remark.  For  manifolds  of  dimension  3  or  less,  condi¬ 
tion  (B2)  is  implied  by  (Bl)  and  the  weaker  requirement 

(B2’)  fj  V^^bdX  is  either  empty  or  contractible. 

Of  course,  this  is  interesting  only  for  manifolds  wUh 
boundary,  for  otherwise  (B2)  and  (B2’)  are  void.  To 


see  why  (B2’)  is  sufficient,  let  1  <  m  <  3  and  suppose 
X  and  S  satisfy  the  assumptions  in  lemma  4.1.  Assume 
also  that  S  satisfies  condition  (Bl),  that  is,  for  every 
£  <  m  and  every  T  C  S  with  card  T  =  m  -f-  1  - 
F  =  nVr.x  is  either  empty  or  a  closed  Gball.  Note 
that 

G  =  Pl  VT,bdx  =  bdF  -  P  intpV[;,x 

TCUCS 

is  either  empty  or  an  (£  —  l)-manifold  contained  in  an 
(f— l)-sphere,  namely  bdF.  For£  <  3,  the  contractibil- 
ity  of  G  0  implies  that  G  is  a  closed  {£  —  l)-ball;  so 
(B2’)  implies  (B2).  For.  m  =  4,  (B2’)  implies  (B2)  if  the 
Poincare  conjecture  for  3-spheres  is  true,  see  e.g.  [1]. 

5  General  Topological  Spaces 

In  this  section,  we  generalize  the  result  for  compact 
manifolds  to  more  general  topological  spaces.  The  re¬ 
quirements  will  automatically  exclude  spaces  that  can¬ 
not  be  expressed  as  the  underlying  space  of  a  finite  reg¬ 
ular  CW  complex.  In  order  to  generalize  theorem  4.3, 
we  need  extensions  of  the  generic  intersection  and  the 
closed  ball  properties.  Let  X  C  R'*  be  a  topological 
space  and  let  5  C  R*^  be  a  non-degenerate  finite  point 
set.  S  has  the  extended  closed  ball  property  for  X  if 
there  is  a  regular  CW  complex  TZ,  with  X  =  (JTI,  that 
satisfies  the  following  properties  for  every  T  C  S  with 
n  Vt,x  7^  0: 

(El)  there  is  a  regular  CW  complex  Tlx  C  72.  so  that 

n  Vt,x  = 

(E2)  the  set  72y  =  {7  E  72  |  int  7  C  intpjVT}  contains 
a  unique  cell,  t]x,  so  that  r]x  C  j  for  every  7  E  72^, 

(E3)  if  r]x  is  a  j-cell  then  t]x  H  bdfjVT  is  a  {j  —  1)- 
sphere,  and 

(E4)  for  each  integer  k  and  each  fc-cell  7  E  72y  -  {rjx}, 
7  n  bd  Pl  Vy  is  a  closed  (k  —  l)-ball. 

We  call  Tjx  the  hub  of:72y.  Furthermore,  S-  has  the 
extended  generic  intersection  property  for  X  if  for  every 
T  C  S  and  every  7  E  72t  -  72y  there  is  a  5  E  72^  so 
that  J  C  6. 

It  is  not  difficult  to  see  that  if  X  is  a  compact  manifold 
and  S  has  the  generic  intersection  and  the  closed  ball 
properties  for  X,  then  the  extended  generic  intersection 
and  the  extended  closed  ball  properties  follow.  Indeed, 
the  regular  CW  complex,  72,  required  by  condition  (El) 
consists  of  all  non-empty  sets  fjVj^X  and  fjVTbdX, 
T  C  S.  Fix  a  subset  T  C  S  and  define  F  =  Pl'Vy_x 
and  G  =  f)  Vy^bdX-  If  non-empty,  F  and  G  are  closed 
balls,  and  if  G  0  then  T  y^:  0  and  the  dimension  of 
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F  exceeds  the  dimension  of  G  by  one.  There  are  three 
possible  cases.  If  F  =  G  =  0  then  plVy^x  =  F  =  0  and 
conditions  (E2)  through  (E4)  are  void.  If  F  0  and 
G  =  0  then  =  {F},  and  rjr  —  F  satisfies  conditions 
(E2)  and  (E3);  condition  (E4)  is  void.  If  F  0  and 
G  ^  0  then  Tl^  =  {F,  G},  ryr  =  G  satisfies  conditions 
(E2)  and  (E3),  and  7  =  F  satisfies  condition  (E4).  In 
any  case,  the  establishment  of  the  homeomorphism  in 
the  proof  of  theorem  4.3  can  be  viewed  as  introducing 
a  vertex  vt  for  r]T  and  connecting  it  to  the  simplices 
inductively  constructed  for  the  cells  in  •  This 

idea  also  works  in  the  general  case. 

Let  X  C  be  a  topological  space,  and  let  S  C  be 
a  non-degenerate  finite  point  set  that  has  the  extended 
generic  intersection  and  the  extended  closed  ball  prop¬ 
erties  for  X.  To  construct  a  homeomorphism  between 
X  =  U  7^  and  (J  Vx ,  we  consider  one  subset  T  C  S  at  a 
time,  in  order  of  non-increasing  cardinality.  Inductively, 
we  assume  the  cells  in  TIt  -  Tl^  are  already  mapped 
homeomorphically  to  appropriate  simplices.  We  extend 
the  homeomorphism  to  the  cells  of  72.^ ,  again  induc¬ 
tively  in  order  of  non-decreasing  dimension.  We  intro¬ 
duce  a  vertex,  vt,  for  the  hub,  r]T-  If  t]t  is  a  fe-ball, 
its  boundary  is  a  (I:  —  l)-sphere,  and  by  (E3)  and  the 
induction  hypothesis,  the  cells  in  this  {k  —  l)-sphere  are 
already  part  of  the  homeomorphism.  After  connecting 
Vt  to  the  simplices  that  correspond  to  the  cells  in  bd  rjT, 
we  can  extend  the  homeomorphism  to  tjt  ■  Every  other 
cell  in  IZt  is  treated  the  same  way,  only  that  the  reason 
the  homeomorphism  can  be  extended  is  now  a  combi¬ 
nation  of  the  two  induction  hypotheses.  This  implies 
the  generalization  of  theorem  4.3  to  topological  spaces 
other  than  manifolds. 

Theorem  5.1  Let  X  C  R'*  be  a  topological  spa.ce,  and 
let  S  be  a  non-degenerate  finite  point  set  that  has 
the  extended  generic  intersection  property  for  X.  If 
S  has  the  extended  closed  ball  property  for  X  then 
[jVx^X. 

6  Remarks  and  Further  Work 

This  paper  is  targeted  at  problems  requiring  the  dis¬ 
cretization  of  possibly  complicated  geometric  objects  in 
finite-dimensional  Euclidean  spaces.  Such  problems  are 
abundant  in  the  computational  science  literature,  see 
e.g.  Kaufmann  and  Smarr  [12],  where  the  discretization 
of  continuous  domains  is  common  practice.  The  dimen¬ 
sions  of  the  domain  and  the  imbedding  space  can  be 
the  same,  as  e.g.  common  in  fluid  dynamics  problems, 
or  they  can  be  widely  different,  as  in  the  study  of  many 
dynamical  systems.  The  restricted  Delaunay  simplicial 
complex  introduced  in  this  paper  is  a  general  method 
that  produces  simplicial  discretizations  in  all  cases. 


The  introduction  of  a  general  concept  typically  gives 
rise  to  many  specific  questions  and  directions  for  fur¬ 
ther  research.  We  see  three  directions  of  progressively 
more  basic  work  necessary  to  bring  restricted  Delaunay 
simplicial  complexes  closer  to  the  targeted  application 
areas.  The  first  is  the  design  of  efficient  algorithms 
that  constructs  the  Delaunay  simplicial  complex  of  a 
finite  point  set,  S,  restricted  by  a  domain  or  space, 
X.  Special  cases  under  different  assumptions  on  how 
X  is  specified  are  considered  in  [15,  4,  8].  The  second 
direction  is  the  design  of  methods  that  choose  finitely 
many  points  resulting  in  good  quality  discretizations  of 
X.  Such  methods  have  a  long  history  in  the  somewhat 
different  context  of  finite  element  analysis,  see  e.g.  [21]. 
The  third  direction  is  the  study  of  maps  from  a  prob¬ 
lem  specific  domain  to  a  space,  possibly  imbedded  in 
higher  dimensions,  that  depends  on  functionals  studied 
over  the  domain  or  on  approximate  solutions  thereof. 
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Abstract 

We  give  a  deterministic  polynomial  time  method 
for  finding  a  set  cover  in  a  set  system  {X,'R)  of 
VC-dimension  d  such  that  the  size  of  our  cover  is 
at  most  a  factor  of  0{dlog{dc))  from  the  optimal 
size,  c.  For  constant  VC-dimension  set  systems, 
which  are  common  in  computational  geometry,  our 
method  gives  an  O(logc)  approximation  factor. 
This  improves  the  previous  0(log|A|)  bound  of 
the  greedy  method  and  beats  recent  complexity- 
theoretic  lower  bounds  for  set  covers  (which  don’t 
make  any  assumptions  about  VC-dimension).  We 
give  several  applications  of  our  method  to  compu¬ 
tational  geometry,  and  we  show  that  in  some  cases, 
such  as  those  that  arise  in  3-d  polytope  approxi¬ 
mation  and  2-d  disc  covering,  we  can  quickly  find 
0(c)-sized  covers. 


1  Introduction 


A  set  system  (X,  U)  is  a  set  X  along  with  a  collec¬ 
tion  U  of  subsets  of  X,  which  are  sometimes  called 
ranges  [24].  Such  entities  have  also  been  called  hy¬ 
pergraphs  and  range  spaces  in  the  computational 
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geometry  literature  (e.g.,  see  [5,  9,  10,  11,  12,  13, 
14,  15,  18,  23,  24,  33,  35,  34,  40,  36,  38,  39]),  and 
they  can  be  used  to  model  a  number  of  interesting 
computational  geometry  problems. 

There  are  a  host  of  XP-hard  problems  defined 
on  set  systems,  with  one  of  the  chief  such  problems 
being  that  of  finding  a  set  cover  of  minimum  size 
(e.g.,  see  [20,  22]),  where  a  set  cover  is  a  subcol¬ 
lection  C  C  TZ  whose  union  is  X  and  the  size  of 
C  is  simply  the  number  of  sets  in  C.  A  related 
(in  fact,  equivalent)  problem  is  that  of  finding  a 
hitting  set  of  smallest  size,  where  a  hitting  set  is  a 
subset  H  C  X  such  that  H  has  a  non-empty  inter¬ 
section  with  every  set  R  in  71.  There  are  a  num¬ 
ber  of  problems  that  can  be  reduced  to  these  two 
problems,  and  many  of  these  problems  are  formu¬ 
lated  as  computational  geometry  problems.  Thus, 
our  interest  is  in  finding  minimum  set  covers  and 
smallest  hitting  sets  as  quickly  as  possible. 

Unfortunately,  the  corresponding  decision 
problems,  SET  COVER  and  Hitting  Set,  as  we 
will  call  them,  are  both  AP-complete  [22,  28]. 
Moreover,  even  if  each  element  of  X  is  contained  in 
just  two  sets,  the  Hitting  Set  problem  is  still  NP- 
complete  [28]  (and  is  known  as  Vertex- Cover). 
Thus,  unless  P  =  NP,  if  we  desire  a  polynomial¬ 
time  algorithm,  we  must  content  ourselves  with  an 
approximation  algorithm,  which,  if  we  let  c  denote 
the  size  of  a  minimum  set  cover,  produces  a  set 
cover  of  size  ac  for  some  (hopefully)  small  approx¬ 
imation  factor  a.  The  only  known  polynomial¬ 
time  approximations  algorithms  for  Set  Cover 
with  guaranteed  performance  ratios  are  the  greedy 
algorithm  [16,  27,  31],  which  achieves  an  approxi¬ 
mation  factor  of  a  =  (1  -F  In  A),  where  A  denotes 
the  size  of  the  largest  set  in  P,  and  an  algorithm  of 
Hochbaum  [25],  which  achieves  an  approximation 


factor  of  a  =  maxa;£x{|”^x|}i  where  TZ^  denotes 
the  set  of  all  i?  G  7^  containing  x.  Note  that  in  the 
former  case  a  can  be  as  large  as  (1  +  ln  |X|)  and  in 
the  latter  a  can  be  as  large  as  \7l\.  Thus,  as  worst 
case  bounds,  the  greedy  algorithm  achieves  the 
best  approximation  factor,  and  Johnson  [27]  shows 
there  are  set  systems  where  the  greedy  algorithm 
does  indeed  produce  a  set  cover  with  approxima¬ 
tion  factor  a  =  r2(ln  |X|  -f  1).  Interestingly,  Lund 
and  Yannakakis  [32]  have  recently  shown  that,  un¬ 
less  NP  C  DTIME[nP°^y  ”],  no  polynomial  time 
algorithm  can  approximate  Set  Cover  within  a 
factor  better  than  a  =  |ln|Y|  in  the  worst  case, 
and,  unless  NP  C  Bellare  et 

al.  [4]  showed  that  no  polynomial  time  algorithm 
can  approximate  Set  Cover  within  a  factor  bet¬ 
ter  than  oi  —  Jln|Y|  in  the  worst  case,  for  any 
constant  5  <  1/8. 

Our  results.  In  this  paper  we  give  a  (determin¬ 
istic)  polynomial-time  algorithm  that  finds  a  set 
cover  of  size  within  a  factor  of  a  =  0{d\og{dc)) 
of  the  size  c  of  the  minimum  set  cover,  where  d 
stands  for  the  VC-exponent  of  the  set  system  (see 
Section  2  for  definitions).  Thus,  for  set  systems 
with  bounded  VC-exponent,  we  actually  beat  the 
complexity-theoretic  lower  bounds  [4,  32]  (which 
make  no  assumptions  about  the  VC-exponent).  In 
section  4,  we  indicate  some  examples  on  which 
Hochbaum’s  method  [25]  or  the  greedy  meth¬ 
ods  [16,  27,  31]  perform  poorly,  and  show  that  our 
method  outperforms  them  in  some  cases. 

Our  algorithm,  which  is  actually  for  the  dual 
Hitting  Set  problem,  is  based  upon  a  deter¬ 
ministic  analogue  of  a  randomized  natural  selec¬ 
tion  technique  used  by  Clarkson  [17,  19],  Little- 
stone  [30],  and  Welzl  [44].  This  technique  can  be 
viewed  as  “algorithmic  Darwinism,”  for  we  itera¬ 
tively  select,  in  polynomial-time,  a  small-sized  sub¬ 
set  of  X  (known  in  the  literature  as  an  a- net  [24]) 
that  intersects  all  highly-weighted  sets  in  77,  and, 
if  this  set  is  not  a  hitting  set,  then  we  increase 
the  weight  of  the  elements  in  an  i?  G  77  missed 
by  this  set.  By  continuing  this  process  over  sev¬ 
eral  “generations”  we  guarantee  that  the  “fittest” 
elements,  which  belong  to  the  optimal  hitting  set, 
eventually  are  included.  Fortunately,  the  number 
of  generational  iterations  we  must  perform  is  at 
most  0(clog(|V|/c));  hence,  this  approach  yields 
a  (relatively  simple)  polynomial-time  algorithm. 


We  show  the  utility  of  our  approximation  algo¬ 
rithm  by  giving  several  applications  of  our  method 
to  problems  in  computational  geometry  and  learn¬ 
ing  theory,  including  polytope  approximation,  geo¬ 
metric  point-probe  decision  tree  construction,  and 
disk  cover.  Our  methods  improve  the  running 
time  and/or  the  approximation  factors  of  previ¬ 
ous  methods.  In  fact,  for  3-d  polytope  approxima¬ 
tion  and  2-d  disc  covering  we  show  how  to  adapt 
our  method  to  achieve  a  constant  approximation 
factor.  We  also  describe  an  implementation  of  our 
approach,  and  show  that  it  indeed  beats  the  greedy 
method  in  practice  in  some  cases. 

Before  we  describe  our  methods,  however,  let 
us  first  review  the  relevant  properties  and  defini¬ 
tions  regarding  the  VC-dimension  notion. 

2  Set  Systems  of  Finite  VC- 
Dimension 

In  this  section  we  recall  the  basic  facts  about  set 
systems  of  finite  VC-dimension.  Let  (V,  77)  be  a 
given  set  system.  Given  Y  C  X,  all  the  subsets  of 
Y  obtained  as  the  intersection  of  Y  and  R  ranging 
over  77,  form  a  set  system  called  the  system  in¬ 
duced  by  77  on  Y,  and  denoted  by  77|y.  Y  is  shat¬ 
tered  by  77  if  77|y  =  2^.  {X,  77)  is  said  to  have  VC- 
dimension  d,  if  d  is  the  smallest  integer  such  that 
no  d-l-1  point  subset  Y  C  V  can  be  shattered.  If  Y 
is  finite,  it  is  well  known  [42,  43]  that  the  number 
of  sets  of  77|y  is  less  than  ( )+•■•+  <  |Y|‘^, 

where  d  is  the  VC-dimension  of  (V,  77).  We  call  the 
VC-exponent^  the  infimum  of  all  numbers  s  such 
that  |77|y|  =  0(|Y|®)  for  any  finite  subset  Y  of  X. 
The  aforementioned  result  shows  that  s  <  d,  but 
equality  does  not  always  occur,  as  there  are  sys¬ 
tems  in  which  the  VC-exponent  is  non  integral  [2]. 
However,  the  VC-dimension  is  finite  if  and  only 
if  the  VC-exponent  is  finite  as  well;  s  never  takes 
values  in  the  interval  [0, 1],  and  it  is  0  if  an  only  if 
77  is  finite  [2]  (whereas  d  is  0  if  and  only  if  77  con¬ 
sists  of  one  set).  In  particular,  the  VC-exponent 
concept  only  makes  sense  for  an  infinite  set  sys¬ 
tem,  whereas  the  VC-dimension  is  more  general. 
Consider,  for  the  sake  of  an  example,  a  common 
set  system  arising  often  in  computational  geome¬ 
try  applications — the  halfspace  set  system.  In  this 

^This  value  has  also  gone  by  the  names  real  density  [2] 
and  scaffold  dimension  [23]. 
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system  X  is  taken  as  a  set  of  halfspaces  in  IR 
and  7^  is  taken  to  be  all  combinatorially  distinct 
ways  if  intersecting  halfspaces  in  X  by  a  simplex. 

It  is  well  known  (e.g.,  [2])  that  this  system  has  a 
VC-dimension  and  a  VC-exponent  of  d. 

The  dual  set  system  (7^,X*)  is  defined  by 
X*  =  {7^^  :  X  £  X},  where  consists  of  all 
the  sets  R  elZ  that  contain  x.  One  way  to  look  at 
dual  systems  is  to  consider  {X,'JZ)  as  a  (possibly 
infinite)  boolean  matrix  M  that  has  a  column  for 
each  X  £  X  and  each  row  corresponds  to  an  inci¬ 
dence  relation  for  a  set  i?  G  and  view  the  dual 
as  a  transpose  of  this  matrix.  It  is  also  well  known 
(e.g.,  see  Assouad  [2])  that  the  VC-dimension  d* 
of  the  dual  (7^,X*)  is  less  than  2'^+\  where  d  is 
the  VC-dimension  of  the  primal  {X,TZ). 

Let  (X,  7^)  be  a  set  system.  If  a  subset  N  CX 
intersects  each  set  R  £  TZ  oi  size  bigger  than  £|X|, 
then  we  call  N  an  e-net  [24].  As  has  been  observed 
by  Haussler  and  Welzl  [24],  set  systems  of  VC- 
dimension  d  admit  (l/r)-nets  of  size  0(dr  log(dr)). 
This  bound  has  been  improved  by  Blumer  et  al.  [7] 
to  0(dr  log  r)  and  this  has  been  proved  tight  by 
Komlos  et  al.  [29].  We  can  generalize  this  defini¬ 
tion  by  putting  an  additive^  weight  function  w  on 
2^.  In  this  formulation,  an  e-net  is  required  to 
intersect  every  set  of  weight  at  least  ew{X).  To 
allow  for  efficient  computation,  we  need  that  the 
set  system  be  described  somewhat  more  efficiently 
than  by  the  list  of  its  subsets. 

Definition  2.1  We  say  that  a  set  system  (X,77) 
has  a  subsystem  oracle  of  degree  D  if  there  is  an 
algorithm  which,  given  a  subset  V  C  X,  returns 
(y,77|y)  (as  a  boolean  matrix)  in  time  0(|y|®+^). 
R  is  a  witnessed  oracle  if,  for  any  set  R  of  IZ^y, 
can  provide  a  set  R'  of  TZ  such  that  R  =  R  n  V  in 
0(|X|)  time. 

If  (X,  1Z)  has  a  subsystem  oracle  of  degree  D, 
and  VC-exponent  d,  it  is  clear  that  d  <  D.  Un¬ 
der  this  assumption,  it  has  also  been  shown  [9] 
that  one  can  find  a  (l/r)-net  for  (X,77)  of  size 
0{dr\og{dr))  in  time  0(d)^^r^  log®  (r’d)|X|,  for 
both  the  uniform  and  weighted  cases. 

^The  term  additive  refers  to  the  fact  that  w{Y)  = 
X  w{y),  with  the  usual  abuse  of  notation  w{y)  = 


3  The  Main  Algorithm 

The  goal  of  this  section  is  to  prove  our  main  the¬ 
orem.  Let  s  be  a  non  decreasing  function,  let 
n  =  |X|,  and  let  m  =  \TZ\. 

Definition  3.1  An  s-net  finder  for  (X,  77.)  is  an 
algorithm  A  that,  given  r  and  a  weight  distribu¬ 
tion  w  on  X,  returns  a  {lfr)-net  of  size  s{r)  for 
(X,77)  with  weights  w.  Also,  a  verifier  is  an  algo¬ 
rithm  B  that,  given  a  subset  H  C  X,  either  states 
(correctly)  that  H  is  a  hitting  set,  or  returns  a 
nonempty  R  £TZ  such  that  RD  H  =  ^. 

Using  size(a;)  to  denote  the  length  of  the  en¬ 
coding  of  an  object  x,  let  us  say  that  A  runs  in 
T^(size(X,7^),size(w),r)  tW  and  that  B  runs 
in  TB(size(X,77),size(7T))  time.  We  will  suppose 
that  both  Ta  and  Tb  are  (non  constant)  polynomi¬ 
als,  so  that  T{0{x))  =  0(T{x))  and  Z)/c<a;T'(2^)  = 
0(T(2^)). 

Theorem  3.2  Let  (X,  77.)  be  a  set  system  that 
admits  both  an  s-net  finder  A  and  a  veri¬ 
fier  B.  Then  there  is  an  algorithm  AiA,B) 
that  computes  a  hitting  set  of  size  at  most 
s(4c),  where  c  stands  for  the  size  of  an  op¬ 
timal  hitting  set.  The  time  taken  by  A  is 
0(clog(n/c)  (T^(nm,nlog(n/c),c)  ■+-TB(nm,c)}. 

If  X  has  finite  VC-dimension,  then  we  may  im¬ 
plement  the  net  finder  using  the  greedy  method 
[13],  and  the  verifier  by  inspecting  all  of  (X,77), 
both  in  polynomial  time  (in  either  the  real  RAM 
or  bit  models).  But  under  standard  computational 
assumptions,  there  are  better  implementations  of 
the  net  finder  and  verifier. 

Corollary  3.3  Let  (X,77)  be  a  set  system  given 
by  a  witnessed  subsystem  oracle  of  degree  D, 
which  admits  a  hitting  set  of  size  c.  Let  d 
stand  for  the  VC-exponent  of  (X,77.),  and  as¬ 
sume  0  0  77.  Then  one  can  find  a  hitting 

^Throughout  the  literature  in  computational  geometry, 
the  unit-cost  (or  real  RAM)  model  is  usually  assumed.  In 
this  case,  size(u))  is  simply  n.  However,  due  to  to  the  partic¬ 
ular  relevance  of  our  algorithm  in  the  realm  of  AP-hardness, 
and  since  the  weights  can  (and  will)  become  exponential  in 
the  forthcoming  algorithm,  we  have  to  be  careful  that  the 
algorithm  still  runs  in  polynomial  time  in  the  bit  model. 
As  it  turns  out,  our  running  time  is  the  same  in  both  mod¬ 
els,  but  the  choice  of  the  encoding  and  the  value  of  Ta  are 
different. 
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set  for  {X,7Z)  of  size  0{dclog{dc))  in  T(|X|)  = 
O  log(|X|/c)|X| j  time  in  the  real 

RAM  model,  or  O  {cT{\X\)  log(|X|/c))  time  in  the 
bit  model. 

Proof:  It  suffices  to  show  how  to  implement  the 
net  finder  and  the  verifier  and  show  their  complex¬ 
ity  bound.  But  one  can  find  a  (l/r)-net  for  {X,TZ) 
of  size  O{drlog{dr))  in  O (d)^^ r'^^  log^ {rd)\X\ 
time,  for  both  the  uniform  and  weighted  cases,  us¬ 
ing  the  algorithm  of  Bronnimann,  Chazelle,  and 
Matousek  [9],  in  the  real  RAM  model.  In  the  bit 
model,  the  complexity  of  the  weights  would  have 
to  be  taken  into  account,  which  may  add  at  most 
an  0(c  log (|X I /c))  factor  to  the  running  time,  as 
proven  in  the  next  section.  As  for  the  verifier,  one 
simply  runs  the  witnessed  subsystem  oracle  for  H. 
If  the  oracle  fails  to  list  0  as  being  in  we  may 
conclude  that  H  is  a  hitting  set.  Otherwise,  we 
ask  for  a  witness  R  of  the  fact  that  0  G  Rih-  Thus 
RCiH  =  ih,  and  R  is  nonempty  as  0  0  7^.  The  time 
spent  by  the  verifier  is  +  |Aii).  Plugging 

these  bounds  into  Theorem  3.2  yields  the  corollary. 

□ 


Remarks.  (1)  If  we  change  the  definition  of  the 
VC-exponent  to  fit  better  the  VC-dimension  con¬ 
cept,  by  requiring  that  for  all  finite  Y  C  X  we 
have  |Rjyi  ~  ^  +  ■  •  •  +  which  is  also 

0{\Y\/d  -I-  1)^,  then  we  can  reduce  the  size  of  the 
hitting  set  to  O{dclogc)  [14]. 

(2)  For  a  particular  c,  the  algorithm  will  either 
say  that  there  is  no  hitting  set  of  size  c,  or  it  will 
output  a  hitting  set  of  size  0{dc\og{dc)),  which, 
of  course,  is  different  than  saying  that  there  is  a 
hitting  set  of  size  c. 

(3)  The  corollary  can  be  stated  with  d  as  the 
dual  VC-exponent,  with  an  oracle  for  the  dual 
set  system,  and  as  yielding  a  set  cover  of  size 
0{dc  log((ic)). 

We  now  turn  to  the  proof  of  Theorem  3.2. 

3.1  The  algorithm 

Let  us  assume,  for  the  time  being,  that  we  know 
the  size  c  of  a  smallest  hitting  set  (we  will  show 
later  why  this  is  a  reasonable  assumption). 


Our  strategy  can  be  thought  of  in  terms  of  evo¬ 
lutionary  biology,  in  that  it  is  based  upon  a  notion 
of  “survival  of  the  fittest.”  Intuitively,  we  want  to 
simulate  the  growth  of  a  population  where  some  el¬ 
ements  are  advantaged  because  they  hit  more  sets 
than  others.  The  idea  is  to  put  weights  on  the  el¬ 
ements  (initially  uniformly)  and  use  the  net  finder 
A  to  select  a  (l/2c)-net  of  size  s(2c).  If  it  doesn’t 
hit  a  particular  set  R  eTZ,  as  returned  by  the  ver¬ 
ifier  B  on  the  net,  we  double  the  weights  of  the 
points  in  R.  Because  of  the  definition  of  a  hit¬ 
ting  set,  at  least  one  point  in  an  optimal  hitting 
set  falls  in  R,  and  has  its  weight  doubled.  How¬ 
ever,  the  property  of  a  (l/2c)-net  implies  that  the 
weight  of  R  is  at  most  a  fraction  1  /2c  of  the  total 
weight,  and  the  total  weight  does  not  increase  too 
much.  Therefore,  we  soon  expect  an  optimal  hit¬ 
ting  set  to  be  included  in  the  chosen  set.  The  next 
lemma  shows  that  this  is  indeed  the  case. 

Lemma  3.4  If  there  is  a  hitting  set  of  size  c, 
the  doubling  process  cannot  iterate  more  than 
4clog(n/c)  times,  and  the  total  weight  will  not  ex¬ 
ceed  /c^ . 

Proof:  Our  proof  follows  arguments  of  Clark¬ 
son  [17,  19],  Littlestone  [30],  and  Welzl  [44].  Let 
R  be  a  hitting  set  of  size  c.  Because  the  set  R 
returned  by  B  at  each  iteration  satisfies  w{R)  < 
w{X)/2c,  the  weight  of  X  is  not  multiplied  by 
more  than  a  factor  1  -I-  l/2c  in  any  iteration.  How¬ 
ever,  HnR  is  not  empty,  by  definition  of  H.  There¬ 
fore,  after  k  iterations,  if  each  h  e  H  has  been 
doubled  Zh  times,  we  have 

1  ^  k 

w{X)  <  +  <  ne2c,  and 

2c 

w{H)  =  ^  2^'“,  where  E  Zh  >  k. 

heH  heH 

Using  the  convexity  of  the  exponential  function,  we 
conclude  that  w{H)  >  c2''/'’.  Since  w{H)  <  w{X), 
we  finally  have 

k  3^ 

c2c  <  ne2c  <  n24c  ^ 

from  which  k  <  4clog((:)  follows.  The  bound  of 
rA/c^  on  w{X)  is  an  immediate  consequence.  □ 

If  the  process  exceeds  this  guaranteed  number 
of  iterations,  that  only  means  that  there  is  no  cover 
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of  size  c.  Thus,  we  can  use  this  procedure  to  ac¬ 
tually  determine  an  approximate  bound  for  c.  To 
start  with,  we  conjecture  a  value  c!  of  c,  which  ini¬ 
tially  can  be  set  to  1.  In  general,  if  our  routine 
fails  to  find  a  hitting  set,  there  is  no  hitting  set  of 
size  so  we  increase  c!  by  a  factor  of  two.  At  the 
stage  when  we  find  a  hitting  set,  d  <  2c,  so  that 
the  hitting  set  returned  is  of  size  at  most  s(4c). 


3.2  £-nets  with  or  without  weights? 

Typically,  e-net  algorithms  are  designed  for  the 
uniform  case  not  for  the  weighted  case.  Here 
we  mention  a  simple  method  for  reducing  the 
weighted  case  to  an  unweighted  one,  as  outlined  by 
Matousek  [35].  First  scale  the  weights  such  that 
w{X)  =  n.  Then  take  [w{x)  +  Ij  copies  of  each 
element  x  &  X.  Note  that  the  multiset  X'  thus 
obtained  contains  all  the  elements  of  X  and  has  a 
cardinal  of  at  most  2n.  Take  then  an  s-net  for  the 
set  X':  it  is  also  an  £-net  for  the  original  set  X 
with  weights  w. 


3.3  Time  analysis 

For  a  given  d ,  it  is  clear  that  there  can  be  at  most 
4c'log(n/c')  iterations,  each  of  which  takes  time 
O  (Ta  (size ( A,  7^) ,  size (u) ) ,  c' )  +  Tb  (size ( A,  7^) ,  c' ) ) . 
But  the  total  weight  can  never  exceed  n^/c^,  which 
proves  that  size{w)  =  0(n log(n/c')).  When  d  in¬ 
creases  geometrically,  by  our  choice  of  polynomial 
functions  for  and  Tg,  the  sum  of  all  running 
times  is  dominated  by  their  last  term,  for  which 
c  <  d  <  2c.  Finally,  a  standard  encoding  of 
(A,  A)  gives  that  size(A,7^)  =  0{nm),  conclud¬ 
ing  the  proof  of  Theorem  3.2. 

Note  that  in  real  RAM  model,  the  size  of  the 
weights  is  irrelevant,  and  one  simply  accounts  for 
the  number  of  arithmetic  operations  and  not  their 
complexity.  As  argued  in  Section  6,  the  weights 
almost  never  go  past  a  few  digits  in  practice,  so 
such  an  assumption  is  not  unrealistic.  In  any 
case,  since  our  weights  are  always  multiples  of 
2,  we  can  encode  each  weight  using  a  register  of 
0(log  n)  bits  and  we  can  use  such  registers  to  per¬ 
form  any  weight  arithmetic  in  0(clog(n/c))  time 
(by  Lemma  3.4). 


4  Lower  bounds  for  other  meth¬ 
ods 

In  this  section  we  give  some  specific  cases  where 
our  method  improves  the  previous  methods.  We 
begin  with  the  greedy  method. 


4.1  The  greedy  method 

The  following  example  has  been  communicated  to 
us  by  Jifi  Matousek  (private  communication).  Let 
S  =  {pi^j  :  i  =  1,2,  j  =  1 .. .  2P+1  -  1}  be  a  set 
of  n  =  2P+^  -  2  distinct  points,  and  let  its  valid 
subsets  be  Sj  =  {pi^k  ■  i  =  1,2,  k  =  2X  . .  2-^+^  — 
1},  for  i  e  {1,2},  j  6  (0. .  .p},  as  well  as  the  two 
particular  sets  Tj  =  {pij  :  j  =  1 .. .  2^^^  —  1}  (see 
Figure  1).  The  greedy  method  will  pick  the  cover 
which  is  of  size  p  =  log(n  -t-  2)  -  2.  The 
optimal  cover  is  (Ti)j=i,2  and  is  of  size  2.  The  dual 
VC-dimension  of  our  example  is  2,  as  every  point 
is  covered  by  exactly  2  subsets,  and  our  algorithm 
(in  a  dual  setting  to  obtain  a  set  cover)  will  pick 
the  optimal  cover  for  a  value  of  c  =  2. 
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Figure  1:  A  greedy  method’s  worst  case  that  has 
finite  VC-dimension. 


4.2  Hochbaum’s  method 

In  Hochbaum’s  method,  one  denotes  the  set  sys¬ 
tem  of  m  sets  over  n  elements  as  a  0-1  {n,m)- 
matrix  A  each  column  Aj  of  which  is  the  inci¬ 
dence  vector  of  one  of  the  sets.  The  algorithm 
then  solves  the  linear  program  maxe^.y  subject 
to  y^.A  <  em,y  >  (0,  ...,0),  where  is  a  k- 
vector  whose  components  are  all  1.  Let  J  denote 
the  sets  of  tight  constraints  at  the  optimum  y* 
(in  mathematical  notation,  j  G  J  if  and  only  if 
{y*)'^.Aj  =  1).  Then  J  is  a  set  cover  within  / 
times  the  optimum,  where  /  is  the  maximum  row 
sum  of  A. 

Take  k  to  be  any  constant.  Let  us  put  n  =  (™) 
and  take  for  A  the  (n,  m)-matrix  whose  rows  are 
made  up  of  all  possible  vectors  with  m-fe  I’s  and  k 
O’s  (in  no  particular  order).  The  corresponding  set 
system  has  n  elements  and  m  sets.  By  symmetry. 
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the  method  will  pick  all  the  sets  in  the  cover,  which 
gives  a  cover  of  size  m,  guaranteed  within  m  —  koS 
the  optimum.  Any  k  +  I  elements  will  constitute 
an  optimal  cover. 

This  example  has  both  dual  VC-exponent  and 
dual  VC-dimension  k,  so  our  algorithm  will  return 
a  cover  of  size  0{k‘^  log  k). 

Remark.  Interestingly  enough,  the  bad  example 
for  the  greedy  (resp.  Hochbaum’s)  method  can  be 
solved  efficiently  with  our  as  well  as  Hochbaum’s 
(resp.  the  greedy)  method.  We  don’t  know  of  an 
example  for  which  our  method  would  outperform 
both  of  them. 

5  Applications 

In  this  section,  we  cover  mainly  computational  ge¬ 
ometry  applications,  therefore  it  is  customary  to 
assume  the  real  RAM  model  where  all  the  integers 
and  their  elementary  arithmetic  operations  have 
unit  complexity. 

5.1  Separating  polyhedra 

Let  Q  C  P  be  two  convex  polytopes  in  IR*^.  The 
problem  of  finding  a  separator  polytope  between 
P  and  Q  with  as  few  facets  as  possible  is  believed 
to  be  AP-hard,  even  in  the  three-dimensional 
case  [21],  but  one  can  find  good  approximations 
for  it.  In  [41]  Mitchell  and  Suri  cast  the  problem 
as  a  hitting  set  problem.  Let  K  =  P(Q)  denote 
the  set  of  supporting  hyperplanes  of  Q.  Let  the 
cover  set  system  be  (P,  {Pp  :  p  G  dP}),  where,  for 
any  point  p,  Tip  consists  of  those  hyperplanes  in 
P  for  which  p  and  Q  lie  on  opposite  sides.  (For 
definiteness,  we  agree  that  if  p  is  on  h,  then  h 
is  not  in  Tip.)  They  show  that  a  hitting  set  for 
the  cover  set  system  gives  a  subset  of  facets  of  Q 
whose  bounding  halfspaces  define  a  polytope  be¬ 
tween  Q  and  P.  A  polytope  Q'  between  Q  and 
P  such  that  each  facet  of  Q'  contains  a  facet  of 
Q  is  called  canonical.  The  smallest  (with  respect 
to  the  number  of  facets)  canonical  polytope  con¬ 
tained  in  P  can  be  obtained  from  a  minimal  hit¬ 
ting  set  of  this  set  system  and  is  within  d  times 
the  optimal  separating  polyhedron  (in  number  of 
faces).  Therefore  the  greedy  strategy  returns  a 
polyhedron  within  0{d?  log  |Q|)  of  the  optimal.  In 
a  recent  twist,  Clarkson  [19]  applies  ideas  he  used 


for  small-dimensional  linear  programming  to  give 
a  polynomial-time  randomized  method  that  pro¬ 
duces  an  approximation  within  0{d^\og{dc))  of 
the  optimal  c.  In  fact,  the  algorithm  of  this  paper 
is  a  deterministic  analogue  of  Clarkson’s  method  in 
our  more  general  framework^.  Interestingly,  how¬ 
ever,  for  the  important  case  of  d  =  3,  we  are  able 
to  improve  on  Clarkson’s  result  to  achieve  an  ap¬ 
proximation  factor  that  is  0(1). 

But  let  us  first  describe  our  method  for  gen¬ 
eral  d  >  4.  We  need  to  exhibit  an  appropriate  net 
finder  and  a  verifier.  Note  that  since  P  contains 
Q,  0  is  not  in  the  cover  set  system.  The  basic 
observation  is  that  the  cover  set  system  is  a  sub¬ 
system  of  the  halfspace  set  system,  and  therefore 
has  VC-dimension  and  VC-exponent  d  (since  d  +  l 
hyperplanes  define  at  most  2^^+^  -  1  regions  of  the 
space,  and  the  complexity  of  an  arrangement  of  m 
halfspaces  is  0[m‘^)).  Therefore  our  hope  is  to  find 
an  s-net  finder,  where  s{x)  =  0{dx\og{dx)). 

To  find  a  (l/r)-net  for  the  weighted  cover 
set  system,  we  simply  use  the  algorithm  of 
Bronnimann  et  al.  [9],  which  computes  a  weighted 
(l/r)-net  in  0(nr'^  log‘^(dr))  time,  using  the  reduc¬ 
tion  of  Section  3.2  to  take  care  of  the  weighted 
case.  This  provides  a  (l/r)-net  for  the  more  gen¬ 
eral  set  system  consisting  of  a// halfspaces,  but  this 
is  a  fortiori  a  (l/r)-net  for  the  cover  set  system, 
and  the  bound  on  its  size  is  still  0(dr  log(dr)). 

As  for  the  verifier,  let  denote  the  canonical 
polytope  with  k  faces  associated  with  the  hyper¬ 
planes  of  some  Y  C  Ti  of  size  k.  In  particular, 
=  Q-  All  we  are  asking  for  is  whether 
is  entirely  contained  in  P,  or  else  to  give  a  point 
of  dP  which  is  in  Y^.  We  can  simply  compute 
the  intersection  of  all  the  halfspaces  defined  by  the 
planes  Y  U  Ti{P)  and  test  whether  there  is  a  facet 
that  belongs  to  P.  If  so,  we  can  return  Tip  where 
p  is,  e.g.,  the  centroid  of  this  face.  Otherwise,  Y^ 
is  certainly  contained  in  P. 

We  can  find  a  (l/r)-net  in  O  log'^(dr)|Q| j 

time  and  verify  k  halfspaces  in  0{k  -f  \P\)^^^ 
time  (d  >  4)  or  in  0{{k  +  |P|)  log(/c  -f  |P|))  time 
(d  =  2,3).  Plugging  those  bounds  into  our  main 
theorem,  we  get  that 

Theorem  5.1  Let  Q  P  P  be  two  convex  nested 
polytopes  in  IR"^  (d  >  4)  with  a  total  of  n  facets. 

^Clarkson’s  exposition  is  restricted  to  polytopal 
problems. 
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It  is  possible  to  find  a  separator  of  size  within 
0{d^logc)  of  the  optimum  c,  in  a  deterministic 
time  0/  O  (^n  1-2 J  +  c‘^nlog'^(dc))  clog (n/c),  which 

IS  always  O  log'^  nj . 

Note  that  the  case  where  c  =  is  of  lit¬ 

tle  interest,  because,  in  this  instance,  our  algo¬ 
rithm  offers  no  better  a  performance  ratio  than 
the  greedy  method  (at  least  asymptotically).  Thus 
we  could  make  the  algorithm  faster  (for  the  same 
guaranteed  ratio)  by  switching  to  the  greedy 
method  when  c  becomes  too  large. 

But  in  three  dimensions,  we  can  actually  do 
much  better.  In  particular,  we  recall  from  [38,  37] 
that  the  three  dimensional  halfspace  set  system 
admits  e-nets  of  size  0(l/e),  and  as  indicated  in 
Section  3.2,  their  algorithm  can  be  extended  to 
the  weighted  case  as  well.  The  cost  of  the  com- 
putaition  of  a  (l/c)-net  of  size  0(c)  is  0(nc),  if 
c  <  n“  [37],  and  the  cost  of  verifying  is  0(n  log  n) 
as  argued  above.  Therefore  we  obtain  the  follow¬ 
ing  strengthening  of  our  previous  theorem: 

Theorem  5.2  Let  Q  C  P  be  two  nested  polyhedra 
in  with  a  total  of  n  facets,  one  of  them  being 
convex.  It  is  possible  to  deterministically  find  a 
separator  of  size  within  0(1)  from  the  size  c  of 
an  optimum  separator  in  0{nc{c  +  logn)  log(n/c)) 
time,  if  c<n°‘  for  some  a  >  0. 

In  particular,  it  should  be  noted  that  the  run¬ 
ning  time  is  always  0(ni+2a  logn),  which  improves 
on  the  O(n^)  time  bound  of  Mitchell  and  Suri’s 
method  [41]  for  the  general  case.  However,  for 
large  c  >  n“,  we  have  to  switch  to  a  O(n^)  method 
to  find  the  net  [38],  yielding  0(cnMog(n/c))  time. 
Note  that  this  is  still  O(n^),  whereas  the  time 
consumed  by  the  greedy  method  is  strictly  0(n'^) 
(though  Mitchell  and  Suri  reduce  it  to  O(n^)  when 
both  polyhedra  are  convex.) 

5.2  Decision  Trees 

In  [1],  Arkin  et  al.  describe  how  to  construct 
a  point-probe  decision  tree  for  a  set  of  non¬ 
degenerate  polygons  in  the  plane  whose  height  is 
s  -  1  -t-  [log(A:/(s  -  1))1,  when  given  a  hitting 
set  of  size  s.  For  some  applications,  the  result 
serves  to  decide  the  position  of  a  polygon  in  a 
scene  [1].  While  their  approach  is  more  general. 


the  next  result  shows  that  we  can  get  better  per¬ 
formance  ratio  in  the  height  of  a  decision  tree  for 
non-degenerate  (i.e.,  general  position)  inputs.  We 
define  the  point  set  system  of  a  family  S  of  plane 
objects  by  all  subsets  Sp  of  S  for  all  points  p,  where 
Sp  consists  of  all  objects  in  S  containing  p. 

Lemma  5.3  Let  P  be  a  polygon  in  the  plane,  S  be 
a  family  of  congruent  copies  of  P,  and  let  {S,IZ) 
be  the  point  set  system  of  S.  Then  {3,11)  has  VC- 
exponent  at  most  2  (the  constant  2  cannot  be  im¬ 
proved  in  general),  and  admits  a  witnessed  subsys¬ 
tem  oracle  of  degree  2.  The  exponent  is  still  2  if  P 
convex. 

Proof:  Clearly,  the  number  of  sets  of  (5,7^)  is 
at  most  the  number  of  cells  is  the  arrangement 
of  S.  If  all  m  polygons  in  S  have  k  vertices  each, 
their  arrangement  has  complexity  0(/c^m^),  which 
corresponds  to  the  complexity  of  the  arrangement 
of  their  supporting  lines.  The  complexity  drops 
to  0{km^)  if  the  polygons  are  convex,  which  pro¬ 
vides  somewhat  better  constants  for  the  shatter 
function.  However,  it  is  easy  to  come  up  with  ex¬ 
amples  for  which  those  bounds  are  tight  and  also 
reflect  the  number  of  different  subsets  in  the  point 
set  cover.  This  also  provides  the  construction  of 
the  witnessed  oracle:  The  oracle  constructs  the 
arrangement  and  simply  picks  a  point  in  each  of 
the  cells.  If  asked  to  provide  a  witness,  the  point  is 
tested  against  all  the  polygons,  in  0{m  log  k)  time. 
(Here,  we  treat  k  as,  a.  constant,  and  we  are  only 
interested  the  number  of  elements  of  S.)  □ 


Remark.  Clearly,  the  VC-dimension  depends  on 
the  number  of  sides  of  P.  If  5  consists  of  congruent 
copies  of  a  single  convex  set,  even  a  convex  poly¬ 
gon  with  infinitely  many  sides,  then  its  point  set 
system  (5,77.)  need  not  have  finite  VC-exponent 
(resp.  VC-dimension). 

Our  algorithm,  combined  with  the  above  ob¬ 
servation,  and  the  result  of  Arkin  et  al.  [1]  gives 
us  a  decision  tree  of  smaller  asymptotic  depth  than 
the  greedy  method  for  the  non-degenerate  arrange¬ 
ment  of  Arkin  et  al.^  Moreover,  our  method  can 
also  be  used  to  derive  point-probe  decision  trees 

®For  degenerate  arrangements  they  design  a  new  greedy 
strategy,  which  seems  not  to  be  describable  as  a  set  cover 
strategy. 
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of  improved  depth  for  any  set  of  general-position 
/c-gons  in  the  plane,  for  constant  k,  since  this  point 
set  system  also  has  bounded  VC-dimension  (as  the 
proof  above  shows  as  well). 

5.3  Disc  cover 

Let  5  be  a  set  of  n  points  in  the  plane  and  let  T> 
be  a  family  of  discs.  The  disc  cover  problem  is  to 
find  a  minimum  number  of  discs  in  V  that  cover 
the  points  in  S  [26].  This  problem  is  motivated 
by  VLSI  design  criteria  as  well  as  viewing  prob¬ 
lems  in  astronomy.  Matousek  et  al.  [38]  show  that 
this  set  system  admits  0(r)  sized  (l/r)-net  and, 
as  indicated  in  Section  3.2,  one  can  extend  their 
algorithm  to  weighted  point  sets.  The  resulting  al¬ 
gorithm  runs  in  0{rn\ogn)  time,  and  implies  the 
following: 

Theorem  5.4  Let  S  be  a  set  of  points  in  the 
plane,  and  T>  be  a  set  of  discs  whose  union  con¬ 
tains  S.  It  is  possible  to  find  a  disc  cover  of  S 
from  V  (a  subset  of  V  whose  union  still  contains 
P)  whose  size  is  no  more  than  0(1)  times  the  opti¬ 
mal  size  c  of  such  a  cover  in  0(c^nlognlog(n/c)) 
time. 

Note  that  our  method  never  takes  more  than 
O(n^logn)  time.  This  bound  contrasts  the  bound 
of  Hochbaum  and  Maas  [26]  for  finding  a  constant- 
factor  approximation  to  a  disc  cover,  as  their 
method  requires  all  the  discs  in  T>  to  have  the  same 
radius  and,  even  when  all  the  parameters  in  their 
method  are  optimized  for  the  running  time,  their 
method  takes  O(n^)  time. 

5.4  Learning  a  Union  of  Halfspaces  in 
Fixed  Dimension 

Let  H  he  a  set  of  s  halfspaces  in  IR^^,  and  write 

for  their  union,  and  H~  for  its  complement. 
Let  D  be  an  (unknown)  probabilistic  distribution 
on  A  learning  algorithm  A  for  iL+,  given  m 
examples  drawn  from  D,  outputs  a  hypothesis  G 
such  that,  with  confidence  1  -  d  (on  the  examples 
fed  to  the  algorithm),  G  verifies  D{H^  A  G+)  <  e. 
(A  denotes  the  symmetric  difference.)  An  example 
is  labeled  positive  if  it  falls  in  iL+,  negative  other¬ 
wise.  This  ensures  that,  for  subsequent  examples, 
the  label  as  computed  with  G  will  be  correct  (ac¬ 
cording  to  H)  with  probability  at  least  1  —  e. 


It  has  been  proved  by  Blum  and  Rivest  [6]  that 
this  problem  admits  no  proper  learning  algorithm 
(for  which  the  hypothesis  also  consists  of  s  halfs¬ 
paces)  whose  running  time  is  polynomial  in  d,  s, 
even  if  s  =  2  (unless  P  =  NP).  How¬ 
ever,  Baum  [3]  argues  that  this  only  shows  that  we 
should  look  for  hypotheses  with  more  halfspaces. 
Indeed,  Blumer  et  al.  [7]  and  Baum  [3]  have  pro¬ 
posed  algorithms  that  output  a  hypothesis  of  size 
0(s log m).  These  algorithms  are  polynomial  in 
fixed  dimension  (however,  the  dependence  in  the 
dimension  is  exponential).  Unfortunately,  with  the 
purpose  of  training  a  neural  net  in  mind,  we  see 
that  each  halfspace  of  the  hypothesis  corresponds 
to  a  gate  of  the  neural  net,  and  therefore  the  size 
of  the  neural  net  increases  with  precision. 

The  algorithm  of  Blumer  et  al.  proceeds  by 
first  labeling  the  examples,  then  forming  a  set  sys¬ 
tem  of  the  halfspaces  containing  only  positive  ex¬ 
amples,  and  finally  returning  a  small  set  of  halfs¬ 
paces  covering  all  the  positive  examples.  For  this 
last  step  we  can  use  our  method:  The  hypothesis  is 
guaranteed  to  have  size  0(ds  log(ds)),  independent 
of  the  number  of  examples.  Moreover,  we  can  im¬ 
prove  on  the  results  in  two  and  three  dimensions. 
This  proves 

Theorem  5.5  For  any  fixed  d  >  1,  given  s  halfs¬ 
paces  H  in  IR'^,  and  any  distribution  on  points  in 
IR^^,  there  exists  a  neural  net  of  size  0{dslog{ds)) 
which  can  be  trained,  in  time  polynomial  in  s,  e,  S 
and  with  confidence  1  —  d,  to  recognize  whether  a 
point  belongs  to  the  union  of  H  with  a  probability 
of  error  less  than  e.  The  size  of  the  neural  net  can 
be  brought  down  to  0(s)  if  d  =  2,3. 

The  result,  of  course,  is  also  valid  for  learning 
the  union  of  concepts  taken  from  a  concept  class 
that  has  finite  VC-dimension. 

6  Experimental  Results 

Our  algorithm  has  been  implemented  and  the  re¬ 
sults  will  be  described  in  [8].  There  are  many  pos¬ 
sible  choices  for  implementation,  as  we  have  many 
ways  to  compute  an  e-net  in  practice,  and  they 
result  in  different  performances. 

The  first  consists  in  choosing  a  random  sample 
of  size  0(dc log  c).  It  turns  out  that  this  performs 
badly  against  the  greedy  method,  as  the  values  of  n 
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we  consider  are  never  big  enough  to  justify  the  use 
of  our  method  with  the  random  sampling  against 
the  greedy  method. 

The  second  possible  choice  is  to  use  a  com¬ 
puted  e-net,  with  the  algorithm  of  Chazelle  and 
Matousek  [14],  and  this  seems  to  yield  results  com¬ 
parable  to  the  greedy  method. 

The  third  method  of  choice  is  to  compute  the 
net  using  the  greedy  method  [13].  Even  though 
this  does  not  guarantee  the  same  performance  as 
the  second  choice,  it  performs  well  in  practice,  and 
never  returns  anything  bigger  than  that  returned 
by  the  greedy  set  cover  method.  Moreover,  the 
lower  bound  as  returned  by  our  method  is  sub¬ 
stantially  higher  than  that  returned  by  the  greedy 
method.  Therefore,  if  it  doesn’t  produce  better 
hitting  sets,  at  least  it  is  able  to  give  better  lower 
bounds  in  practice. 

For  our  three  choices,  the  fear  that  weights 
might  increase  exponentially  and  therefore  require 
a  multi-precision  treatment  is  completely  dispelled 
by  empirical  observation.  In  the  largest  example 
we  tried  (of  size  about  1000^),  after  some  700  iter¬ 
ations,  the  maximum  weight  was  a  bare  16384 — as 
opposed  to  a  possible  2^°°.  This  strongly  suggests 
that  the  normal  precision  (of  32  bits)  will  suffice 
in  all  reasonable  instances  of  the  problem.  This  is 
due,  of  course,  to  the  fact  that  only  small  weights 
tend  to  be  doubled  through  the  algorithm. 

One  last  issue  concerns  the  generation  of  “ran¬ 
dom”  set  systems  of  finite  VC-dimension.  As  this 
is  a  hard  problem,  we  simply  generated  random  set 
systems  in  such  a  way  that  the  generation  process 
itself  guaranteed  the  finite  (parameterizable)  VC- 
dimension.  We  also  tested  the  method  on  more 
natural  set  systems  such  as  the  cover  set  systems 
of  two  polyhedra. 

7  Conclusions 

We  have  shown  an  approximation  algorithm  for 
the  hitting  set  and  set  cover  problems,  and  proved 
that  it  performs  well  if  the  VC-dimension  of  the 
set  system  is  bounded.  Furthermore,  our  algo¬ 
rithm  “defeats”  known  complexity-theoretic  lower 
bounds  [4,  32]  (which  make  no  assumptions  about 
the  VC-exponent),  by  being  adaptive:  the  com¬ 
petitive  ratio  depends  on  the  size  of  the  optimal 
solution.  We  have  exhibited  a  variety  of  compu¬ 
tational  geometry  problems  that  reduce  to  or  use 


hitting  sets,  and  for  which  the  set  systems  have 
bounded  VC-dimension. 

Our  algorithm  has  been  implemented,  and  the 
practical  results  are  encouraging. 

The  main  open  question  pertaining  to  this  re¬ 
search  is  whether  our  algorithm  can  be  made  to 
run  in  parallel  in  polylogarithmic  time  and  polyno¬ 
mial  number  of  processors.  It  seems  that  the  main 
problem  lies  in  the  iteration  process:  if  we  are  to 
double  the  weights  of  many  sets  at  once,  we  are 
unable  to  enforce  that  the  weights  of  many  points 
in  an  optimal  hitting  set  double  at  each  stage. 

An  e-net  is  a  hitting  set  for  the  heavy  subsets. 
The  algorithm  can  be  thus  adapted  to  find  an  al¬ 
most  minimum  sized  e-net.  A  related  problem  that 
would  be  of  great  practical  interest  is  to  find  a  sim¬ 
ilar  algorithm  for  e-approximations.  It  is  not  even 
known  if  finding  a  minimum  sized  approximation 
is  AP-complete,  much  less  if  there  is  any  algorithm 
that  returns  an  almost  optimal  e-approximation. 

The  greedy  method  is  shown  to  attain  an 
0(log  n)  approximation  ratio  in  the  polytope  sepa¬ 
ration,  but  it  might  be  possible  to  use  the  geometry 
to  show  that  it  in  fact  yields  a  constant  ratio.  Such 
a  claim  has  neither  been  proved  nor  disproved. 

Finally,  in  the  decision  tree  problem,  we  were 
unable  to  find  a  strategy  that  would  work  for  de¬ 
generate  arrangements.  Is  such  an  adaptation  pos¬ 
sible? 
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An  Efficient  Algorithm  for  the  Euclidean  Two-Center  Problem 
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Abstract 


We  present  a  new  algorithm  for  the  two-center 
problem:  “Given  a  set  S  of  n  points  in  the  real 
plane,  find  two  closed  discs  whose  union  contains 
all  of  the  points  and  the  radius  of  the  larger  disc 
is  minimized.”  An  almost  quadratic  O(n^logn) 
solution  is  given.  The  previously  best  known  al¬ 
gorithms  for  the  two-center  problem  have  time 
complexity  O(n^log^n).  The  solution  is  based 
on  a  new  geometric  characterization  of  the  op¬ 
timal  discs  and  on  a  searching  scheme  with  so- 
called  lazy  evaluation.  The  algorithm  is  sim¬ 
ple  and  does  not  assume  general  position  of  the 
input  points.  The  importance  of  the  problem 
is  known  in  various  practical  applications  inclu¬ 
ding  transportation,  station  placement,  and  fa¬ 
cility  location. 
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1  Introduction 

We  study  the  following  geometric  optimization 
problem  (the  “two-center  problem”):  “Given  a 
set  S  of  n  points  in  the  real  plane,  find  two  closed 
discs  whose  union  contains  all  of  the  points  and 
the  radius  of  the  larger  disc  is  minimized.”  The 
distance  is  measured  with  the  Euclidean  metric. 

This  is  an  instance  of  a  well-known  fc-center 
problem,  where  the  objective  is  to  find  k  clo¬ 
sed  discs  minimizing  the  maximum  radius.  The 
problem  stems  from  practice.  For  example,  we 
want  to  place  k  emergency  units  so  the  worst- 
case  response  time  to  each  of  n  given  sites  will  be 
minimized.  The  problem,  specifically  for  k  =  1, 
is  known  under  various  names:  minimum  span¬ 
ning  circle,  minimax  location  problem,  minimax 
detection  station  problem.  Euclidean  messenger 
boy  problem  (see  [FI,  F2,  HM]).  These  names 
tell  more  about  potential  applications. 

In  addition  to  having  a  practical  flavor  the 
problem  is  known  for  its  stimulating  impact  on 
inventing  new  algorithmic  techniques.  Since  the 
problem  was  posed  by  Sylvester  in  1857  [S]  se¬ 
veral  algorithms  for  fc  =  1  have  been  presen¬ 
ted.  Recent  computationally-efficient  solutions 
include  an  0{nlogn)  time  algorithm  based  on 
the  farthest  Voronoi  diagram  (see  Shamos  and 
Hoey  [SH]).  An  optimal  0{n)  algorithm  based 
on  a  powerful,  and  beautiful,  prune-and-search 
strategy  was  given  by  Megiddo  [M].  The  pro¬ 
blem  becomes  A^R-complete  when  the  parame- 
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ter  A;  is  a  part  of  the  input.  Therefore  it  is  in¬ 
teresting,  and  again  important  in  view  of  prac¬ 
tical  applications,  to  look  at  fixed  values  of  k. 
Specifically,  for  k  =  2  the  problem  was  studied 
by  Drezner  [D]  in  the  context  of  efficient  trans¬ 
portation.  He  gave  an  0{n^)  time  complexity 
algorithm.  Agarwal  and  Shamir  [AS]  provided 
the  first  algorithm  with  the  cost  close  to  qua¬ 
dratic.  Specifically,  their  methods  based  on  in¬ 
teresting  modifications  and  additions  to  the  Me- 
giddo’s  parametric  search  technique  resulted  in 
a  O(n^log^n)  algorithm.  An  essential  part  of 
the  algorithm  is  a  decision  procedure  by  Hersh¬ 
berger  and  Suri  [HS].  This  procedure,  given  a 
real  number  r  >  0,  determines  in  0{n\ogn) 
time  if  there  are  two  closed  discs  of  radius  r 
whose  union  contains  the  given  set  of  points. 
Moreover,  it  tells  if  the  radius  of  two  minimal  en¬ 
closing  discs  is  larger  or  smaller  than  r.  Another 
algorithm  with  the  same  0(n^  log^  n)  time  com¬ 
plexity  was  presented  by  Katz  and  Sharir  [KS]  at 
last  year’s  Symposium  on  Computational  Geo¬ 
metry.  Their  paper  brought  out  new  ideas,  ba¬ 
sed  on  expander  graphs,  for  this  and  other  geo¬ 
metric  optimization  problems.  The  complexity 
of  the  resulting  algorithm  for  the  two-center  pro¬ 
blem  was  the  same  0(n^  log^  n),  however. 

In  this  paper  we  will  present  a  new  algorithm 
for  the  two-center  problem.  The  algorithm  has 
0(n^  log  n)  time  complexity  and  is  based  on  new 
observations  regarding  the  geometry  of  the  pro¬ 
blem.  They  allow  for  an  efficient  search  in  the 
set  of  discs  that  are  candidates  for  the  opti¬ 
mal  solution.  The  algorithm  provides  more  in¬ 
sight  into  the  geometry  of  the  problem  than 
the  previous  ones.  (In  fact,  a  lack  of  a  di¬ 
rect  geometric  flavor  is  a  known  disadvantage 
of  parametric-search  based  algorithms  for  geo¬ 
metric  problems.)  Due  to  this  insight  the  resul¬ 
ting  algorithm  is  simple,  uniform,  and  works  for 
arbitrary  sets  of  points. 

2  Geometric  preliminaries 

Our  algorithm  is  based  on  a  new  characteriza¬ 
tion  of  the  optimal  discs.  To  proceed  furt¬ 
her  we  need  some  definitions.  Assume  that 


the  plane  is  endowed  with  a  standard  ortho¬ 
gonal  coordinate  system.  Consider  a  set  of  12 
unit  length  vectors  that  form  with  the  axis  OX 
an  angle  i  =  0...11.  Specifically,  Ui  = 
[cos((?’7r)/6), sin((?'7r)/6)].  With  each  point  p  in 
S  we  associate  the  set  of  12  cones  determined 
by  the  pairs  of  orthogonal  vectors  from  this  set 
of  vectors.  Each  cone  has  its  apex  at  p.  Intui¬ 
tively,  we  can  think  about  this  cone  as  the  first 
quadrant  (with  the  bounding  half-  lines)  of  the 
coordinate  system  centered  at  p  and  determined 
by  vectors  [0, 1],  [1,0].  All  the  other  cones  asso¬ 
ciated  with  p  can  be  obtained  by  consecutive 
rotations  of  this  quadrant  by  tt/G  (30°)  about  p. 
Clearly,  the  number  of  cones  is  0{n). 

Consider  now  the  set  S  oi  n  points.  We  say 
that  a  pair  [d* ,d*)  of  two  closed  discs  is  optimal 
for  S  if  all  the  points  in  S  are  contained  in  the 
union  d*  U  d*,  the  larger  of  these  two  discs,  say 
d*,  has  radius  r*  minimal  over  all  pairs  of  discs 
whose  union  contains  S,  and  d*  has  the  smallest 
possible  radius  for  r*.  Disc  d*  will  be  called  an 
optimal  disc,  li  Si  =  {p  G  S  :  p  E:  d*}  and  S2  = 
S\Si  then  d*  is  the  minimal  disc  containing  S2. 
d*  will  be  called  a  complementary  disc.  Clearly, 
an  optimal  pair  of  discs  is  a  solution  to  the  two- 
center  problem. 

There  are  several  simple  and  well-known  pro¬ 
perties  of  the  two  optimal  circles.  They  will  be 
u.seful  through  the  rest  of  the  paper  and  we  will 
list  them  below. 

Each  disc  in  the  optimal  pair  is  determined  eit¬ 
her  by  a  pair  of  points  in  S  or  by  some  three  of 
them.  That  is,  the  radius  of  a  disc  is  determined 
by  half  the  distance  between  some  two  points,  or 
is  equal  to  the  radius  of  the  circumscribing  circle 
for  some  three  points  in  S.  Note  that,  since  we 
do  not  assume  general  position,  the  points  deter¬ 
mining  the  optimal  discs  are  not  unique.  Howe¬ 
ver,  among  the  points  determining  the  optimal 
discs  there  are  points  with  special  properties. 
We  will  formulate  these  for  the  case  when  both 
discs  are  defined  by  three  points.  The  same  will 
hold  true  for  discs  defined  by  two  points. 

There  exist  determining  points  Ai,Bi,Ci  for 
d*  that  do  not  belong  to  the  interior  of  J*,  and 
there  are  A2,  i?2,  C2  that  are  not  in  the  interior 
oid*. 
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There  exist  points  Ai,i?i,Ci  (and  A2,  B21C21 
respectively)  such  that  all  angles  in  the  triangle 
/AAiBiCi  are  acute.  In  other  words,  there  are 
such  Ai,Bi,Ci  that  the  center  of  their  circum¬ 
scribing  circle  is  in  the  interior  of  AAiBiCi- 

The  above  facts  are  elementary  and  follow 
from  the  observation  that  otherwise  at  least  one 
of  the  discs,  contrary  to  their  optimality,  could 
be  made  smaller.  Hereafter,  by  the  determining 
points  for  d*  and  d*  we  will  mean  points  with 
the  above  properties.  Note  that  one  (or  two)  of 
the  points  determining  d*  (the  optimal  disc)  can 
lie  on  the  boundary  of  the  complementary  disc 
d*  (but  that  this  point  does  not  determine  d*  in 
the  above  sense). 

From  the  above  facts  it  is  easy  to  derive  that 
the  radius  of  d*  is  not  bigger  than  the  radius 
of  the  optimal  spanning  circle  (the  solution  to 
the  1-center  problem)  for  a  given  set.  To  this 
end,  consider  the  optimal  disc  containing  S  and 
split  S  into  two  sets  using  a  line  containing  the 
center  of  this  disc  and  not  containing  any  points 
in  S.  Each  of  these  subsets  can  be  enclosed  in  a 
smaller  disc. 

As  opposed  to  the  1-center  problem,  which  al¬ 
ways  has  a  unique  solution,  the  2-center  problem 
can  admit  more  than  one  optimal  pair  of  discs. 
For  example,  the  set  of  vertices  of  a  square  have 
two  symmetric,  optimal  solutions. 

Additionally  to  the  above  facts  we  will  use  a 
non-trivial  result  due  to  Megiddo  [M]  that  pro¬ 
vides  a  linear  time  algorithm  to  construct  the 
minimal  enclosing  disc  containing  a  given  set  of 
points.  This  algorithm  will  be  useful  for  our 
purposes. 

3  Ordered  sequences  of  discs  for 
cones 

We  will  associate  with  each  cone  a  fixed  num¬ 
ber  of  sequences  of  discs.  Consider  a  cone  C  for 
a  point  p  from  S.  Without  loss  of  generality, 
we  may  assume  that  p  is  at  the  origin  of  the 
coordinate  system  and  the  cone  coincides  with 
the  first  quadrant  of  this  system.  Otherwise  we 
can  apply  an  affine  transformation.  Consider 
Sc  =  S  D  C  and  its  convex  hull  CH{Sc),  and 


also  consider  the  set  of  discs  that  pass  through  p, 
that  contain  CH{Sc),  and  whose  circles  contain 
at  least  one  point  in  Sc  beyond  p.  The  locus  of 
their  centers  is  the  boundary  7  of  the  intersec¬ 
tion  of  those  halfplanes  determined  by  the  bisec¬ 
tors  of  the  segments  joining  p  with  the  vertices 
of  CH{Sc)  that  do  not  contain  p.  This  inters¬ 
ection,  and  the  boundary,  can  be  found  with  a 
standard  0(nlog  n)  algorithm  (see  [E]).  Another 
way  to  see  this  is  to  view  7  as  the  envelope  of  the 
set  of  the  above  described  bisector  with  the  view 
point  at  (+00,  0).  If  the  order  of  the  segments 
joining  p  with  the  vertices  of  CH{Sc)  is  known 
(that  is,  the  order  of  the  slopes  of  bisectors  is 
known)  then  this  envelope  can  be  found  in  0{n) 
time  with  a  modification  of  Keil’s  algorithm  for 
the  envelope  of  lines  [K]  (see  also  [V]). 

7  is  monotone  with  respect  to  the  y-axis.  This 
follows  from  the  fact  that  all  bisectors  of  the  seg¬ 
ments  for  points  in  the  first  quadrant  are  mono¬ 
tonic  and  that  the  intersection  of  halfplanes  is  a 
convex  set. 

Now  consider  the  points  qi,. . .  ,qm  of  S  that 
are  in  the  second  and  fourth  quadrants  of  the 
coordinate  system  (the  quadrants  are  numbe¬ 
red  in  the  counterclockwise  orientation,  with  the 
cone  corresponding  to  the  first  quadrant).  Each 
point  q  in  these  quadrants  determines  the  mi¬ 
nimal  disc,  with  respect  to  the  radius,  whose 
circle  passes  through  p  and  q  and  which  con¬ 
tains  CH{Sc)-  It  defines  a  sequence  of  di¬ 
scs  d{qij,. . .  ,d{qi^)  and  the  corresponding  se¬ 
quence  of  radii  . . .  ,ri^.  We  will  show  how 
the  order  of  these  radii  can  be  found  efficiently. 
We  wish  to  do  so  without  explicitly  finding  the 
discs;  this  would  kill  our  hopes  for  an  efficient 
algorithm.  Before  showing  the  method,  let  us 
explain  at  this  point  what  our  goal  is.  This  or¬ 
dered  sequence  of  radii  will  allow  us  to  perform 
a  binary  search  provided  that  some  kind  of  order 
can  simultaneously  be  obtained  for  the  comple¬ 
mentary  discs  (the  disc  for  the  complement  of  S 
with  respect  to  the  points  contained  in  the  first 
disc.)  Consider  a  point  q  and  the  segment  pq. 
The  center  of  the  disc  with  the  circle  passing 
through  p  and  q  is  located  on  the  perpendicular 
bisector  Ipg  of  pq.  In  order  to  find  the  center  of 
the  smallest  such  circle  that  in  addition  contains 
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CH(Sc)  we  need  to  consider  two  cases  that  de¬ 
pend  on  how  the  middle  point  M  of  the  segment 
pq  is  located  with  respect  to  7.  Its  location  de¬ 
pends  on  whether  the  disc  that  we  consider  is 
determined  by  three  or  by  two  points.  If  M  is 
on  that  side  of  7  that  contains  p  then  the  cen¬ 
ter  is  on  the  intersection  of  the  bisector  with  7. 
This  intersection  can  be  located,  with  the  help 
of  binary  search,  in  O(logn)  time.  The  radius 
of  the  disc  is  equal  to  the  distance  of  this  inters¬ 
ection  to  p.  Otherwise,  if  M  is  on  the  opposite 
side  of  7  from  p  then  M  is  the  required  center. 
Again,  the  radius  is  equal  to  the  distance  from 
M  to  p.  Clearly,  M  can  be  located  with  respect 
to  7  in  0(log  72)  time.  We  put  the  centers  of  di¬ 
scs  based  on  triples  of  points  or  based  on  pairs 
of  points  in  different  sequences.  Sorting  the  di¬ 
stances  yields  the  increasing  order  of  the  discs. 
Concluding,  we  have: 

Lemma  3.1  The  inci^easing  oi'der  of  the  discs 
associated  with  the  cone  and  determined  by  the 
points  in  the  second  quadi'ant  can  be  found  in 
0{nlogn)  time. 

Similarly,  we  can  find  the  order  of  discs  determi¬ 
ned  by  the  points  in  the  fourth  quadrant.  Note 
that  these  centers  of  discs  that  are  on  7  do  not 
need  to  be  sorted  since  the  distance  of  points  on 
7  to  p  is  a  monotonic  (unimodal,  to  be  precise) 
function.  The  smallest  radius  corresponds  to  the 
minimal  enclosing  disc  do  for  CH{Sc)  and  it 
splits  7  into  two  monotonic  subsequences  corre¬ 
sponding  to  the  second  and  the  fourth  quadrants 
respectively.  We  will  consider  each  such  part  of 
the  sequence  separately.  Note  that  the  above 
construction  works  also  if,  instead  of  considering 
the  set  of  all  the  points  in  the  second  and  fourth 
quadrants,  we  consider  only  their  subsets. 

4  Monotone  sequences  of  radii 

A  cone  Cp  associated  with  p  determines  an  or¬ 
thogonal  coordinate  system  centered  at  p  with 
the  axes  corresponding  to  the  vectors  defining 
Cp.  The  fii’st  quadrant  quadranti  coincides 
with  the  cone.  Let  quadrant2,  quadrant^,  and 
quadrant4  be  the  remaining  quadrants  (in  coun¬ 
terclockwise  order).  We  will  look  at  the  points 


Qt  =  S  f]  quadrant^  (the  first  quadrant  contains 
the  positive  axis,  i.e.  is  closed,  the  negative  x- 
axis  belongs  to  quadraiit^,  and  the  negative  y- 
axis  belongs  to  quadrant 4). 

As  we  know,  Cp  together  with  Q2  determines 
a  sequence  of  discs  ordered  with  respect  to  non¬ 
decreasing  radii.  (Similarly,  another  sequence  is 
defined  by  Cp  with  Q4.)  Instead  of  constructing 
the  sequence  for  all  points  in  Q2  {Q4)  we  will 
construct  it  only  for  those  points  in  Q2  {Q4,  re¬ 
spectively)  that  are  not  in  disc  do  which  is  the 
minimum  enclosing  disc  for  Qi.  Ignoring  those 
points  will  not  affect  the  correctness  of  the  over¬ 
all  algorithm  for  the  2-center  problem.  As  we 
will  show  later,  points  in  the  second  and  fourth 
quadrants  will  be  examined  as  potential  candi¬ 
dates  for  the  point  that  determines,  together 
with  the  apex  of  the  cone  and  another  point 
in  CH{Sc)  an  optimal  circle.  It  follows  from 
the  definition  of  the  sequences  and  from  simple 
geometric  facts  that  points  that  are  inside  do  are 
not  feasible  candidates  since  they  could  not  form 
an  acute  triangle  with  the  cone  center  and  ano¬ 
ther  point  in  CH{Sc)  and  simultaneously  pass 
through  the  cone  center  and  contain  all  points 
in  CH{Sc).  Denote  the  obtained  sequence  by 
(di)o^=(do,  hi, ... ,  dkc)-,  the  corresponding  radii 
by  r(d,),  and  the  set  of  points  of  S  in  di  by  dfS). 
The  sequence  starts  with  disc  do,  the  minimum 
enclosing  set  for  Qi  that  can  be  found  in  0{n) 
time  with  Megiddo’s  algorithm  [M].  Others  of 
these  sets  are  not  explicitly  known;  they  can  be 
found  with  an  additional  0{n)  time  per  set,  but 
this  is  too  costly  an  operation  for  us  so  we  do  not 
intend  to  compute  all  of  them.  Each  di  determi¬ 
nes  its  complementary  disc  di  which  is  the  mini¬ 
mal  enclosing  disc  for  the  complement  di{S)  of 
di{S),  i.e.  for  S  \  dj(S').  Denote  by  f,-  the  radius 
of  di.  It  gives  us  a  sequence  of  pairs  of  numbers 
{(^ij^i)}-  Let  us  emphasize  that  the  first  ele¬ 
ments  of  the  pairs  are  known  from  the  algorithm 
constructing  the  sequence  (see  the  previous  sec¬ 
tion).  However,  it  requires  an  extra  effort  (e.g. 
running  Megiddo’s  algorithm),  which  we  want 
to  avoid  whenever  possible,  to  find  f,-.  Direct 
search  of  the  sequence  for  the  optimal  pair  of  di¬ 
scs  (passing  through  p)  is  not  effective  since  (fj) 
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is  not  monotonic  in  general.  This  stems  from 
the  fact  that,  in  general,  the  sequence  of  sets 
di{S)  is  not  monotone  with  respect  to  inclusion. 
Before  proceeding  further,  in  order  to  see  where 
the  difficulty  lies,  let  us  analyze  what  happens  to 
the  sets  di{S)  of  points  in  the  sequence  (growing 
with  respect  to  the  radius)  of  discs  associated 
with  the  cone  and  Q2-  To  focus  our  attention, 
let  us  consider  those  sequences  with  discs  whose 
circles  pass  through  at  least  one  other  point  of 
S  in  the  cone  beyond  p.  The  growing  discs  con¬ 
tain  more  and  more  points  (in  the  sense  of  set 
inclusion)  from  Q2  and  Q3.  This  follows  from 
the  observation  that  their  circles  must  intersect 
in  Qi  in  order  to  contain  ^i.  Therefore,  there  is 
no  intersection  in  the  third  quadrant;  this  also 
translates  into  the  fact  that  all  points  in  Q2  (and 
Q3)  that  are  in  the  smaller  disc  must  be  also  in 
the  larger  disc.  By  the  same  argument,  the  discs 
can  loose  points  from  Q4  (the  fourth  quadrant) 
as  they  grow.  Because  of  these  points  in  Q4, 
the  monotonicity  (w.r.t.  inclusion)  of  di{S),  and 
therefore  the  monotonicity  of  S  \di{S),  can  be 
lost.  We  will  soon  see  that  there  are  yet  some 
cones  that  give  us  control  over  the  events  occur¬ 
ring  in  the  fourth  quadrant.  We  now  have  the 
following  theorem,  formulated  here  for  the  case 
of  the  optimal  disc  defined  by  some  three  points. 
The  case  of  discs  defined  by  some  two  points  is 
similar  and  will  be  discussed  later. 

Theorem  4.1  Let  d*  be  an  optimal  disc  (the 
larger  of  the  two  discs  in  an  optimal  pair)  whose 
circle  passes  through  p.  There  exists  a  cone  Cp 
for  p  such  that  {S  fl  Cp)  C  d*{S)  and  one  of  the 
following  holds  true: 

1.  All  three  points  determining  d*  are  in  Cp 
(p  being  one  of  them) 

2.  Cp  contains  two  points  determining  d*  (p 
being  one  of  them),  and  no  points  of  S  that  are 
in  the  optimal  disc  are  in  Q4  (or,  symmetr'ically, 
in  Q2)-  That  is,  d*  f]  Q4  =  9- 

3.  Cp  contains  two  points  determining  d*  (p 
being  one  of  them),  and  the  first  disc  di  (not  do) 
contains  all  the  points  in  Q4  (or  in  Q2,  respec¬ 
tively)  that  are  in  d* .  That  is,  (d*(5')  fl  Q4)  C 
di{S)  (or,  {d*{S)  n  Q2)  C  di{S)),  respectively. 

Let  A,  B,C  be  points  that  determine 


d*.  Let  the  projection  of  A  on  the  line  joining 
the  centers  of  d*  and  d*  be  the  closest  to  the 
center  of  d*,  and  let  the  projection  of  C  be  the 
farthest  (with  ties  broken  arbitrarily).  We  will 
carry  out  the  proof  under  the  assumption  that 
A  is  “below”  the  line  joining  the  centers  of  the 
discs,  and  that  we  rotate  cones  in  the  clockwise 
direction.  Other  cases  are  symmetric.  We  have 
the  following  three  possibilities. 

1.  There  exists  a  cone  with  p  =  A  that  contains 
both  B  and  C  such  that  no  point  in  S’  \  d*(S') 
belongs  to  this  cone.  Then  the  minimal  disk 
enclosing  the  points  in  this  cone  (and  passing 
through  A),  that  is  do,  is  an  optimal  disk. 


2.  The  above  case  does  not  hold  true  and  one 
of  the  angles  BAC  or  ABC  is  smaller  than  7r/6 
(30°).  We  will  consider  both  cases, 
a)  BAC  <  7r/6. 


Take  the  cone  for  A  that  contains  B  and  such 
that  the  next  cone  does  not  contain  B.  This 
cone  contains  some  points  in  S'  \  d*{S).  Now 
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take  the  similarly  oriented  cone  Cb  for  B  and 
consider  the  cone  second  next  to  it  (i.e.  rotate 
the  cone  about  B  by  tt/S.)  We  will  show  that 
this  cone  has  the  desired  properties.  This  cone 
determines  four  quadrants,  as  described  earlier. 
First,  note  that  C  belongs  to  this  cone  (the  first 
quadrant)  since  the  angle  ABC  is  at  least  7r/3, 
and  the  angle  between  AB  and  one  of  the  arms 
of  Cb  is  not  smaller  than  7r/2  and  not  bigger 
than  27r/3.  Note  that  no  point  of  S  in  the  op¬ 
timal  disc  belongs  to  the  fourth  quadrant.  This 
follows  from  the  fact  that  angle  ABO,  O  being 
the  center  of  d*,  is  smaller  than  tt/G  and  there¬ 
fore  a  half  line  of  the  tangent  line  to  d*  at  B  is 
in  the  first  quadrant.  Also,  no  point  in  S\d*{S) 
is  in  the  first  quadrant  (the  cone)  since  it  lies 
completely  in  the  halfplane  determined  by  the 
perpendicular  to  the  segment  joining  the  cen¬ 
ters  of  d*,  and  d*  through  point  B.  Hence,  the 
cone  has  the  required  properties, 
b)  ABC  <  tt/G. 


Take  the  cone  for  A  that  contains  B  and  such 
that  the  next  cone  does  not  contain  B.  Now, 
consider  the  cone  second  next  to  it  (i.e.  rotate 
the  cone  about  A  by  7r/3.)  We  will  show  that 
this  cone  has  the  desired  properties.  This  cone 
determines  four  quadrants,  as  described  earlier. 
C  belongs  to  this  cone  (the  first  quadrant)  since 
the  angle  BAC  is  smaller  than  7r/2  and  bigger 
than  7r/3.  Note  that  no  point  of  S  in  the  optimal 
disc  belongs  to  second  quadrant.  This  follows 
from  the  fact  that  angle  BAO,  O  being  the  cen¬ 
ter  of  d*,  is  smaller  than  tt/G  and  therefore  a  half 


line  of  the  tangent  line  to  d*  at  A  is  in  the  first 
quadrant.  Also,  no  point  in  S’  \  d*{S)  is  in  the 
first  quadrant  (the  cone)  since  it  lies  completely 
in  the  halfplane  determined  by  the  perpendicu¬ 
lar  to  the  segment  joining  the  centers  of  d* ,  and 
d*  through  point  A  (which  follows  from  the  fact 
that  angle  ABC  is  less  than  tt/G).  Hence,  the 
cone  has  the  required  properties. 

3.  The  first  case  does  not  hold  and  both  the 
angles  ABC  and  BAC  are  at  least  vr/G.  Note 
that  at  least  one  of  the  right  angles  centered  at 
A  or  B,  with  one  arm  passing  through  C  and 
the  other  not  intersecting  d*,  does  not  contain 
points  in  S  \  d*(S).  Assume  that  it  is  the  right 
angle  centered  at  A;  denote  it  by  Ra.  Take  the 
cone  determined  by  A  that  contains  C  and  the 
next  one  that  does  not  have  this  property.  We 
will  show  that  this  cone  has  the  desired  pro¬ 
perties.  First  note  that  the  first  quadrant  does 
not  contain  points  in  S'  \  d*{S). 


Those  points  that  belong  to  d*{S)  and  are  in 
the  fourth  quadrant  are  contained  in  the  disc 
dAc  with  the  diameter  AC.  Furthermore,  the  in¬ 
tersection  of  the  first  disc  di  in  the  sequence  ge¬ 
nerated  by  the  second  quadrant  with  the  fourth 
quadrant  does  not  contain  any  point  in  S\d*[S) 
as  it  is  completely  included  in  right  angle  Ra- 
This  follows  from  the  fact  that  the  circle  of 
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intersects  the  circle  of  cIac  at  A  and  in  the  arc 
AC.  This  intersection  also  contains  the  inters¬ 
ection  of  d*  and  the  fourth  quadrant  Q4.  The¬ 
refore,  di  contains  all  the  points  of  S  that  are 
in  d*{S)  n  Q4.  Hence,  the  cone  has  the  required 
properties.  □ 

The  immediate  consequences  of  the  above 
theorem  are: 

Corollary  4.1  1.  The  optimal  disc  is  defined 
by  the  minimum  spanning  disc  of  the  points  in 
some  cone;  or 

2.  the  optimal  disc  is  in  a  trailing  seg¬ 
ment  ds,...,dkc  of  the  sequence  of  discs 
do,  di, . . . ,  dk^  for  some  cone.  This  segment  con¬ 
sists  of  those  dj-  ’s  that  do  not  contain  any  points 
of  Q4.  Hence,  we  have  ds{S)  C  . . .  C  dkciS);  or 

3.  the  optimal  disc  is  in  an  initial  segment 
di, . . . ,  dfe^  of  the  sequence  di, . . . ,  d^^  for  some 
cone  Cp.  (Note  that  we  start  this  sequence  at  di 
rather  than  at  do).  This  initial  segment  consists 
of  those  di ’s  that  contain  the  same  points  0/(^4 
(orQ2)  as  disc  di  does.  Hence,  we  have  di(S')  C 
...Cdk>^{S). 

Note  that  the  segments  of  the  sequences  from 
items  2  and  3  of  the  above  corollary  can  be  fo¬ 
und  from  the  sequence  of  the  discs  in  (9(nlogn) 
time  using  a  simple  binary  search.  In  later  sec¬ 
tions  we  will  always  assume  that  the  sequences 
of  discs  are  shortened  to  appropriate  segments, 
as  described  above. 

Recall  that  the  monotonicity  (w.r.t.  set  inclu¬ 
sion)  of  di{S)  implies  the  monotonicity  (in  the 
opposite  direction)  of  the  complements  dfiS)  = 
S  \  di{S).  This,  in  turn,  implies  the  monoto¬ 
nicity  of  the  corresponding  radii  fi.  This  holds 
only  for  some  cones,  as  described  in  the  theo¬ 
rem.  Nevertheless,  we  will  soon  see  that  it  is 
sufficient  to  find  the  optimal  discs  efficiently. 

We  need  to  comment  why  the  third  quadrant 
is  ignored  in  building  sequences  of  candidate  di¬ 
scs.  Clearly  points  in  the  third  quadrant  can  be¬ 
long  to  d*.  However,  no  point  in  the  third  qua¬ 
drant  can  determine  an  optimal  disc  together 
with  p  (the  center  of  the  cone)  and  a  point  in 
the  first  quadrant  as  they  form  an  obtuse  tri¬ 
angle. 


We  will  end  this  section  with  results  for  an 
optimal  disc  that  is  determined  by  a  pair  rather 
than  a  triple  of  points.  In  this  case  the  segment 
joining  this  pair  of  points  is  a  diameter  of  the 
disc. 

Theorem  4.2  Let  d*  he  an  optimal  disc  whose 
circle  contains  point  p  ^  S .  There  exists  a  cone 
Cp  for  p  such  that  S  f]  Cf  C  d*{S)  and  one  of 
the  following  holds  true: 

1.  Both  point  determining  d*  (p  being  one  of 
them)  belong  to  Cp,'  or 

2.  The  other  than  p  point  determining  d*  is  not 
in  Cp.  Then  d*{S)  Pi  Q4  =  0  or  d*{S)  fl  (^2  =  0- 

Proof:  Let  A  and  B  be  the  points  determining 
d*.  Let  A  be  this  point  whose  projection  on  the 
line  joining  the  centers  of  d*  and  d*  is  closer  to 
the  center  of  d*  than  the  projection  of  C.  We 
will  carry  out  the  proof  under  the  assumption 
that  A  is  “below”  the  line  joining  the  centers 
of  the  discs,  and  that  we  rotate  cones  in  the 
clockwise  direction.  Other  cases  are  symmetric. 
Consider  this  cone  Ca  that  does  not  contain  any 
points  in  6'\d*(5')  and  the  previous  one  does  not 
have  this  property.  Such  a  cone  always  exists  as 
there  is  a  halfplane  determined  by  a  line  passing 
through  A  that  do  not  contain  any  points  in  S'  \ 
d*(S).  Ca  determines  four  quadrants  as  usual, 
the  cone  itself  being  the  first  one.  We  have  two 
cases: 

1.  B  belongs  to  Ca-  Then  the  cone  has  the 
required  properties. 

2.  B  is  not  in  Ca-  Assume  that  B  is  in  Q2  (the 
second  quadrant).  Then  the  tangent  line  to  d*  at 
A  passes  through  the  first  and  third  quadrants 
and  therefore  no  point  in  d*{S)  belongs  to  Q4. 

□ 

Note  that  the  above  proof  is  significantly 
simpler  than  its  counterpart  for  discs  determi¬ 
ned  by  triples  of  points.  One  of  the  reasons 
is  that  we  do  not  need  to  find  a  cone  that  ne¬ 
cessarily  contains  the  second  determining  point 
(beyond  p)  in  its  interior  as  it  is  required  in  the 
case  of  triples. 

To  gain  a  benefit  from  the  above  theorem  we 
need  another  straightforward  observation. 
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If  (i*  is  an  optimal  circle  determined  by  a  pair 
of  points  p  and  q  (with  p’s  projection  on  the 
segment  joining  the  centers  of  d*  and  d*  being 
closer  to  the  center  of  d*)  then  at  least  one  of  the 
right  angles  centered  at  p  with  one  arm  including 
pq  does  not  contain  points  in  5  \  d*{S).  This 
follows  from  the  fact  that  p  does  not  belong  to 
the  interior  of  d*  so  d*  can  not  overlap  these 
both  right  angles  at  the  same  time. 

Now,  the  above  theorem  leads  to 

Corollary  4.2  d*  is  either  identical  with 
do  or  it  belongs  to  the  sequence  of  discs 
di,  d2, .  . . ,  dc  as  defined  in  Section  2. 

4.1  Searching  with  lazy  evaluations 

The  searching  in  the  algorithm  will  be  based  on 
the  following  general  scheme.  Assume  that  we 
are  given  a  family  Vl  of  sequences  of  pairs  of 
numbers  {{afih))],  j  =  I . .  .m.,,  i  =  I . . .  k,  and 
set  m  =  Denote  by  0*  the  set  of  all 

pairs  {a,b)  G  0  such  that  b  <  a.  The  searching 
problem  is:  “Find  a  lexicographically  smallest 
pair  {a*,b*)  G  D*.”  That  is,  we  want  to  find  a 
pair  with  the  smallest  a,  b  not  greater  than  a, 
and  with  b  as  small  as  possible  for  a  given  a. 

Assume  that,  in  addition,  the  sequences  in  0 
have  the  following  properties.  There  exists  io 
such  that  {af^bf}  contains  {a*,b*);  {af]  is  a 
nondecreasing  sequence;  and  {6*°}  is  a  nonin¬ 
creasing  sequence.  In  other  words,  the  optimal 
is  contained  in  some  sequence  of  pairs 
whose  first  elements  form  a  monotonic  sequence, 
and  whose  second  elements  also  form  a  monoto¬ 
nic  sequence  (in  the  opposite  direction).  The 
collection  0  with  the  above  properties  will  be 
called  quasi-monotonic. 

We  are  interested  in  the  case  when  the  a’s 
are  known  and  the  b's  can  be  computed  with 
a  certain  cost  penalty.  To  make  the  discussion 
more  specific  assume  that  the  number  of  sequen¬ 
ces  k  =  0(n),  the  lengths  of  the  sequences  are 
0{n),  and  the  cost  to  find  each  b  is  also  0{n). 
In  this  case  a  standard  search  for  the  maximum 
would  result  in  O(n^)  time.  Due  to  the  special 
properties  of  quasi-monotonic  families  we  can 
perform  that  search  more  efficiently. 


The  following  simple  binary-like  search  can  ef¬ 
ficiently  locate  (a*,  6*).  For  each  sequence  per¬ 
form  a  binary  search  with  respect  to  the  a’s  in 
order  to  find  the  smallest  a  with  b  <  a  (a  local 
minimum).  If  the  a’s  form  a  nondecreasing  and 
the  b’s  a  nonincreasing  sequence,  a  binary  search 
will  correctly  locate  the  required  pair.  However, 
not  all  sequences  are  monotonic  with  respect  to 
the  a’s  and  b’s,  and  therefore  the  search  can  lead 
to  one  of  two  possible  outcomes.  Here  the  search 
is  guided  by  the  result  of  comparing  the  current 
a  with  the  corresponding  b  (that  must  be  eva¬ 
luated).  For  example,  if  6  >  a,  the  search  will 
continue  in  the  right  half.  Either  we  will  not 
find  any  pair  with  b  <  a  and  no  pair  will  be 
returned  or  the  search  will  continue  and  a  pair 
will  be  returned.  In  this  case  the  returned  pair 
is  not  guaranteed  to  be  the  smallest  in  a  given 
sequence.  Now,  from  all  the  returned  pairs  we 
choose  the  lexicographically  smallest.  This  must 
be  our  (a*,  6*)  since  it  is  an  element  of  at  least 
one  monotonic  sequence;  the  binary  search  cor¬ 
rectly  locates  it.  The  total  cost  of  the  search  is 
bounded  by  0{n^  logn).  Concluding,  we  have: 

Theorem  4.3  The  lexicographically  smallest 
pair  {a*,b*),  with  b  <  a,  of  a  quasi-monotonic 
collection  of  0{n)  sequences,  with  0(n)  pairs  in 
the  sequence  and  0{n)  penalty  to  evaluate  pairs, 
can  be  found  in  0(n^  logn)  time. 

5  Algorithm 

We  will  use  the  schema  of  searching  wdth  lazy 
evaluations  for  locating  the  optimal  solution 
{d*,d*).  We  have  the  following  lemma 

Lemma  5.1  The  family  Q  of  all  the  sequences 
associated  with  the  cones  is  quasi  —  monotone. 

Proof:  Follows  directly  from  the  construc¬ 

tion  of  the  sequences  and  from  Corollary  4.1  and 
4.2.  □ 

The  algorithm  works  as  follows.  For  all  the 
points  in  S  construct  the  corresponding  cones, 
and  the  sets  Qi,  Q2,  Q3,  and  Q4.  Find  the  mi¬ 
nimal  enclosing  disc  do  for  and  the  minimal 
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disc  do  for  S\d{S)  and  add  it  to  the  pool  of  can¬ 
didates.  Then  construct  ordered  sequences  of  di¬ 
scs  for  each  cone  as  described  in  Section  2.  This 
takes  O(nlogn)  time  per  cone  and  O(n^logn) 
for  all  cones.  From  these  sequences  we  can  con¬ 
struct  those  sequences  described  in  items  2  and 
3  of  Corollary  4.1  in  an  additional  O(nlogn) 
time  per  cone.  For  example,  with  the  help  of  a 
binary  search  we  can  locate  the  last  disc  in  the 
sequence  that  contains  the  same  points  of  Q4  as 
the  first  disc  di  of  this  sequence  does.  Based  on 
Theorem  4.1  and  4.2  the  set  of  sequences  con¬ 
tains  an  optimal  disc.  By  virtue  of  Lemma  5.1 
this  set  of  sequences  is  quasi-monotone.  Per¬ 
form  the  lazy  evaluations  search  on  this  set  and 
return  pairs  of  discs  into  the  pool  of  candidates. 
This  can  be  done  by  Theorem  4.3  in  0(n^  logn) 
time.  The  penalty  for  each  lazy  evaluation  is 
0{n)  if  Megiddo’s  algorithm  is  used  to  find  the 
complementary  discs.  Finally,  find  the  global 
minimum  (d*,  J*)  (or  all  pairs  of  discs  that  are 
optimal)  in  0{n)  time.  This  leads  to  the  follo¬ 
wing  main  result: 

Theorem  5.1  The  pair  of  discs  id*,d*)  that 
solves  the  two-center  problem  for  a  set  of  n 
points  in  the  Euclidean  plane  can  be  found  in 
(9(n^logn)  time. 


6  Conclusions 

We  have  presented  a  new  algorithm  for  the  two- 
center  problem.  Its  cost  of  0{n^  log  n)  is  the 
best  known  to  date.  The  algorithm  is  based 
on  an  interesting  insight  into  the  geometry  of 
the  problem  that  leads  to  a  simple  searching 
scheme  called  searching  with  lazy  evaluations. 
The  algorithm  is  robust  with  respect  to  dege¬ 
nerate  configurations  of  points.  It  also  handles 
uniformly  optimal  discs  that  are  defined  either 
by  three  or  by  two  points.  The  searching  scheme 
is  simple  and  therefore  it  will  be  interesting  to 
see  whether  or  not  there  is  some  room  for  im¬ 
provement  in  the  time  complexity.  Interesting 
directions  of  further  research  include  studies  on 
more  general  fc- center  problems  for  a  fixed  va¬ 
lue  of  k,  and  for  non-Euclidean  metrics.  We 


hope  that  the  geometric  properties  of  the  opti¬ 
mal  discs  presented  in  this  paper  will  find  more 
applications  in  geometric  optimization. 
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Abstract 

We  investigate  the  problem  of  finding  the  best  solu¬ 
tion  satisfying  all  but  k  of  the  given  constraints,  for 
an  abstract  class  of  optimization  problems  introduced 
by  Sharir  and  Welzl  —  the  so-called  LP-type  problems. 
We  give  a  general  algorithm  and  discuss  its  efficient  im¬ 
plementations  for  specific  geometric  problems.  For  in¬ 
stance,  for  the  problem  of  computing  the  smallest  circle 
enclosing  all  but  k  of  the  given  n  points  in  the  plane, 
we  obtain  an  0{nlogn  +  k^n^)  algorithm;  this  improves 
previous  results  for  k  small  compared  to  n  but  moder¬ 
ately  growing. 

We  also  establish  some  results  concerning  general 
properties  of  LP-type  problems. 

1  Introduction 

Smallest  enclosing  circles.  We  begin  by  dis¬ 
cussing  the  following  geometric  problem: 

Given  a  set  P  of  n  points  in  the  plane  and  an 
integer  q,  find  the  smallest  circle  enclosing  at 
least  q  points  of  P. 

This  problem  has  recently  been  investigated  in  several 
papers  [A1KS91],  [EE93],  [DLSS93],  [ESZ93],  [Mat93b] 
and  it  motivated  also  this  paper.  In  these  previous 
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works,  algorithms  were  obtained  with  roughly  0{nq) 
running  time^ .  There  seems  to  be  little  hope  at  present 
at  improving  these  bounds  substantially  in  the  full 
range  of  values  of  q.  In  particular,  as  observed  by  D. 
Eppstein  (private  communication),  for  q  close  to  n/2, 
a  subquadratic  solution  would  imply  a  subquadratic 
solution  for  several  other  basic  problems,  for  which 
one  seems  unlikely  with  present  methods  (information 
about  this  class  of  problems,  collected  and  publicized 
under  the  name  ‘n^-hard  problems’  by  M.  Overrnars 
and  his  coworkers,  can  be  found  in  [G09.3];  see  also 
Erickson  and  .Seidel  [ES93]  for  lower  bounds  in  a  weak 
model  of  computation  for  some  problems  of  this  type). 

In  this  paper  we  investigate  improvements  over  the 
roughly  0{nq)  bound  in  the  situation  when  k  =  n  —  q 
is  small  compared  to  n  (all  but  few  points  should  be 
enclosed  by  the  circle);  this  question  was  raised  e.g.,  by 
Efrat  et  al.  [ESZ93].  One  of  the  first  methods  com¬ 
ing  to  mind  for  solving  the  problem  is  to  construct  the 
gth  order  Voronoi  diagram  for  the  point  set  P,  and  find 
the  required  circle  by  inspecting  all  its  cells  (this  ap¬ 
proach  was  pointed  out  by  Aggarwal  et  al.  [AIK.S91]). 
It  is  known  that  the  combinatorial  complexity  of  the 
^th  order  Voronoi  diagram  is  0((n  —  q)q),  and  it  has 
been  shown  recently  that  it  can  be  constructed  in  ex¬ 
pected  time  0{n  log®  n  +  {n  —  q)q  log  n)  [AdBMS94],  (see 
also  [Cla87],  [AM91]  for  previous  results).  In  our  set¬ 
ting,  this  says  that  the  smallest  circle  enclosing  all  but 
k  points  can  be  found  in  0{n  log®  n  -f-  nk  log  n)  time. 

In  this  paper  we  show  that  one  can  do  still  better  for 
small  k,  namely  that  the  problem  can  be  solved  in  close 
to  linear  time  with  k  as  large  as 

Theorem  1.1  The  S7nallesi  circle  containing  all  but  at 
most  k  of  the  given  n  'points  in  the  plane  can  be  com¬ 
puted  in  0{n\ogn  -f-  k^iP)  time^ . 

^Here  are  the  specific  running  times:  Eppstein  and  Erick¬ 
son  [EE93]  solve  the  problem  in  Ofngloggr  -|-  nlogn)  time  with 
Ofnlogn  nq  log?)  space,  and  Datta  et  al.  [DLSS93]  give 
an  algorithm  with  the  same  rumiing  time  and  space  improved  to 
0(n -h  log  g).  Efrat  e(  al.  [ESZ93]  acliieve  Ofnglog^  n)  time 
with  0{nq)  space  or  alternatively  Ofnglog^  nlog(n/g))  time  with 
Ofnlogn)  space,  and  the  author  [Mat93b]  has  0{nlogn  -|-  nq) 
time  with  0{nq)  space  or  time  0(n logn  -|-  nqlogq)  with  0{n) 
space. 

^Throughout  the  paper,  e  in  exponents  stands  for  a  positive 
constant  which  can  be  made  arbitrarily  small  by  adjusting  the 
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A  predecessor  of  our  technique  is  a  result  of  Megiddo 
[Meg83],  who  demonstrated  that  the  fe  =  0  case,  the 
smallest  enclosing  circle  for  n  points,  can  be  solved  in 
0(n)  time. 

LP-type  problems.  The  problem  of  finding  the 
smallest  enclosing  circle  belongs  to  a  class  of  optimiza¬ 
tion  problems  known  as  LP-type  problems  (or  ‘gener¬ 
alized  linear  programming’  problems).  This  class  was 
introduced  by  Sharir  and  Welzl  [SW92].  It  captures 
the  properties  of  linear  programming  relevant  for  the 
description  and  analysis  of  their  linear  programming 
algorithm.  The  definition  and  more  information  on  LP- 
type  problems  will  be  given  later;  here  we  only  mention 
few  important  facts. 

Each  LP-type  problem  has  an  associated  parameter, 
the  so-called  combinatorial  dimension  (or  dimension  for 
short).  For  instance,  for  a  feasible  linear  programming 
problem  (one  with  a  solution  satisfying  all  the  con¬ 
straints),  this  combinatorial  dimension  equals  to  the 
geometric  dimension  of  the  underlying  space.  Random¬ 
ized  algorithms  are  known  for  solving  an  LP-type  prob¬ 
lem  with  n  constraints  and  of  dimension  bounded  by 
a  constant  in  0{n)  expected  time,  see  [Cla88],  [SW92]. 
For  a  linear-time  deterministic  algorithm  under  some¬ 
what  more  restrictive  assumptions  see  [CM93].  The  al¬ 
gorithms  have  originally  been  developed  for  linear  pro¬ 
gramming,  but  under  suitable  computational  assump¬ 
tions  they  work  for  all  LP-type  problems.  At  the  same 
time,  the  class  of  LP-type  problems  turned  out  to  con¬ 
tain  also  many  other  optimization  problems  of  geomet¬ 
ric  flavor  besides  linear  programming,  such  as  the  small 
enclosing  circle  computation  or  finding  the  distance  of 
two  convex  polyhedra.  See  also  [Kal92],  [MSW92]  for 
algorithms  with  expected  running  time  subexponential 
in  the  dimension,  [Gar92]  for  another  but  related  ab¬ 
stract  class  of  optimization  problems  and  [Ame93]  for 
further  properties  and  applications  of  the  LP-type  class. 

The  problem  of  finding  the  best  solution  satisfying  all 
but  at  most  k  of  the  given  constraints  can  be  posed  and 
solved  within  the  framework  of  LP-type  problems.  We 
need  that  the  considered  LP-type  problem  is  nondegen¬ 
erate.  In  geometric  situations,  this  roughly  means  that 
the  constraints  are  in  general  position,  which  can  usu¬ 
ally  be  achieved  by  perturbation  techniques  similar  to 
the  ones  for  removing  degeneracies  in  geometric  prob¬ 
lems.  However,  one  has  to  proceed  carefully  so  that  the 
combinatorial  structure  of  the  problem  is  not  changed 
in  an  undesirable  way.  Exact  definitions  for  the  notions 
in  the  following  theorem  are  given  in  the  next  section. 

Theorem  1.2  Let  d  be  a  constant,  let  {H,w)  be  a  non¬ 
degenerate  LP-type  problem  of  dimension  d  with  n  con¬ 
straints  and  let  k  be  a  given  integer,  1  <  k  <  n.  Then 

parameters  of  the  algoritlims.  Multiplicative  constants  implicit 
in  the  0()  notation  may  depend  on  e. 


the  minimum  basis  violated  by  at  most  k  of  the  con¬ 
straints  can  be  computed  in  0{nk'^)  time,  by  finding  op¬ 
tima  for  0{k'^)  LP-type  problems  of  the  form  {G,  w) 
with  G  C  H . 

Paper  overview  and  further  results.  In  sec¬ 
tion  2,  we  first  review  the  definitions  and  some  proper¬ 
ties  for  LP-type  problems  (sec.  2.1).  Then  we  define  and 
discuss  nondegenerate  LP-type  problems  (section  2.2). 
In  section  2.3  we  consider  bounds  on  the  number  of 
bases  violated  by  at  most  k  constraints  and  by  exactly 
k  constraints,  phrasing  results  and  proofs  known  for  lin¬ 
ear  programming  in  the  LP-type  problems  setting.  In 
section  2.4  we  give  a  simple  algorithm  for  finding  the 
minimum  basis  violated  by  at  most  k  constraints  in  a 
nondegenerate  LP-type  problem,  and  we  also  show  that 
our  notion  of  “removing  degeneracies”  is  meaningful  in 
this  context. 

The  general  algorithm  can  sometimes  be  imple¬ 
mented  more  efficiently  in  specific  geometric  situations 
using  dynamic  data  structures.  Theorem  1.1  is  one  such 
case.  In  section  3,  we  discuss  this  and  some  other  geo¬ 
metric  problems,  mainly  the  following  one  (derived  from 
linear  programming): 

Problem  1.3  Given  a  collection  H  of  n  closed  halfs¬ 
paces  in  ]R‘^  and  an  integer  k,  1  <  k  <  n,  find  the 
lexicographically  smallest  point  of  IR'^  contained  in  all 
but  at  most  k  of  the  halfspaces  of  H . 

Other  geometric  problems  amenable  to  a  similar 
treatment  will  be  only  mentioned  without  going  into 
details. 

After  we  deal  with  technical  obstructions  concern¬ 
ing  degeneracies.  Problem  1.3  can  be  solved  using  the 
general  algorithm  from  Theorem  1.2.  There  are  two 
substantially  different  cases:  if  the  linear  programming 
problem  defined  by  LI  is  feasible  (that  is,  the  intersec¬ 
tion  of  all  halfspaces  of  H  is  nonempty),  the  dimension 
of  the  corresponding  LP-type  problem  is  d,  while  for 
an  infeasible  problem  the  dimension  is  d  -|- 1  (note  that 
“feasible” ,  “infeasible”  refers  to  the  existence  of  a  point 
in  all  halfspaces  of  LI,  not  to  the  existence  of  a  point 
lying  in  all  but  at  most  k  halfspaces  of  H). 

By  using  dynamic  data  structures,  the  complexity  of 
the  general  algorithm  can  be  improved.  The  results  are 
summarized  in  the  following  theorem  (we  do  not  write 
out  the  improved  complexities  for  larger  dimensions,  as 
the  improvements  become  less  significant): 

Theorem  1.4  (i)  In  fAe  feasible  case,  Problem  1.3  can 
be  solved 

•  in  time  0{nk^)  by  the  general  algorithm,  for  any 
fixed  d, 

•  in  time  0{nlogn  +  k^n^)  in  dimension  3,  and 
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•  in  time  0{n\ogn  +  in  di¬ 

mension  4- 

(ii)  In  the  infeasible  case,  we  get 

•  0{nk‘^'^^)  time  for  the  general  algorithm, 

•  (9(n  log  ?z  +  log^  7i)  time  in  dimension  2  and 

•  O {n  log  n  k'^n^)  time  in  dimension  3. 

Let  us  summarize  previous  work  on  Problem  1.3.  In 
the  feasible  case,  the  problem  can  be  rephrased  as  find¬ 
ing  the  lexicographically  smallest  point  of  the  ^-level  in 
an  arrangement  of  n  hyperplanes. 

In  the  plane,  there  is  an  0(n  log^  n)  algorithm  (this  is 
why  we  do  not  mention  the  planar  case  in  Theorem  1.4). 
It  is  based  on  the  observation  that  for  a  given  horizontal 
line,  one  can  determine  in  0(?ilog?i)  time  whether  it 
intersects  the  i-level.  Then  parametric  search  is  used 
to  determine  the  low'est  line  which  still  intersects  the 
L'-level.  This  algorithm  (implicitly)  appears  in  Cole  et 
al.  [CSY87].  It  is  not  clear  at  present  how  efficiently 
can  this  approach  be  generalized  to  higher  dimensions. 

A  natural  approach  in  higher  dimension  is  to  con¬ 
struct  the  ^;-level  or  the  (<  ^)-level  in  the  arrangement®. 
In  dimension  3,  an  output-sensitive  algorithm  is  known 
which  constructs  the  fc-level  in  0{n^(b  -f  n))  time, 
where  b  denotes  the  complexity  of  the  constructed  level 
[AM91].  The  worst-case  complexity  of  the  ^-level  is 
0(n®/®)  [DE93]  (see  also  [ACE+91]).  In  higher  dimen¬ 
sions,  only  weak  worst-case  bounds  on  the  A;-level  com¬ 
plexity  are  known,  and  also  output-sensitive  construc¬ 
tion  algorithms  become  less  effective,  so  that  one  may 
consider  constructing  the  whole  (<  A;)-level.  This  can 
be  done  in  expected  0(nk^  log  j)  time  in  dimension 
3  and  in  worst-case  optimal  0(nL^/®-l  )  expected 

time  in  dimension  d  >  4  [Mul91].  For  k  much  smaller 
than  n,  the  level  construction  approach  thus  becomes 
quite  inefficient  in  higher  dimensions.  Let  us  remark 
that  the  picture  is  very  different  for  “random”  problem 
instances,  as  Mulmuley  [Mul91]  proves  that  for  hyper¬ 
planes  randomly  chosen  from  several  natural  distribu¬ 
tions,  the  (<  A;)-level  expected  complexity  as  well  as 
expected  construction  time  are  0{nk‘^~^). 

Problem  1.3  in  the  infeasible  case  was  investigated 
in  several  papers.  For  instance,  Cole  et  al.  [CSY87] 
and  Everett  et  al.  [ERvK93]  considered  the  follow¬ 
ing  weak  separation  problem  in  the  plane:  Given  two 
point  sets  R  and  B  with  n  points  in  total,  find  the 
smallest  k  for  which  there  is  a  line  I  and  a  fc-point  set 
E  C  Rli  B  such  that  all  points  oi  R  \  E  lie  on  one 
side  of  I  and  all  point  ol  B  \  E  lie  on  the  other  side. 
In  a  dual  setting,  this  amounts  to  finding  the  smallest 
k  for  which  there  exists  a  point  in  the  plane  contained 

^Even  in  the  plane,  the  above  mentioned  O  (n  log^  n)  algorithm 
can  be  improved  a  httle  for  k  very  small,  see  [ERvK93]. 


in  all  but  k  of  the  given  halfplanes.  [ERvK93]  provide 
an  0(nA:  log  A; -f  7t  log  n)  solution,  based  on  a  (<  A;)-level 
construction  algorithm.  Efrat  et  al.  [ELS93]  consider 
the  problem  in  the  latter  form  and  obtain  similar  re¬ 
sults.  Moreover,  they  mention  some  more  applications 
and  investigate  the  analogous  problem  in  dimension  3, 
where  they  give  an  0{nk^ (logn -klog^  •^))  solution.  An¬ 
other  closely  related  problem  is  the  finding  of  a  line  in¬ 
tersecting  all  but  at  most  k  of  given  n  vertical  segments 
in  the  plane,  investigated  by  Everett  et  al.  [ERvK93]; 
it  can  also  be  reduced  to  linear  programming  with  at 
most  k  violated  constraints. 

Our  results  for  both  the  2  and  3-dimensional  problem 
thus  present  an  improvement  over  the  previous  roughly 
0{nk)  bounds  if  k  is  in  the  range  (roughly)  from  log  77 
to  ^Jn.  Let  us  remark  that  the  smallest  k  (in  the  weak 
separation  problem  or  its  dual)  can  be  found  in  the  same 
time  as  are  the  bounds  in  Theorem  1.4  (where  the  k  is 
given),  since  our  method  searches  all  bases  of  level  at 
most  k,  and  we  may  arrange  it  so  that  bases  of  smaller 
level  are  searched  first. 

2  LP-type  problems 

2.1  Basic  definitions  and  properties 

We  begin  by  recalling  the  abstract  framework  of  [SW92] 
(with  minor  formal  modifications).  An  minimization 
problem  is  a  pair  {H,w),  where  R  is  a  finite  set,  and 
w  :  2^  — »■  W  is  a  function  with  values  in  a  linearly 
ordered  set  (W,  <).  The  elements  of  H  are  called  the 
constraints,  and  for  a  subset  G  C  H ,  w{G)  is  called  the 
value  of  G. 

Intuitively,  the  value  wi^G)  for  a  subset  G  of  con¬ 
straints  stands  for  the  smallest  value  attainable  for 
certain  objective  function  while  satisfying  all  the  con¬ 
straints  of  G.  The  goal  is  to  find  w{H).  For  the  compu¬ 
tation,  the  problem  is  not  specified  by  giving  the  value 
of  w  for  each  subset  (which  would  make  the  computa¬ 
tion  trivial),  but  rather  by  oracles  implementing  certain 
primitive  operations,  to  be  described  below. 

The  set  W  is  assumed  to  posses  a  smallest  element 
denoted  by  — oo  (intuitively,  it  stands  for  ‘optimum  un¬ 
defined’)  and  usually  also  a  largest  element  oo  (with 
intuitive  meaning  ‘no  feasible  solution  exists’). 

The  minimization  problem  (if,  w)  is  called  an  LP- 
type  problem  if  the  following  two  axioms  are  satisfied: 

Axiom  1.  (Monotonicity)  For  any  F,  G  with 
FCGCH,  w{F)  <  w(G). 

Axiom  2.  (Locality)  For  any  F  C  G  C  H 
with  w{F)  =  w{G)  >  — oo  and  any  h  E  H, 
w{G  L)  {/i})  >  w(G')  implies  that  also  w(F  U 
{h})  >  w{F). 

Before  we  specify  the  computational  primitives,  let 
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US  introduce  some  more  terminology.  A  basis  B  is  a  set 
of  constraints  with  w{B')  <  w{B)  for  all  proper  subsets 
B’  oi  B .  A  basis  for  a  subset  G  of  B  is  a  basis  BCG 
with  w{B)  =  w{G).  So  a  basis  for  G  is  a  minimal 
subset  of  G  with  the  same  value  as  G.  The  maximum 
cardinality  of  any  basis  is  called  the  dimension  of  (B,  w) 
and  denoted  by  dim(B,u;). 

We  say  that  a  constraint  h  C  H  violates  a  set  G  if 
w{G  U  {h})  >  w{G).  This  notion  is  most  often  used 
with  G  being  a  basis.  For  G  C  B  we  denote  by  V"(G) 
the  set  of  constraints  of  B  violating  G. 

As  was  mentioned  in  the  introduction,  there  are  sev¬ 
eral  algorithms  for  solving  an  LP-type  problem  of  con¬ 
stant  bounded  dimension  in  time  0(|B|).  These  algo¬ 
rithms  differ  slightly  in  the  assumptions  on  the  prim¬ 
itive  operations  available  for  the  considered  problem. 
We  describe  the  primitives  needed  for  the  random¬ 
ized  algorithm  of  Sharir  and  Welzl  [SW92]  (see  also 
[MSW92]). 

Violation  test.  Given  a  basis  B  and  a 
constraint  h  £  H,  decide  whether  h  violates 
B. 

Basis  change.  Given  a  basis  B  and  a  con¬ 
straint  h,  return  (some)  basis  for  B  U  {/i}. 

Initial  basis.  On  the  beginning,  we  have 
some  basis  Bq  with  ■w{Bq)  >  — oo. 

For  the  0{nk^)  time  bound  in  Theorem  1.2,  we  as¬ 
sume  that  both  violation  test  and  basis  change  are  im¬ 
plemented  in  constant  time,  and  that  for  any  G  C  B 
with  w{G)  >  —00  an  initial  basis  Bq  C  G  with  zn(So)  > 
— oo  can  be  found  in  0(n)  time. 

To  illustrate  the  definitions,  let  us  look  how  two  par¬ 
ticular  problems  fit  into  this  framework.  For  the  prob¬ 
lem  of  finding  the  smallest  enclosing  circleioi  a  set  B  of 
points  in  the  plane,  the  constraints  are  the  points,  the 
value  ti'(G)  of  a  nonempty  set  G  C  B  is  the  radius  of  the 
smallest  circle  enclosing  G,  and  we  set  w(0)  =  — oo;  thus 
the  set  W  consists  of  the  element  -oo  plus  the  nonneg¬ 
ative  real  numbers.  Bases  have  cardinality  0, 1,2  or  3 
(the  ‘interesting’  cases  are  subsets  with  nonzero  circum- 
radius,  where  the  basis  cardinality  is  2  or  3).  A  nontriv¬ 
ial  step  in  verifying  the  axioms  is  to  show  that  for  any 
set,  the  minimum  enclosing  circle  is  unique,  see  [Wel91], 
[SW92].  Implementing  the  computational  primitives  is 
straightforward. 

As  our  second  example,  we  consider  the  linear  pro¬ 
gramming  problem  in  11“^  in  the  following  form:  We  are 
given  a  set  B  of  closed  halfspaces  in  IR'^,  and  the  goal 
is  to  find  the  point  x  in  the  intersection  of  the  halfs¬ 
paces  of  B  with  the  lexicographically  smallest  coordi¬ 
nate  vector.  Here  the  set  W  is  formed  by  a  smallest 
element  -oo,  the  set  ordered  lexicographically  by 
the  coordinate  vectors,  and  a  largest  element  oo.  The 


value  w{G)  of  a  set  of  constraints  is  defined  as  the  lexi¬ 
cographically  smallest  point  (vertex)  of  the  intersection 
of  all  halfspaces  of  G.  There  are  two  exceptional  cases: 
the  value  is  oo  if  the  intersection  of  the  halfspaces  of  G 
is  empty  and  it  is  -oo  if  this  intersection  is  unbounded 
in  the  —xi  direction  or,  more  generally,  if  it  does  not 
have  a  lexicographically  minimal  point  (e.g.,  if  it  is  the 
halfspace  {si  >  0}). 

The  verification  of  the  axioms  is  easy.  If  the  lin¬ 
ear  programming  problem  defined  by  B  has  a  well  de¬ 
fined  optimum,  then  the  combinatorial  dimension  of  the 
problem  is  d;  if  the  linear  program  is  infeasible,  the  com¬ 
binatorial  dimension  is  usually  d-fl  (as  dfi-l  halfspaces 
in  general  position  are  needed  to  witness  the  infeasibil- 

ity)- 

Violation  tests  and  basis  changes  for  linear  program¬ 
ming  are  implemented  easily  using  Gauss  elimination. 
The  “Initial  basis”  assumption  is  more  tricky,  as  it 
asks  for  finding  a  dually  feasible  solution  if  one  exists. 
This  can  be  circumvented  by  including  the  constraints 
“xi  >  -B”  for  i  =  I,2,...,d,  with  K  standing  for 
a  very  large  number.  These  constraints  can  either  be 
added  to  B  explicitly  and  then  they  form  an  initial  ba¬ 
sis,  or  they  can  be  included  implicitly  and  then  each 
set  of  constraints,  including  the  empty  one,  defines  a 
unique  lexicographically  minimal  vertex,  so  there  are 
no  —00  values  in  this  modification.  In  our  discussion  of 
linear  programming  in  section  3,  we  assume  that  this 
latter  variant  has  been  chosen. 

2.2  Nondegenerate  LP-type  problems 

A  nice  feature  of  both  the  linear  programming  algo¬ 
rithms  of  Clarkson  [Cla88]  and  of  Sharir  and  Welzl 
[SW92]  is  that  they  do  not  require  any  special  treat¬ 
ment  of  degenerate  input  configurations  (such  as  many 
halfspace  boundaries  passing  thru  a  single  point).  Prob¬ 
ably  for  this  reason,  no  notion  of  “general  position”  in 
the  context  of  LP-type  problems  has  been  developed. 
For  our  algorithm  below,  some  “nondegeneracy”  is  im¬ 
portant,  so  we  suggest  a  definition  and  make  few  obser¬ 
vations  concerning  nondegenerate  LP-type  problems. 

It  is  natural  to  define  that  an  LP-type  problem 
(B,  w)  is  nondegenerate  if  ui(B)  ^  w{B')  for  any  two 
distinct  bases  B,B'.  For  a  nondegenerate  problem,  we 
write  B(G)  for  the  (unique)  basis  for  a  set  G. 

We  must  also  define  what  does  “removing  degenera¬ 
cies”  mean.  Let  us  say  that  an  LP-type  problem  (B,  w) 
is  a  refinement  of  an  LP-type  problem  (B,  w),  if  for 
any  G,  G'  C  B,  w{G)  <  w{G')  implies  u)(G)  <  w{G'). 
Clearly,  if  B  is  a  basis  in  {H,w),  it  is  also  a  basis  in 
{H,w),  but  not  conversely.  Removing  degeneracies  for 
a  degenerate  problem  (B,  w)  means  finding  some  its 
nondegenerate  refinement. 

When  considering  an  LP-type  problem  (B,w)  and 
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its  refinement  we  let  the  notation  V{G),  resp. 

B{G)  refer  to  the  set  of  violating  constraints  for  G,  resp. 
the  basis  for  G  in  and  we  use  V{G),  B{G)  for 

violating  constraints  and  basis  in  (//,  w). 

There  are  two  different  types  of  degeneracies.  To 
elucidate  the  difference,  call  two  bases  B,B'  equiva¬ 
lent  if  they  have  identical  sets  of  violating  constraints, 
V{B)  =  V{B').  It  is  fairly  easy  to  find  a  refinement  of 
any  LP-type  problem  where  no  two  nonequivalent  bases 
have  the  same  value,  as  follows. 

Define  a  linearly  ordered  set  VV  =  {—00}  U  ((W  \ 
{  —  00})  X  2^),  where  —00  remains  the  smallest  element, 
the  pairs  in  the  Cartesian  product  are  ordered  lexico¬ 
graphically  and  the  ordering  in  the  second  coordinate 
is  some  arbitrary  linear  ordering  of  the  set  of  all  sub¬ 
sets  of  H .  We  define  a  new  value  function  w  by  set¬ 
ting  w{G)  =  {w{G),  V{G))]  it  is  easy  to  check  that  this 
yields  a  refinement  where  no  two  nonequivalent  bases 
have  the  same  value^. 

Two  equivalent  bases  must  always  have  the  same 
value.  Any  two  equivalent  but  distinct  bases  thus  vio¬ 
late  the  nondegeneracy  condition,  and  this  is  an  ‘intrin¬ 
sic’  degeneracy.  For  instance,  for  linear  programming, 
such  degeneracies  correspond  to  several  bases  defining 
the  same  vertex. 

At  present  we  are  not  aware  of  any  universal  and 
efficient  method  for  removing  this  type  of  degeneracies. 
The  following  simple  example  shows  that  sometimes  any 
nondegenerate  refinement  must  have  a  larger  dimension 
than  the  original  (degenerate)  problem. 

Example  2.1  Let  a,b,c,d  be  the  vertices  of  a  square 
in  the  plane  in  clockwise  order,  and  for  G  C  //  = 
{a,b,c,d}  define  the  value  w(G)  as  the  circumradius 
of  G.  This  is  an  LP-type  problem  of  dimension  2,  as 
any  nontrivial  circumscribed  circle  is  defined  by  a  di¬ 
ametral  pair.  The  sets  {a,c}  and  {b,d}  are  two  equiv¬ 
alent  bases.  Suppose  there  existed  a  nondegenerate  re¬ 
finement  (H,  w)  of  this  problem  with  no  basis  having 
more  than  2  points.  Then  w.l.o.g.  we  may  assume  that 
rt)({a,  c})  <  w{{b,  d}).  Set  G  =  {a,  b,  c},  F  =  {a,  c}  and 
h  =  d]  then  w{F)  =  w{F  U  {h})  =  w{G)  <  w{GU{h}), 
which  violates  the  locality  axiom.  Thus,  any  nondegen¬ 
erate  refinement  has  dimension  at  least  .3. 

For  specific  geometric  problems,  one  can  often  as¬ 
sume  that  the  corresponding  geometric  configurations 
are  nondegenerate  in  a  suitable  sense  (e.g.,  for  linear 
programming,  the  boundary  hyperplanes  of  the  con¬ 
straints  are  in  general  position  and  not  parallel  to  any 
coordinate  axis),  using  the  techniques  of  simulation  of 
smiphcity  for  geometric  problems,  see  e.g.  [EM90], 
[Yap90],  [EC92],  [Sei94].  But,  for  instance,  this  is  not 


enough  for  an  infeasible  linear  programming  problem. 
There  may  be  many  bases  witnessing  the  infeasibility, 
and  by  the  treatment  of  linear  programming  indicated 
above,  all  these  bases  receive  the  same  value  00.  An¬ 
other  (related)  serious  problem  is  that  if  the  feasible 
region  of  a  degenerate  linear  programming  problem  con¬ 
sists  of  a  single  point  (say),  then  some  perturbations  of 
the  constraints  may  make  the  problem  infeasible.  Ob¬ 
taining  nondegenerate  refinements  in  such  cases  may 
be  a  somewhat  subtle  matter;  we  discuss  some  possible 
approaches  in  section  3. 


2.3  Bases  of  level  k 

Let  {H,  vj)  be  an  LP-type  problem  and  B  C  H  be  a  ba¬ 
sis.  We  define  the  level  of  B  as  \V{B)\,  i.e.  the  number 
of  constraints  violating  B. 

We  denote  by  Bk  the  set  of  all  bases  of  level  k,  and 
by  the  bases  of  level  k  and  cardinality  i.  We  use 
the  notation  B^k  for  the  set  of  all  bases  of  level  at  most 
k. 

In  this  section,  we  discuss  bounds  for  the  maximum 
possible  size  of  B<k  and  Bk-  In  the  context  of  linear 
programming  (more  exactly,  for  a  feasible  linear  pro¬ 
gram),  a  basis  of  level  k  translates  to  a  local  minimum 
of  the  A:-level  in  the  arrangement  of  hyperplanes  bound¬ 
ing  the  constraints.  Mulmuley  [Mul91]  showed  that  the 
total  number  of  such  local  minima  of  levels  1,2,. . .  ,k  in 
dimension  d  is  0{k‘^).  His  proof  is  a  straightforward 
adaptation  of  the  method  of  Clarkson  and  Shor  [CS89]. 
Clarkson  [Cla92]  and  independently  Mulmuley  [Mul9.3] 
proved  that  the  number  of  local  minima  for  level  k  alone 
is  0(k'^~^).  Both  bounds  are  easily  seen  to  be  tight. 

In  the  following  theorem,  we  generalize  these  results 
for  nondegenerate  LP-type  problems  (the  nondegener¬ 
acy  is  crucial;  without  it  the  bounds  in  the  theorem  be¬ 
low  need  not  hold).  The  proof  for  level  at  most  k  goes 
through  unchanged.  For  the  exactly  k  level  bound,  we 
need  an  extra  assumption,  namely  that  no  value  — co 
appears  in  the  considered  problem.  Then  we  can  imi¬ 
tate  Clarkson’s  proof  quite  closely;  the  proof  is  omitted 
in  this  extended  abstract.  We  do  not  know  whether  the 
statement  remains  true  without  this  assumption. 

Theorem  2.2  (i)  For  any  nondegenerate  LP-type 
problem  of  dimension  d,  the  number  of  bases  of 
level  at  most  k  and  of  cardinality  at  most  i  does  not 
exceed  e(fe-f  1)'.  In  particular,  |L?<jt|  =  0((A;-1- 1)*^). 

(ii)  For  any  nondegenerate  LP-type  problem  {H,w)  of 
dimension  d  with  vj{G)  >  —00  for  any  G  C  H ,  we 
have  \Bk\  =  0{{k  -f-  1)'^“^). 


^  Tills  modification  of  the  weight  function  is  not  reflected  in  the 
computation  of  the  algoritlims  which  use  only  the  primitive  oper- 
ations  mentioned  above,  as  none  of  these  primitives  are  changed. 
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2.4  Finding  the  minimum  A;-level  basis 

In  our  algorithm,  we  need  to  assume  that  we  deal  with 
a  nondegenerate  LP-type  problem.  The  following  easy 
proposition  (whose  proof  is  omitted)  shows  that  in  or¬ 
der  to  find  a  minimal  (<  ^)-level  basis  in  a  possibly 
degenerate  LP-type  problem  (H,w),  it  suffices  to  find 
the  minimum  (<  A;)-level  basis  in  any  nondegenerate 
refinement  of  {H,  w). 

Proposition  2.3  Let  (H,w)  be  an  LP-iype.  problem, 
lei  {H,w)  he  iis  nondegenerate  refinement.  Let  B  be 
the  basis  in  (the  set  of  all  bases  of  level  at  most  k 
in  {H,  w))  with  ike  minimum  w-value,  let  B  be  a  basts 
for  B  in  (H,w).  Then  B  G  and  w(B)  is  minimum 
over  B<k- 

For  the  rest  of  this  section,  let  (Lf,  u))  be  a  nondegen¬ 
erate  LP-type  problem  of  dimension  d,  and  let  k  <  n 
be  a  given  integer.  Our  goal  is  to  find  the  basis  with 
the  smallest  value  among  all  bases  of  level  at  most  k  in 
{H,  w).  The  algorithm  consists  of  searching  all  bases  of 
level  at  most  k  and  selecting  the  one  with  minimum 
value.  It  it  is  easy  to  see  that  for  a  nondegenerate 
problem,  the  optimal  basis  has  level  exactly  k,  but  our 
method  requires  searching  also  bases  of  smaller  levels. 

First  we  define  a  directed  acyclic  graph  on  the  vertex 
set  B<k-  An  edge  goes  from  a  basis  B  £  Bj  to  a  basis 

e  ^i+i  (0  <  i  <  k)  if  V{B')  =  V{B)U  {6}  for  some 
be  B.  We  note  that  B'  is  the  basis  for  H\V(B)\  {6}. 
Therefore,  given  B  e  Bj,  we  can  find  all  its  neighbors 
in  Bj+i  by  computing,  for  every  b  e  B,  the  basis  B' 
for  H  \  V{B)  \  {6}  and  checking  the  condition  V{B')  = 
V{B)  U  {6}.  This  requires  solving  |i?|  <  d  LP-type 
problems  of  the  form  (G,  w)  with  G  C  H  plus  simple 
auxiliary  operations. 

We  need  the  following 

Lemma  2.4  Every  basis  of  Bk  can  be  reached  from 
B{H)  by  a  directed  path. 

Proof;  It  suffices  to  show  that  any  basis  B'  G  Bj+i  has 
a  predecessor  in  Bj . 

Write  G  =  H\  ¥(3'),  then  B'  =  B{G).  For  every 
h  e  V(B')  fz  0,  consider  the  value  uj(G  U  {/i}),  and  let 
/lo  €  V{B')  be  an  element  giving  the  smallest  of  these 
values.  In  fact,  such  /iq  is  unique,  as  w{G  U  {/iq})  = 
w{G  U  {/i})  for  h  fz  ho  implies  that  B(G  U  {Lq})  = 
5(6*  U  {/i})  C  G  by  nondegeneracy,  and  this  in  turn 
means  w{G  U  {/iq})  =  n^(G),  a  contradiction  with  the 
assumption  ho  e  V(B'). 

Let  B  be  the  basis  for  G  U  {/iq}.  We  claim  that 
B  is  the  desired  predecessor  of  B' .  We  have  V{B)  U 
{/iq}  C  V{B')\  we  need  to  show  equality.  Suppose  the 
inclusion  is  proper,  then  some  h  G  V{B'),h  fz  ho  does 
not  violate  B.  Then,  by  locality,  h  does  not  violate 


GU{/io}  either,  so  w{G  U  {ho})  =  w{GU  {h,hQ}),  but 
we  have  zn(GU{/i})  >  «;(GU{/io})  by  the  choice  of  Lq, 
a  contradiction  with  monotonicity.  □ 

We  are  ready  to  finish  the  proof  of  Theorem  1.2. 
We  start  in  the  basis  B{H),  and  we  search  the  above 
defined  directed  acyclic  graph  by  some  graph  traversal 
algorithm,  say  by  depth-first  traversal.  For  any  current 
node,  we  can  find  all  of  its  at  most  d  successors  in  0{n) 
time,  and  by  2.2(i),  we  know  that  only  0{k'^)  bases  need 
to  be  searched.  By  the  above  lemma,  we  know  that  all 
bases  ofB<,t  are  reached.  □ 

We  conclude  this  section  by  a  general  scheme  for  an 
application  of  suitable  dynamic  data  structures  for  im¬ 
plementing  this  algorithm  (which  more  or  less  offers  it¬ 
self). 

When  traversing  the  graph  defined  above,  we  main¬ 
tain  two  dynamic  data  structures,  O  and  V.  For  a  cur¬ 
rent  basis  B,  the  data  structure  G  stores  the  constraints 
of  H  \  V{B),  and  it  can  quickly  determine  the  basis  for 
the  currently  stored  set  of  constraints.  The  data  struc¬ 
ture  V  stores  the  constraints  of  1/(5),  and  it  can  quickly 
test  whether  all  constraints  of  the  currently  stored  set 
violate  a  basis  given  as  a  query. 

For  testing  if  a  successor  basis  B'  of  B  with  1/(5')  = 
1/(5)  U  {b}  for  some  6  G  5  exists,  we  first  delete  b  from 
O  and  then  we  query  O  for  the  basis  5'  for  the  current 
constraint  set.  Then  we  query  the  data  structure  V  to 
check  whether  V (5)  C  V (5').  If  yes,  we  insert  b  into  V, 
and  we  are  ready  to  continue  the  search  with  5'  as  the 
current  basis.  When  traversing  the  previously  visited 
edge  (5,  5')  backwards,  we  insert  b  to  O  and  delete  it 
from  y. 


3  Geometric  applications 

In  this  section,  we  mainly  consider  Problem  1.3  (mini¬ 
mum  vertex  contained  in  all  but  at  most  k  of  given  n 
halfspaces  in  BR,'^).  For  a  halfspace  h,  we  let  dh  stand  for 
its  bounding  hyperplane,  and  for  a  set  G  of  halfspaces 
we  write  dG  for  [dh-,  h  eG}. 

As  outlined  in  section  2.1,  the  linear  programming 
problem  can  be  interpreted  as  an  LP-type  problem.  One 
technical  problem  is  that  the  input  problem  may  be  de¬ 
generate,  while  our  algorithm  requires  a  nondegener¬ 
ate  one.  For  a  feasible  linear  program,  degeneracy  is 
caused  by  degeneracy  in  the  arrangement  of  95,  while 
for  an  infeasible  problem,  any  two  distinct  infeasible 
bases  present  a  degeneracy,  even  if  the  hyperplanes  of 
OH  are  in  general  position.  For  our  algorithm,  we  need 
a  nondegenerate  refinement. 

As  we  will  see  below,  for  a  problem  with  a  nonempty 
and  full-dimensional  feasible  region,  a  nondegenerate 
refinement  can  be  produced  relatively  easily  using  in¬ 
finitesimal  perturbations,  while  the  situation  gets  more 


317 


complicated  in  other  cases.  One  potential  problem  is 
the  following:  if  the  feasible  region  is  nonempty  but 
has  empty  interior,  an  arbitrarily  small  perturbation  of 
the  input  halfspaces  may  cause  the  problem  to  become 
infeasible,  and  in  general  we  need  not  get  a  refinement. 

At  this  moment,  a  general  remark  concerning  simu¬ 
lation  of  simplicity  is  perhaps  appropriate.  When  ap¬ 
plying  simulation  of  simplicity  on  a  geometric  problem, 
we  replace  it  in  effect  by  a  different  problem  (although 
an  “infinitesimally  close”  one),  and  an  algorithm  run  on 
this  different  problem  may  sometimes  yield  a  different 
answer  than  the  correct  one  for  the  original  problem 
(e.g.,  the  original  linear  program  is  feasible  while  the 
perturbed  one  is  not)  .  We  can  take  two  points  of  vmw: 
First,  we  may  say  that  the  input  numbers  for  a  real 
world”  problem  are  inaccurate  anyway,  and  so  the  an¬ 
swer  for  the  perturbed  problem  is  equally  appropriate 
as  the  one  for  the  original  problem.  For  example,  we 
cannot  really  tell  whether  the  feasible  region  is  very  thin 
or  empty.  From  the  second  point  of  view,  the  input  is 
given  exactly  and  its  degeneracies  really  exist,  so  if  we 
use  perturbation,  we  must  be  sure  to  recover  the  answer 
to  the  original  problem  correctly.  By  requiring  that  our 
algorithm  is  run  on  a  nondegenerate  refinement  of  the 
input  problem,  we  are  taking  the  second  of  the  outlined 
positions  (which  may  not  always  be  appropriate,  de¬ 
pending  on  the  application).  The  first  attitude  would 
allow  us  to  take  an  arbitrary  infinitesimal  perturbation, 
and  would  save  us  many  complications. 

The  following  proposition  shows  that  a  nondegener¬ 
ate  refinement  can  be  constructed  for  a  linear  program¬ 
ming  problem.  Let  jF(ff)  denote  the  feasible  region  of 
the  linear  program  defined  by  H . 

Proposition  3.1  Given  a  set  H  of  kalfspaces  in  IR'^, 
one.  can  define  a  nondegenerate  LP-iype  problem  [H ,  w), 
which  is  a  refinement  of  the  LP-type  problem  {H ,  w) 
defined  above,  has  dimension  d  (resp  d+l)  if  T{H)  0 
(resp.  T{H)  =  %),  and  for  which  the  computational 
primitives  can  be  implemented  in  constant  time. 

Proof  sketch:  The  refinement  proceeds  in  several 

steps.  First,  we  replace  the  problem  of  finding  the  lex¬ 
icographically  smallest  feasible  vertex  by  finding  a  ver¬ 
tex  minimizing  an  objective  function  of  the  form  c  ■  x, 
with  c  =  (l,£i,  If£i  >  0  is  chosen  small 

enough,  we  get  an  isomorphic  LP-type  problem.  Then 
we  shift  each  of  the  constraint  halfspaces  by  £2  out¬ 
wards  (keeping  the  bounding  hyperplane  parallel  to  the 
original  one),  with  0  <  €2  This  guarantees  that 

whenever  the  feasible  region  is  nonempty,  it  also  has 
a  nonempty  interior.  Then  we  perturb  each  bounding 
hyperplane  by  at  most  £3  <  £2  (where  distance  of  hy¬ 
perplanes  is  defined  as  the  distance  of  their  dual  points, 
say).  This  perturbation  makes  the  values  of  all  feasible 
bases  distinct. 


Finally  we  have  to  distinguish  the  values  of  the  infea¬ 
sible  bases  (this  step  is  unnecessary  for  feasible  prob¬ 
lems).  We  consider  an  infeasible  set  G  of  halfspaces 
(after  the  final  perturbation),  and  we  divide  them  into 
upper  and  lower  halfspaces.  The  value  of  G  will  be 
the  smallest  amount  t  =  t{G)  by  which  we  must  shift 
the  intersection  of  all  lower  halfspaces  upwards  so  that 
it  touches  the  intersection  of  all  upper  halfspaces.  It 
turns  out  that  this  gives  the  desired  nondegenerate  re¬ 
finement  of  dimension  d+l]  we  omit  the  details.  In 
the  algorithm,  we  do  not  choose  specific  numbers  for 
£i.£2,£3.  rather  we  treat  them  as  indeterminates.  □ 

By  proving  Proposition  3.1,  we  have  shown  that  the 
general  algorithm  from  Theorem  1.2  is  applicable  to 
Problem  1.3.  Further  we  consider  efficient  implementa¬ 
tion  of  the  dynamic  data  structures  O  and  V  mentioned 
in  the  end  of  section  2.4  for  the  linear  programming 
problem  (actually  for  its  nondegenerate  refinement  con¬ 
structed  in  the  proof  of  Proposition  3.1). 

We  begin  with  the  feasible  case.  Here  the  data  struc¬ 
ture  V  stores  a  set  of  halfspaces  which  all  contain  a 
known  point  (a  fixed  point  in  and  the  query 

boils  down  to  checking  whether  a  given  point  lies  out¬ 
side  of  the  union  of  the  current  set  of  halfspaces.  For 
this  task,  algorithms  are  known  with  the  following  per¬ 
formance  [AM91]  (we  only  quote  the  results  relevant 
to  our  application):  In  dimension  3,  0{n^)  query  time 
can  be  achieved  with  O(nlogn)  preprocessing  time  and 
C»(log^  n)  amortized  update  time.  In  dimension  d  >  4, 
one  gets  the  following  tradeoff:  for  a  parameter  m  in 
range  query  time  is  ob¬ 

tained  with  0(mi+")  space  and  preprocessing  time  and 
with  fn)  amortized  update  time  (in  fact,  for 

m  =  n  an  O(nlogn)  preprocessing  suffices) We  must 
apply  the  appropriate  perturbations  on  the  input  half¬ 
spaces,  but  this  is  a  simulation  of  simplicity  technique 
of  a  particular  type  and  it  only  slows  down  the  compu¬ 
tation  by  a  constant  factor. 

The  data  structure  O  should  return  a  basis  for  the 
current  set  of  halfspaces  (and  we  are  guaranteed  that 
the  halfspaces  have  a  nonempty  intersection).  Results 
of  [Mat93a]  show  that  the  same  performance  can  be 
achieved  as  the  one  above  for  V  (the  query  time  grows 
by  a  polylogarithmic  factor,  which  formally  disappears 
in  the  if  term). 

In  dimension  3,  we  perform  0{k^)  queries  and  up¬ 
dates  in  both  data  structures,  which  leads  to  the 
claimed  0{n\ogn  +  k^n‘)  complexity.  For  dimension 
4,  we  choose  a  suitable  tradeoff  between  the  total  up¬ 
date  and  query  time  and  the  preprocessing  time  of  the 
data  structures,  as  follows:  For  k  <  we  let  m  —  n, 

for  <  n  <  fc®  we  let  m  =  larger 

k  we  choose  m  =  This  yields  the  formula  stated 

in  the  theorem.  Tradeoffs  can  be  computed  also  for 
higher  dimensions,  although  with  less  significant  gains 
in  efficiency.  This  establishes  part  (i)  of  Theorem  1.4. 
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In  the  infeasible  case,  the  data  structure  V  stores 
the  upper  halfspaces  and  lower  halfspaces  separately. 
Testing  for  a  feasible  basis  is  as  before.  For  an  infeasi¬ 
ble  basis  B,  we  first  compute  the  value  t{B)  as  in  the 
proof  of  Proposition  2.3  and  the  point  r  =  t{B)  of  the 
intersection  of  all  upper  halfspaces  of  B  where  the  in¬ 
tersection  of  all  lower  halfspaces  of  B  first  touches  when 
shifted  upwards,  and  then  we  test  if  r  lies  outside  all  up¬ 
per  halfspaces  and  r  —  (t,  0, . . . ,  0)  lies  outside  all  lower 
halfspaces. 

The  data  structure  O  also  stores  upper  and  lower 
halfspaces  separately.  If  the  currently  stored  set  is  fea¬ 
sible,  the  algorithm  of  [Mat93a]  returns  the  basis.  If 
infeasibility  is  reported,  we  use  parametric  search  to 
find  the  value  of  t.  As  a  generic  algorithm,  we  use  the 
algorithm  of  [Mat93a]  for  testing  feasibility  of  the  set  of 
the  upper  halfspaces  plus  the  lower  halfspaces  shifted  by 
a  generic  value  of  t.  A  more  detailed  exposition  would 
require  explaining  the  feasibility  testing  algorithm  and 
we  omit  it,  as  the  details  are  easy  to  fill  in,  assuming 
a  familiarity  with  [Mat93a].  The  application  of  para¬ 
metric  search  on  top  of  the  feasibility  testing  algorithm 
only  increases  the  query  time  by  polylogarithmic  fac¬ 
tors.  The  overall  performance  of  the  resulting  dynamic 
data  structures  V  and  O  is  thus  the  same  as  in  the  fea¬ 
sible  case  above;  this  gives  the  result  for  dimension  3  in 
Theorem  1.4(ii). 

For  the  2-dimensional  case,  the  parametric  search 
machinery  is  unnecessary,  and  we  may  directly  use  a 
relatively  simple  dynamic  data  structure  for  maintain¬ 
ing  convex  hulls  in  the  plane  due  to  Overmars  an  van 
Leeuwen  [OvLSl].  To  build  O,  we  use  this  data  struc¬ 
ture  in  a  dual  form.  One  part  represents  the  intersection 
U  of  the  upper  halfplanes,  another  part  the  intersection 
L  of  the  lower  halfplanes.  After  0(n  log  n)  preprocess¬ 
ing,  halfplanes  can  be  inserted  and  deleted  in  0(log^  n) 
time,  and  the  data  structure  provides  a  representation 
of  the  convex  chains  forming  the  boundaries  of  L  and 
U  (the  chains  are  stored  in  balanced  binary  trees). 

If  the  current  problem  is  feasible,  the  optimal  vertex 
is  either  the  extreme  vertex  of  U,  or  the  extreme  vertex 
of  L,  or  one  of  the  two  intersections  of  the  boundaries 
of  U  and  of  L.  All  these  vertices  can  be  found  and 
examined  in  0(log^  n)  time.  For  an  infeasible  problem, 
the  first  contact  of  U  and  L  when  translating  L  upwards 
occurs  either  at  a  vertex  of  L  or  at  a  vertex  of  U.  For  a 
given  vertex  v  of  L,  we  can  determine  the  point  where 
it  hits  the  boundary  of  U  in  O(logn)  time.  From  the 
local  situation  at  that  point,  we  can  detect  whether  the 
first  contact  of  L  and  U  occurs  to  the  left  of  v  or  to  the 
right  of  V.  Hence  we  can  determine  the  first  contact,  the 
corresponding  t  value  and  the  basis  in  0(log^  n)  time 
by  binary  search  in  the  lower  convex  chain.  The  data 
structure  V  is  also  implemented  using  the  Overmars 
and  van  Leeuwen  data  structure,  with  0(log^  n)  time 
per  update  and  O(logn)  time  per  query.  Altogether  we 


get  the  0(nlogn-|-^®  log^  n)  running  time.  This  finishes 
the  proof  of  Theorem  1.4.  □ 

Proof  of  Theorem  1.1;  For  the  smallest  enclos¬ 
ing  circles  problem  discussed  in  the  introduction,  the 
situation  with  a  nondegenerate  refinement  is  consider¬ 
ably  easier  than  the  one  for  linear  programming  (this  is 
because  the  value  function  depends  continuously  on  the 
point  set).  It  suffices  to  take  any  sufficiently  small  per¬ 
turbation  of  the  input  points  such  that  no  4  points  are 
cocircular  and  no  circle  determined  by  2  points  passes 
through  another  point. 

To  build  the  required  dynamic  data  structures,  we 
use  the  well-known  lifting  transformation.  Namely,  we 
map  each  point  p  =  (u,  v)  in  our  point  set  to  the  plane 
Tr{p)  -  {(x,  y,  z)  G  H®;  z  =  -  2ux  -  2vy},  and 

we  map  a  circle  C  with  center  (a,  6)  and  radius  r  to  the 
point  x(C)  =  (a,b,r^-a^  +  b^)  G  IR^.  Then  p  lies  inside 
C  iff  x{C)  lies  above  n{p). 

This  allows  us  to  implement  the  data  structure  V 
using  the  dynamic  data  structure  of  [AM91]  mentioned 
in  the  previous  section.  For  the  data  structure  O,  we 
must  work  somewhat  more. 

For  a  point  set  P  in  the  plane,  let  A{P)  denote  the 
cell  above  all  the  planes  7r(p)  with  p  E  P-  Finding  the 
smallest  circle  enclosing  P  translates  to  minimizing  the 
function  (p{x,  y,  z)  :=  x^  +  y"^  -  z  over  the  region  A(P); 
the  minimum  value  is  the  square  of  the  radius  of  the 
smallest  enclosing  circle,  and  the  x  and  y  coordinates 
of  the  point  realizing  the  minimum  determine  the  center 
of  the  circle. 

The  procedure  is  quite  analogous  to  the  one  from 
[Mat93a]  for  minimizing  a  linear  function  over  a  convex 
polyhedron  (which  is  given  implicitly  by  a  certain  al¬ 
gorithm  for  answering  membership  queries),  only  that 
here  we  minimize  a  convex  function.  In  our  case,  the 
membership  algorithm  is  the  query  answering  algorithm 
for  the  dynamic  data  structure  of  [AM91].  A  reader  fa¬ 
miliar  with  the  procedure  of  [Mat93a]  may  check  that 
the  only  point  where  the  difference  between  a  linear 
function  and  a  convex  one  appears  is  in  the  following 
situation:  We  have  a  j-flat  /  and  a  [j  —  l)-flat  f  C  f  ■ 
We  know  that  the  minimum  of  ip  over  /'  D  A{P)  is  at¬ 
tained  at  some  point  z  £  f .  From  the  local  geometry 
near  this  point  we  want  to  determine  on  which  side  of 
/'  lies  the  minimum  of  p  within  /  fl  A{P).  In  case  of 
minimizing  a  linear  function,  the  minimum  point  z  is 
determined  by  j  -  1  of  the  planes  bounding  A{P),  and 
one  can  easily  determine  on  which  side  of  /'  the  mini¬ 
mized  function  may  grow.  In  our  case  of  a  (quadratic) 
convex  minimized  function,  the  minimum  may  be  the 
global  minimum,  or  it  may  be  determined  by  any  num¬ 
ber  of  planes  up  to  j  —  1,  but  still  one  may  determine 
locally  on  which  side  of  /'  does  the  minimum  within 
/  lie.  With  this  subroutine  available,  we  may  use  the 
procedure  of  [Mat93a],  and  we  obtain  a  minimization 
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algorithm  with  0(7i^)  query  time.  Generalization  to 
an  arbitrary  dimension  is  also  possible  (the  efficiency 
depends  on  the  efficiency  of  the  algorithm  for  the  mem¬ 
bership  queries). 

In  this  way,  also  the  data  structure  O  can  be  im¬ 
plemented  with  O(n^)  query  time,  and  this  gives  the 
bound  in  Theorem  1.1.  □ 

There  are  various  other  optimization  problems  of  ge¬ 
ometric  flavor  fitting  into  the  LP-type  framework,  see 
[MSW92],  [Ame93].  Here  are  few  examples:  Finding 
the  smallest  ball,  resp.  the  smallest  volume  ellipsoid 
enclosing  a  given  point  set  in  IR'^;  finding  the  largest 
volume  ellipsoid  inscribed  into  the  intersection  of  given 
halfspaces  in  IR'^;  finding  the  distance  of  two  convex 
polyhedra  given  by  vertices,  resp.  by  facet  hyperplanes 
in  IR*^;  finding  a  line  transversal  for  given  convex  poly¬ 
gons  in  the  plane. 

If  the  dimension  is  fixed,  one  can  apply  our  general 
algorithm  to  the  respective  derived  problems  with  k  vi¬ 
olated  constraints,  provided  that  the  nondegeneracy  is¬ 
sue  can  be  handled.  In  many  of  the  problems,  simula¬ 
tion  of  simplicity  alone  should  suffice;  it  seems  that  lin¬ 
ear  programming  is  complicated  in  this  respect  because 
feasible  solutions  need  not  exist.  The  applicability  of 
dynamic  data  structures  to  speed  up  the  computations 
must  be  checked  individually.  In  general,  the  improve¬ 
ments  will  probably  be  significant  only  if  the  dimension 
is  really  small. 

4  Discussion 

It  would  be  interesting  to  find  more  about  nondegener¬ 
ate  refinements  of  LP-type  problems.  We  have  shown 
that  the  dimension  must  sometimes  grow  at  least  by 
1;  a  natural  question  is  how  large  growth  is  necessary 
and  sufficient  in  the  worst  case;  is  there  any  bound  onfy 
depending  on  the  dimension? 

From  a  practical  point  of  view,  it  would  be  very  de¬ 
sirable  to  have  some  more  direct  scheme  for  making 
the  geometric  LP-type  problems,  linear  programming  in 
particular,  nondegenerate  that  the  rather  cumbersome 
approach  via  geometric  perturbations  we  used.  Alter¬ 
natively  one  might  find  an  algorithm  which  can  handle 
nondegeneracy  directly. 

In  Theorem  2.2(ii),  we  saw  that  the  number  of  bases 
of  level  exactly  k  is  in  a  d-dimensional  LP- 

type  problem  with  no  — oo  values.  Two  questions  arise 
naturally:  First,  is  the  claim  still  true  if  we  allow  — oo 
values?  And  second,  can  one  use  this  result  algorith¬ 
mically  for  finding  the  smallest  basis  of  level  k,  that 
is,  can  one  efficiently  avoid  searching  all  bases  of  level 
at  most  k,  whose  number  may  be  of  the  order  k‘^1  In 
particular,  for  the  2-dimensional  infeasible  linear  pro¬ 
gramming,  we  are  actually  interested  only  in  2-element 
bases  (as  all  infeasible  bases  have  the  same  value  oo  in 


the  original  problem),  and  we  know  that  there  are  only 
0{k'^)  of  these.  Still,  the  current  method  may  search 
k^  bases,  most  of  them  infeasible  ones;  could  this  be 
avoided? 
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Abstract 

We  give  an  0{n  log  n)-time  method  for  finding  a  best  k- 
link  piecewise- linear  function  approximating  an  n-point 
planar  data  set  using  the  well-known  uniform  metric  to 
measure  the  error,  e  >  0,  of  the  approximation.  Our 
method  is  based  upon  new  characterizations  of  such 
functions,  which  we  exploit  to  design  an  efficient  al¬ 
gorithm  using  a  plane  sweep  in  “e  space”  followed  by 
several  applications  of  the  parametric  searching  tech¬ 
nique.  The  previous  best  running  time  for  this  problem 
was  0(v?). 


1  Introduction 


Given  a  set  5  =  {(a:i,  j/i),  (a:2,2/2),  •  •  • ,  (aJn, 2/n)},  the 
problem  of  approximating  5  by  a  function  is  classic 
in  applied  mathematics,  and  it  finds  applications  in  a 
number  of  computational  problems.  The  general  goals 
in  this  area  of  research  are  to  find  a  function  F  belong¬ 
ing  to  a  class  of  functions  F  such  that  each  F  ^  T  \s 
simple  to  describe,  represent,  and  compute  and  such 
that  the  chosen  F  approximates  S  well.  For  example, 
one  may  desire  that  F  be  the  class  of  linear  or  piecewise- 
linear  functions,  and,  for  any  particular  F  &  F,  that  the 
measure  of  the  error  be  the  well-known  uniform  metric: 

l|5-F||oo=  max  \y,-F{x,)\, 

which  is  also  known  as  the  /qo  or  Chebychev  measure  of 
error  [16,  18,  30].  The  goal,  then,  is  to  determine  the 
value  of 

e*  =  min||5-F||oo, 
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and  find  an  F  e  F  achieving  this  error  bound. 

The  version  of  this  problem  we  address  in  this  pa¬ 
per  is  to  find  a  function  F  E  F  that  minimizes  the 
uniform  error  term  with  respect  to  5  where  F  is  the 
class  of  Glink  piecewise- linear  functions,  for  some  given 
k  e  {1, 2, . . . ,  n  -  1}.  Using  terminology  from  the  ap¬ 
proximation  theory  literature  (e.g.,  see  [9,  16,  18,  19]), 
this  is  equivalent  to  the  problem  of  finding  a  best  (fc-bl)- 
knot  degree- 1  spline  approximating  S  under  the  foo 
norm.  Of  course,  the  case  fc  =  n  -  1  is  trivial,  and 
there  is  a  simple  reduction  of  the  case  =  1  to  3- 
dimensiona!  linear  programming,  which  can  be  solved 
in  0(n)  time  [12,  20,  40,  42,  43,  53].  Thus,  the  inter¬ 
esting  cases  are  for  1  <  /c  <  n  —  1.  We  show  how  to 
solve  this  problem  for  any  such  k  in  O(nlogn)  time. 

The  motivation  for  this  problem  is  that  one  may 
have  limited  resources  with  which  to  describe  the  set  S, 
but  one  wishes  the  best  approximation  possible  within 
the  given  resource  bounds.  This  can  also  be  viewed  as 
a  data  compression  problem. 

1.1  Previous  work 

The  problem  we  address  is  a  special  case  of  a  whole  class 
of  problems  in  approximation  theory  where  one  wishes 
to  fit  a  set  of  data  using  a  spline  function  under  some 
metric.  Thus,  the  interested  reader  is  referred  to  texts 
discussing  approximation  theory,  such  as  those  by  Bell¬ 
man  and  Roth  [9],  Conte  and  de  Boor  [16],  Davis  [18], 
and  Dierckx  [19],  for  a  general  treatment  of  such  prob¬ 
lems.  Research  in  this  literature  is  primarily  interested 
in  minimizing  the  number  of  knots  in  a  spline  under  the 
the  least  squares  metric,  e.g.,  Jupp  [36]  gives  a  numer¬ 
ical  approach  to  this  problem.  For  the  specific  prob¬ 
lem  we  address  here.  Bellman  and  Roth  [8]  describe 
a  dynamic-programming  approach  based  upon  using  a 
uniform  grid  to  determine  possible  placements  of  link 
endpoints  (which  they  call  knots).  Their  method  is  not 
guaranteed  to  find  a  best  /s-link  approximation,  how¬ 
ever. 

Hakimi  and  Schmeichel  [30]  show  that  such  a  best 
approximation  can  be  found  in  O(n^logn)  time,  and 
this  is  the  first  method  we  know  of  that  is  guaranteed 
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to  find  a  best  approximation.  Their  algorithm  is  based 
upon  a  clever  lemma  that  shows  that  one  can  limit  the 
number  of  “critical”  e  values  that  are  candidates  for 
e*  to  be  0{'n?).  They  also  show  that  one  can  test  if 
any  such  e  value  is  equal  to  e*  in  0{n)  time,  which 
implies  that,  once  enumerated  and  sorted,  one  can  per¬ 
form  a  “binary  search”  among  these  critical  values  to 
find  e*.  Of  course,  enumerating  these  critical  e’s  re¬ 
quires  n(n2)  time.  Indeed,  a  straightforward  applica¬ 
tion  of  the  lemma  by  Hakimi  and  Schmeichel  would 
require  0(n^)  time  to  enumerate  them.  They  reduce 
the  time  to  0(n^  logn)  using  the  powerful  plane  sweep¬ 
ing  technique  (e.g.,  see  [51]),  which  involves  “sweeping” 
the  plane  with  a  line  L  while  maintaining  appropriate 
data  structures  for  the  points  L  encounters  along  the 
way.  More  recently,  Wang  et  al.  [58]  show  how  to  use 
an  even  more  clever  plane  sweep  procedure  to  find  a 
best  A:-link  approximation  under  the  uniform  metric  in 
O(n^)  time. 

1.2  Related  work 

With  the  exception  of  the  papers  by  Hakimi  and  Schme¬ 
ichel  [30]  and  Wang  et  al.  [58],  related  work  in  the 
computational  geometry  literature  has  been  directed 
at  what  can  be  viewed  as  the  “inverse  problem,”  which 
is  also  addressed  in  the  paper  by  Hakimi  and  Schme¬ 
ichel  [30].  In  this  problem  one  is  given  an  error  mea¬ 
sure  e  >  0  and  asked  to  find  a  minimum-link  polygo¬ 
nal  path  (which  may  or  may  not  be  required  to  be  a 
function)  that  has  distance  at  most  e  from  all  the  ob¬ 
jects  in  S  (which  need  not  just  be  points),  under  some 
reasonable  distance  metric.  As  mentioned  earlier,  for 
the  case  when  S  is  a  set  of  points  and  the  error  mea¬ 
sure  is  the  uniform  metric,  then  Hakimi  and  Schme¬ 
ichel  show  that  this  problem  can  be  solved  in  0(n) 
time.  Their  method  can  be  viewed  as  an  extension 
of  the  linear-time  method  of  Suri  [54] ,  which  computes 
a  minimum-link  path  inside  a  simple  polygon,  to  the 
problem  of  finding  a  minimum-link  monotone  polygonal 
chain  that  “stabs”  a  given  set  of  line  segments.  Hersh¬ 
berger  and  Snoeyink  [32]  show  how  to  further  generalize 
this  method  to  find  in  0{n)  time  a  minimum- link  path 
of  a  particular  homotopy  type  in  a  non-simple  polygon, 
and  Guibas,  Hershberger,  Mitchell,  and  Snoeyink  [29] 
show  how  to  generalize  this  method  even  further  to 
find  in  0(n)  time  a  minimum-link  stabber  for  any  given 
set  of  disjoint  convex  objects  that  must  be  stabbed  in 
some  given  order  (not  necessarily  just  by  increasing 
^-coordinates) .  Robert  and  Toussaint  [52]  study  the 
problem  of  finding  a  line  L  that  minimizes  a  weighted 
minmax  error  measure  to  a  set  of  convex  polygons  in 
O(n^logn)  time. 

There  has  also  been  a  considerable  amount  of  work 
on  finding  a  minimum-link  approximation  to  a  polygo¬ 


nal  curve,  subject  to  some  error  tolerance.  The  prob¬ 
lem  of  fitting  a  minimum-link  convex  polygon  nested 
between  two  given  polygons  was  studied  by  Aggarwal 
et  al.  [5],  who  give  an  O(nlogn)  time  solution  to  this 
problem.  In  addition,  Imai  and  Iri  [34,  35]  give  an  0{n) 
time  method  for  finding  the  minimum-link  function  ap¬ 
proximating  a  given  monotone  chain.  Their  method 
is  very  similar  to  an  0{n)  time  method  independently 
discovered  by  Suri  [54,  55]  for  solving  the  more-general 
problem  of  finding  a  minimum  link  path  joining  two 
points  inside  a  simple  polygon.  There  has  also  been 
some  work  on  approximations  that  are  required  to  use 
a  subset  of  the  endpoints  of  the  given  polygonal  chain. 
For  example,  using  an  approach  of  Imai  and  Iri  [33,  35], 
Toussaint  [57]  and  Melkman  and  O’Rourke  [44]  give 
several  O(n^logn)  time  methods  under  various  met¬ 
rics. 

There  hasn’t  been  much  work  on  three-dimensional 
version  of  these  approximation  problems  with  guaran¬ 
teed  performance  bounds,  however,  although  the  recent 
work  by  Mitchell  and  Suri  [49]  on  a  special  case  of  the 
3-dimensional  function  approximation  problem  is  a  no¬ 
table  exception. 

There  is  also  a  rich  literature  that  studies  minimum- 
link  distance  as  a  metric  in  its  own  right  (e.g.,  see  [6, 
23,  24,  37,  39,  47,  48,  54,  55]). 

1.3  Our  results 

As  mentioned  above,  we  give  an  O(nlogn)  time  algo¬ 
rithm  for  finding  a  best  fc-link  piecewise-linear  function 
approximating  a  set  5  of  n  points  in  the  plane  under 
the  uniform  metric.  Our  method  is  based  upon  new 
geometric  insights  that  allow  us  to  apply  a  novel  plane 
sweep  in  “e  space”  to  enumerate  a  set  of  0(n)  criti¬ 
cal  e  values,  which  we  then  search  in  a  binary  search 
fashion.  This  allows  us  to  restrict  the  range  of  e  values 
containing  e*  to  be  an  interval  [£1,62],  but  it  does  not 
necessarily  give  us  ei  =  £2  =  £*■  To  achieve  this  latter 
result  we  give  additional  geometric  characterizations  of 
a  best  ifc-link  approximation  that  allow  us  to  follow  this 
preprocessing  step  by  several  applications  of  pipelined 
versions  of  the  well-known  parametric  searching  tech¬ 
nique  (e.g.,  see  [2,  3,  4,  11,  13,  14,  15,  41]). 

Admittedly,  the  use  of  this  technique  typically 
makes  an  algorithm  rather  impractical  to  implement. 
But  we  show  that  this  is  not  true  in  our  case,  for  we 
can  design  a  relatively  simple  version  of  our  algorithm 
that  uses  only  the  most  simple  versions  of  parametric 
searching  (which  can  be  made  even  more  practically 
efficient  via  randomization) . 

In  the  section  that  follows  we  give  some  properties 
of  a  best  fc-link  approximation  and  in  Section  3  we  show 
how  to  exploit  these  properties  to  restrict  the  range  of 
candidate  e  values.  We  show  how  to  then  complete  the 


323 


•  •  e  « 


o 


T 

2e 

i 


Figure  1:  An  example  set  S  such  that  5(e*)  has 
a  1-link  ordered  stabber,  but  any  ordered  slabber 
of  S{e)  requires  10  links  if  e  <  e*. 


construction  in  Section  4  by  relying  on  additional  geo¬ 
metric  properties  of  a  best  A;-link  approximation,  which 
we  show  can  be  exploited  in  a  series  of  applications  of 
parametric  searching.  Finally,  we  show  how  to  simplify 
our  implementation  in  Section  5. 


2  Some  Properties  of  a  Best  k- 
Link  Approximation 

Let  S  =  ipi,P2,  ■  ■  ■ , Pn)  be  a left-to-right  ordered  listing 
of  the  points  in  5  and  let  e  >  0  be  given.  So  as  to  for¬ 
mally  define  our  approximation  problem  and  articulate 
some  of  its  important  properties,  let  us  introduce  a  bit 
of  additional  notation.  For  each  point  p^  =  {xi,yi)  in 
S  define  rq  =  {xi,yi  +  e)  and  gi  =  {xi,xji  -  e),  and 
let  5(e)  denote  the  ordered  set  of  vertical  segments 
{uigi,U2g2,  ■  •  ■ , Ungn)-  Thus,  if  we  view  points  as  de¬ 
generate  segments,  then  5  =  5(0).  For  any  ordered  set 
of  disjoint  geometric  objects  A  a  polygonal  chain  C  is 
an  ordered  stabber  if  a  traversal  of  C  intersects  the  ob¬ 
jects  of  A  in  the  given  order  [29].  Finally,  define  F(e) 
to  be  a  minimum-link  ordered  stabber  of  5(e). 

The  formal  problem  we  address  in  this  paper,  then, 
is  to  find  e*,  the  smallest  e  >  0  such  that  F{e)  has 
at  most  k  links.  Formulating  the  problem  in  this  way 
allows  us  to  deal  with  “degenerate”  inputs,  such  as  the 
one  illustrated  in  Figure  1,  where  5(e*)  may  allow  an 
ordered  stabber  with  k'  <  k  links,  but  a  minimum-link 
stabber  of  5(e)  may  require  k  >  k  links  for  any  i  <  e* . 
Thus,  a  best  fc-link  approximation  F  =  F{e*)  may,  in 
fact,  have  fewer  than  k  links  because  of  degeneracies. 
Of  course,  one  can  always  introduce  “dummy”  vertices 
along  F  to  force  its  link-count  to  be  exactly  k  in  such 
a  case. 

2.1  A  Canonical  Form  for  Best  A:-Link 
Approximations 

Let  us  connect  consecutive  g-^'s  and  UiS  so  as  to  form 
two  “parallel”  monotone  chains  U{e)  and  G(e),  with 
U{e)  being  the  upper  chain,  i.e.,  let  us  create  edges 
giQi+i  defining  G(e)  and  uFFM+i  defining  U{e)  for 
i  G  {1,  2, . . . ,  n  —  1}.  One  might  be  tempted  to  think 
that  a  best  k-link  function  F  approximating  5  can  be 


Figure  2:  An  instance  where  a  minimum-link 
stabber  of  5(e)  is  not  confined  to  lie  between  17(e) 
and  G(e). 


constrained  to  lie  between  U{e*)  and  G(e*),  but  this 
is  not  the  case^  (as  shown  in  Figure  2).  This  is  ac¬ 
tually  a  good  thing,  for  otherwise  we  would  run  into 
some  robustness  difficulties,  for  we  would  have  to  use 
a  method  for  finding  a  minimum-link  path  in  a  simple 
polygon  as  a  subroutine,  and,  as  Snoeyink  observers^, 
the  bit  complexity  for  finding  such  a  path  can  be  sign- 
ficantly  larger  than  the  bit  complexity  for  representing 
the  vertices  of  the  input  polygon.  This  is  no  problem 
for  our  method,  however,  for  we  will  be  using  methods 
for  finding  minimum-link  stabbers  as  subroutines  in  our 
algorithm,  and  these  methods  do  not  suffer  from  this 
bit  complexity  blow-up  difficulty. 

To  describe  why  we  can  use  minimum-link  stabber 
methods  as  subroutines,  we  must  show  how  to  restrict 
F  to  a  certain  canonical  form.  For  a  given  e,  let  Tiu{e) 
be  the  shortest  path  from  u\  to  that  does  not  go 
above  17(e)  and  does  not  cross  G(e).  Similarly,  let  7rg(e) 
be  the  shortest  path  from  gi  to  that  does  not  go 
below  G(e)  and  does  not  cross  17(e).  Such  paths  were 
introduced  by  Lee  and  Preparata  [38]  and  are  often 
referred  to  as  geodesic  paths  [1,  7,  10,  25,  28,  45,  46, 
56],  and  the  union  of  two  such  paths  is  often  called  an 
hourglass  [22,  26].  Let  us  therefore  use  H{e)  to  denote 
this  hourglass  7r„(e)U7rg(e).  We  say  that  an  edge  of  i7(e) 
is  an  inflection  edge  if  one  of  its  endpoints  lies  on  U{e) 
while  its  other  endpoint  lies  on  G(e).  Let  I{e)  denote 
the  set  of  all  such  inflection  edges.  (See  Figure  3.) 

We  say  that  two  consecutive  links  pq  and  gr  in  F 
have  a  zig  turn  type  if  r  is  above  the  ray  p^  (i.e.,  pq 
and  qr  form  a  “left  turn”).  Similarly,  two  consecutive 

^We  are  indebted  to  Jack  Snoeyink  (personal  communication) 
for  pointing  this  example  out  to  us. 

^Again,  by  a  personal  communication. 
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Figure  3;  An  example  hourglass.  The  inflection 
edges  are  shown  shadded. 


links  pq  and  gr  in  F'  a  zag  turn  type  if  r  is  below  the  ray 
p^.  This  allows  us  to  characterize  each  link  in  F,  other 
than  the  first  and  last  links,  by  the  turn  types  they 
form  with  their  predecessor  and  successor  links.  For 
example,  a  zig-zag  link  forms  a  left  turn  with  its  pre¬ 
decessor  and  a  right  turn  with  its  successor.  The  next 
lemma  establishes  an  important  relationship  between 
such  links  and  inflection  edges  in  I{t). 

Lemma  2.1:  There  is  a  best  k-link  function  F  approx¬ 
imating  S  such  that 

1.  each  e  G  I{e*)  is  contained  by  the  first  or  last  link 
of  F  or  by  a  zig-zag  or  zag-zig  link  of  F,  and 

2.  the  first  and  last  link  of  F,  as  well  as  each  zig-zag 
and  zag-zig  link  of  F,  contains  an  e  €  h(e*). 

Proof:  (1:)  Suppose  e  G  I(e*),  i.e.,  e  is  an  inflection 
edge  of  H{e*).  Also  suppose,  for  the  sake  of  contra¬ 
diction,  that  e  is  contained  in  no  link  of  the  appropri¬ 
ate  type  in  any  best  fc-link  approximation  F.  We  will 
follow  a  proof  technique  of  Ghosh  [23],  which  involves 
performing  local  perturbations  of  a  candidate  stabber, 
to  derive  a  contradiction.  Since  e  is  an  inflection  edge, 
it  connects  a  Ui  to  a  gj  \  hence,  F  must  intersect  e  along 
some  link  A,  for  F  cannot  go  above  any  Ui  nor  below 
any  .  Let  us  assume  for  the  time  being  that  A  is  nei¬ 
ther  the  first  nor  last  link  of  F,  and  let  A“  denote  A’s 
predecessor  and  let  A+  denote  A’s  successor.  Also,  let 
L(e)  denote  the  line  containing  e  and  let  B  denote  the 
set  of  all  points  on  some  segment  from  a  point  p  on  A~ , 
A,  or  A+  to  p's  nearest  neighbor  on  L{e),  i.e.,  the  points 
“between”  L(e)  and  A",  A,  and  A+.  B  can  contain  no 
points  of  U{e*)  nor  G(e*),  for  if  this  were  not  the  case, 
then  e  would  not  be  an  inflection  edge  (e.g.,  see  Fig¬ 
ure  4a).  Thus,  we  can  “move”  the  common  endpoint 
of  A“  and  A  to  be  on  L(e),  and  the  common  endpoint 
of  A  and  A+  to  be  on  F(e),  keeping  the  rest  of  F  fixed, 
and  we  will  keep  F  as  a  fc-link  approximation  to  S.  To 


Figure  4:  Example  zig-zag  edges.  In  (a)  we  il¬ 
lustrate  why  an  inflection  edge  is  contained  in  a 
zig-zag  link,  and  in  (b)  we  illustrate  why  a  zig-zag 
link  contains  an  inflection  edge. 
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establish  that  A  must  be  a  zig-zag  or  zag-zig  link,  note 
that  the  first  place  where  the  ray  UiQj  crosses  G{e*) 
cannot  be  before  the  first  place  it  crosses  U{e*),  and, 
likewise,  the  first  place  where  the  ray  gjv!i  crosses  U{(*) 
cannot  be  before  the  first  place  it  crosses  G(e*)  (e.g., 
see  Figure  4a).  If  this  were  not  so,  uig]  would  not  be 
an  inflection  edge  of  H{e*).  Thus,  either  A  is  the  first 
or  last  link  of  F  (which  occurs  if  one  of  the  rays  mg]  or 
crosses  neither  U (e*)  nor  G{e*))  or  A  is  a  zig-zag  or 
zag-zig  link  (since  F  is  a  minimum-link  approximation 
to  S(e*)).  Similar  (actually  simpler)  arguments  hold 
for  the  cases  when  A“  or  A"*”  do  not  exist,  and  are  left 
to  the  reader.  Therefore,  there  is  a  best  /c-link  approx¬ 
imation  to  S  that  contains  each  edge  in  S,  and  each 
such  edge  is  contained  in  the  first  or  last  link  of  F  or 
in  a  zig-zag  or  zag-zig  link. 

(2:)  For  the  second  part  of  the  lemma,  let  m  be 
the  minimum  number  of  zig-zag,  zag-zig,  first,  and  last 
links  that  do  not  contain  any  edge  in  /(e*),  taken  over 
all  best  A:-link  approximations  to  S  satisfying  part  one 
of  the  lemma  (which  we  have  just  shown  to  be  true). 
In  addition,  let  A  =  ^  be  one  of  these  m  links.  Let 
us  assume  for  the  time  being  that  A  is  zig-zag  link  in 
F.  Let  X~  =  fp  denote  the  predecessor  of  A  in  F  and 
let  A+  =  qs  denote  the  successor  of  A  in  F.  Since 
F  is  a  minimum-link  path,  the  line  rs  must  intersect 
both  {/(e*)  and  G(e*)(e.g.,  see  Figure  4b).  But  this 
implies  that  F  crosses  an  inflection  edge,  which  is  a 
contradiction;  hence,  we  establish  the  second  part  of 
the  lemma  for  this  case.  The  proofs  for  the  other  cases 
are  similar;  hence,  this  establishes  the  lemma.  □ 

Having  established  an  important  property  of  some 
of  the  links  in  a  best  /c-link  function  approximation  to 
S,  let  us  now  turn  to  the  problem  of  enumerating  these 
edges,  and  in  the  process  we  will  also  restrict  the  range 
of  e’s  that  allow  a  /c-link  approximation. 

3  Finding  the  Inflection  Edges 

We  say  that  an  e  is  geodesic- critical  if  H{e)  has  I  edges, 
but  H{i)  has  fewer  than  I  edges  for  e  >  e.  Our  method 
for  finding  all  the  inflection  edges  is  to  determine  an 
interval  [ei,e2]  that  contains  e*  and  is  such  that  H{ti) 
is  combinatorially  equivalent  to  F(e2).  This  will  allow 
us  to  determine  all  inflection  edges  that  F  must  con¬ 
tain.  Our  procedure  is  conceptually  quite  simple.  First 
we  enumerate  all  0(n)  geodesic-critical  e  values,  and 
then  we  perform  a  binary  search  among  these  values  to 
determine  the  interval  [ei,e2]  containing  e*. 


3.1  Enumerating  all  geodesic-critical  e 
values 

Our  method  for  enumerating  all  geodesic-critical  e  val¬ 
ues  is  based  upon  a  sweep  through  “e  space.”  We  main¬ 
tain  the  hourglass  H{e)  =  7r„(e)  U  TTg{e)  while  taking  e 
from  0  to  -boo,  stopping  at  each  geodesic-critical  e  value 
along  the  way.  To  simplify  the  discussion,  however,  let 
us  concentrate  on  the  problem  of  maintaining  7r„(e), 
so  that  we  restrict  our  notion  of  geodesic-critical  e’s  to 
those  that  change  7r„(e);  the  method  for  maintaining 
Tigie)  is  similar.  Initially,  for  e  =  0,  7r„(e)  is  the  chain 
U(e)  =  G(e);  hence,  it  consists  of  n  —  1  edges.  If  we 
then  increase  e  by  an  “infinitesimal”  amount  we  find 
that  some  of  the  vertices  of  7r„(e)  lie  on  {7(e)  while  oth¬ 
ers  lie  on  G(e).  For  any  vertex  p  on  7r„(e),  if  one  of 
p’s  adjacent  vertices  on  7r„(e)  lies  on  the  chain  oppo¬ 
site  from  the  chain  that  p  lies  on,  then  we  say  that  p 
is  a  pinch  vertex.  For  each  pinch  vertex  p  on  7r„(e), 
let  q  and  r  denote  p’s  adjacent  vertices  on  TTuie),  and 
compute  the  e  >  e  value  at  which  p  would  cease  to  be  a 
pinch  vertex  if  we  were  to  restrict  U (e)  and  G(e)  to  that 
portion  of  the  plane  bounded  by  the  lines  x  =  x{q)  and 
X  —  x{r)  inclusive,  where  x{t)  denotes  the  a;-coordinate 
of  a  point  t.  Call  this  e  value  locally- critical  for  p,  and 
let  E  be  the  set  of  all  e’s  that  are  locally-critical  for 
pinch  vertices  on  7r„(e). 

Lemma  3.1:  The  smallest  e  in  E  is  the  smallest 
geodesically-critical  value  bigger  than  e. 

Proof:  Let  e'  be  the  smallest  geodesically-critical  value 
bigger  thane.  If  we  were  to  increase  e'  “infinitesimally,” 
then,  by  definition,  7r„(e')  would  have  at  least  one  fewer 
edge.  For  this  to  occur,  two  consecutive  edges  of  nu{e') 
would  have  to  be  replaced  by  a  single  edge.  Thus,  the 
vertex  incident  upon  the  two  removed  edges  is  a  pinch 
vertex;  hence  e'  is  in  E.  Now,  let  e"  be  the  smallest 
value  in  E.  Since  there  is  no  value  in  E  smaller  than  e" 
in  F,  7r„(e)  does  not  change  for  e  G  (e,e"].  Thus,  there 
is  a  global  change  to  7ru(e)  for  e  >  e",  which  implies  that 
e"  is  geodesically-critical.  Therefore,  e'  =  e",  which 
completes  the  proof.  □ 

Our  method  for  maintaining  7r^;(e),  then,  is  as  fol¬ 
lows.  We  store  the  values  belonging  to  F  in  a  priority 
queue  that  supports  the  operations  of  insert,  delete, 
and  extract-min  in  O(logn)  time  (e.g.,  see  [17]).  While 
F  is  not  empty,  we  extract  the  smallest  e  in  F,  perform 
the  modification  of  7r„(e)  implied  by  this  geodesically- 
critical  value,  and  then  update  F  to  reflect  the  new 
geodesic  path.  This  update  involves  examining  the  two 
vertices  of  7r„(e)  that  now  become  adjacent  and  up¬ 
dating  F  accordingly.  If  either  of  them  were  previ¬ 
ously  pinch  vertices,  then  we  remove  its  corresponding 
locally-critical  value  from  F.  Likewise,  if  either  of  them 
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becomes  a  pinch  vertex  after  performing  the  update 
for  e,  then  we  insert  its  new  locally-critical  value  into 
E.  Since  we  reduce  by  one  the  number  of  edges  of  the 
geodesic  path  with  each  event  in  E,  the  total  number 
of  events  must  be  0{n)\  hence,  the  total  time  to  enu¬ 
merate  all  the  geodesically-critical  values  is  O(nlogn). 

Given  these  geodesically-critical  values  it  is  then  a 
simple  manner  to  determine  the  consecutive  pair  that 
contains  e*  by  using  the  method  of  Hershberger  and 
Snoeyink  [32],  Guibas  et  al.  [29],  or  Hakimi  and  Schme- 
ichel  [30]  to  drive  a  binary  search  among  the  set  of 
geodesically-critical  values.  Using  one  of  these  simple 
algorithms,  all  of  which  are  based  upon  the  greedy 
method,”  to  test  if  a  given  e  is  smaller  than  e*  takes 
0(n)  time.  This  implies  that  we  can  determine,  in 
O(nlogn)  time,  the  combinatorial  structure  of  7r„(e*), 
and,  as  mentioned  above,  a  similar  procedure  gives  us 
the  combinatorial  structure  of  7rg(e*).  Therefore,  we 
can  identify  all  the  inflection  edges  in  H{e  ),  each  of 
which  F  must  contain,  by  Lemma  2.1. 

All  that  is  left,  then,  is  for  us  to  determine  all  the 
links  of  F  that  do  not  contain  inflection  edges. 

4  Completing  the  Construction 

Whereas  we  used  geodesic  paths  to  find  the  zig-zag  and 
zag-zig  links,  to  complete  the  construction  we  use  a  re¬ 
lated  structure — the  visibility  graph.  In  particular,  re¬ 
call  that  the  visibility  graph  of  a  set  of  line  segments  R 
has  a  vertex  for  each  endpoint  of  a  segment  in  R  and  an 
edge  for  each  pair  (p,  q)  such  that  the  line  segment  pq 
does  cross  any  segment  in  R,  although  we  allow  pq  to 
intersect  segment  endpoints  and  even  contain  the  seg¬ 
ment  pq  if  it  is  in  R.  It  is  well-known,  for  example,  that 
geodesic  paths  always  follow  visibility  graph  edges  .  In 
our  case  we  are  interested  in  the  visibility  graph  defined 
on  the  segments  in  {7(e)uG(e).  We  say  that  an  line  seg¬ 
ment  s  is  U -anchored  (resp.,  G -anchored)  if  s  contains 
an  edge  e  in  the  visibility  graph  of  U{e*)L>  G{e*)  such 
that  both  of  e’s  vertices  lie  on  [/(e*)  (resp.,  G(e*)).  The 
following  lemma  establishes  an  important  relationship 
between  a  best  fc-link  function  approximation  to  S  and 
these  anchored  links. 

Lemma  4.1:  Any  canonical  best  k-link  approximation 
to  S  has  a  U -anchored  link  containing  a  vertex  of  G(e*) 
or  a  G-anchored  link  containing  a  vertex  ofU{e*). 

Proof:  The  proof  follows  immediately  from  the  char¬ 
acterization  lemma  of  Hakimi  and  Schmeichel  [30].  □ 

Our  method  for  enumerating  all  such  edges  is  based 
upon  several  applications  of  the  parametric  searching 

^For  more  information  about  visibility  graphs  and  their  prop¬ 
erties  see  the  excellent  book  by  O’Rourke  [50]. 


technique,  as  optimized  by  Cole  [13,  14].  One  such  op¬ 
timization  applies  to  any  situation  that  involves  a  set 
Z  =  {zi,Z2,---,Zm}  oim  independent  binary  searches 
among  an  ordered  set  A  =  (ai ,  02, . . . ,  Un)  of  n  items, 
where  each  comparison  c{ai,Zj)  is  parameterized  by  e. 
The  outcome  of  c{ai,Zj)  depends  upon  which  of  a  con¬ 
stant  number  of  intervals,  determined  by  Oi  and  Zj, 
contain  e* .  If  it  takes  T  steps  to  determine  if  e*  <  e, 
for  a  particular  e,  then  this  parametric  searching  tech¬ 
nique  allows  us  to  perform  all  m  binary  searches  in 
0((T  -b  m)(logn  4-  logm))  time  (see  Cole  [13]  for  de¬ 
tails).  It  also  gives  us  an  interval  [ei,e2]  containing  e* , 
which  is  the  intersection  of  all  the  intervals  determined 
to  contain  e*  during  the  m  searches. 

The  second  optimization  applies  when  one  wishes 
to  sort  a  set  A  =  {oi,  02, . . . ,  fln})  where,  as  in  the 
previous  case,  each  comparison  c(ai,aj)  is  parameter¬ 
ized  by  e  so  that  the  outcome  of  c{ai,aj)  depends  upon 
which  of  a  constant  number  of  intervals,  determined 
by  ai  and  a^,  contain  e*.  If  it  takes  T  steps  to  deter¬ 
mine  if  e*  <  e,  for  a  particular  e,  then  this  parametric 
searching  technique  allows  us  to  sort  the  elements  of  A 
in  0((T -t- n)  log n)  time  (see  Cole  [13,  14]  for  details). 
Also,  this  method  gives  an  interval  [£1,62]  containing  e*, 
which  is  the  intersection  of  all  the  intervals  determined 
to  contain  e*  during  the  sort. 

The  challenge,  of  course,  in  applying  these  tech¬ 
niques  is  to  design  the  parameterized  sorts  and  searches 
so  that  the  result  is  meaningful.  Let  us  therefore  turn 
to  our  application  of  these  techniques. 

4.1  Finding  e* 

Our  method  for  completing  the  construction  is  to 
perform  a  parametric  search  for  a  [/-anchored  or  G- 
anchored  link  satisfying  Lemma  4.1.  Observe  that  find¬ 
ing  such  a  link  effectively  “clamps”  [/(e)  and  G(e)  at 
e  =  e*.  Also  note  that  we  can  use  the  linear-time 
method  of  Hershberger  and  Snoeyink  [32],  Guibas  et 
al.  [29],  or  Hakimi  and  Schmeichel  [30]  to  resolve  com¬ 
parisons  and  to  give  us  the  final  approximation  F  once 
we  have  narrowed  the  interval  of  candidate  e  values  to 

To  simplify  the  discussion  let  us  concentrate  on  the 
problem  of  determining  a  [/-anchored  link  that  contains 
a  vertex  of  G(e*)|  the  method  is  similar  for  visibility 
edges  anchored  on  G(e).  Our  algorithm  will  actually 
“dovetail”  the  search  for  a  [/-anchored  link  containing 
a  vertex  of  G(e*)  with  the  search  for  a  G-anchored  link 
containing  a  vertex  of  U{e*). 

Call  a  vertex  p  of  U{e)  a  left  inflection  (resp.,  right 
inflection)  vertex  if  p  is  the  left  (resp.,  right)  endpoint 
of  an  inflection  edge  of  H{e*).  Following  an  approach 
similar  to  that  used  by  Ghosh  [23],  consider  a  portion  of 
U{e)  between  a  left  inflection  vertex  p  and  the  leftmost 
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Figure  5:  A  J7-anchored  edge  determining  e*. 


right  inflection  vertex  q  to  the  right  of  p.  Denote  this 
portion  of  U{t)  as  Note  that  the  portion  of 

TTu{e*)  between  p  and  g  is  a  convex  chain  of  edges  such 
that  each  consecutive  pair  forms  a  “right  turn.”  De¬ 
note  this  portion  of  7ru(e*)  as  7r[p,  g]  (but  note  that  we 
have  yet  to  determine  e* — at  this  point  we  only  know 
the  combinatorial  structure  of  nu{e*)).  Finally,  observe 
that  if  17[p^,](e)  contains  the  endpoints  v  and  z  of  the 
17-anchored  link  that  we  seek,  then  vz  must  be  equal 
to  the  the  common  tangent  of  v  and  7r[p,  g]  as  well  as 
z  and  7r[p,  g],  (See  Figure  5.)  Moreover,  if  the  link  sat¬ 
isfying  Lemma  4.1  is  a  17-anchored  link,  then  it  must 
contain  such  a  tangent  edge  for  some  17[p_,](e). 

Our  first  parametric  search  therefore  is  to  determine 
for  each  vertex  v  on  17[p^,](€)  its  vertex  of  tangency,  t{v), 
with  the  convex  chain  7r[p,  g]  at  e  =  e*  (see  Figure  5). 
In  this  case  we  can  use  binary-search-based  parametric 
searching  [13]  applied  to  a  well-known  “binary  search” 
tangent-finding  method  (e.g.,  see  [21,  51])  to  find  all 
such  tangents  in  O(nlogn)  time.  This  may  still  not 
restrict  our  interval  of  e  values  to  [e*,e*],  however,  for 
a  vertex  v  will  have  the  same  vertex  of  tangency,  t{v), 
over  a  range  of  e  values. 

For  each  vertex  w  on  7r[p,  g],  collect  each  vertex  v 
on  U^p,q]{e)  such  that  w  =  t{v)  into  two  sets — l(w), 
containing  the  vertices  to  the  left  of  w,  and  r{w),  con¬ 
taining  the  vertices  to  the  right  of  re.  If  w  is  the  vertex 
of  tangency  for  the  U -anchored  edge  we  seek,  then  that 
edge  is  determined  by  a  u  e  l{w)  and  a  z  £  r{w). 
In  order  for  these  two  vertices  to  be  able  “search  for 
each  other,”  however,  we  must  first  order  the  vertices 
in  l{w)  and  r{w)  radially  around  w.  To  accomplish  this 
we  make  a  second  application  of  parametric  searching, 
this  time  using  the  second  version,  based  upon  a  paral¬ 
lel  sorting  algorithm  [13,  14],  together  with  the  linear¬ 
time  method  of  Hershberger  and  Snoeyink  [32],  Guibas 
et  al.  [29],  or  Hakimi  and  Schmeichel  [30]  for  resolving 


comparisons,  to  sort  all  l{w)  and  r{w)  lists  in  O(nlogn) 
time.  This  may  still  not  restrict  the  range  of  e  values 
to  [e*,e*],  however,  for  the  vertices  in  an  l{w)  and  r{w) 
may  have  the  same  ordering  with  respect  to  w  over  a 
range  of  e  values. 

Therefore,  to  complete  the  process  we  must  perform 
one  more  application  of  parametric  searching  where  we 
perform  a  binary  search  in  l{w)  for  each  z  G  r{w)  to 

locate  the  vertex  in  l{w)  hit  by  the  ray  zt{z],  if  it  exists. 
Given  that  each  l[w)  is  now  sorted,  we  can  perform  all 
of  these  binary-search  parametric  searches  in  0{n  log  n) 
time.  Since  one  of  these  searches  (or  a  corresponding 
search  for  G- anchored  edges)  must  succeed,  we  finally 
are  guaranteed  to  have  restricted  the  range  of  candi¬ 
date  e  values  to  the  interval  [e*,e*].  Moreover,  seeing 
how  the  last  comparison  in  this  parametric  search  was 
resolved  using  a  minimum-link  ordered  stabber  algo¬ 
rithm,  this  completes  the  construction,  giving  us  the 
following  theorem. 

Theorem  4.2:  Given  a  set  S  of  n  points  in  the  plane, 
and  an  integer  parameter  1  <  k  <  n  -  1,  one  can  con¬ 
struct  a  best  k-link  approximation  to  S  under  the  uni¬ 
form  metric  in  0(n  log  n)  time. 

5  Simplifying  the  Implementa¬ 
tion 

Although  our  method  for  finding  a  best  fe-link  approx¬ 
imation  to  S  is  conceptually  simple  at  a  high  level,  im¬ 
plementing  it  in  practice  would  probably  not  be  so  easy. 
The  chief  difficulty  arises  from  the  (up  to)  three  calls 
to  parametric  searching  that  are  made  at  the  end  of 
the  algorithm;  all  the  other  steps  would  be  relatively 
straightforward  to  implement.  But  those  calls  to  para¬ 
metric  searching,  especially  the  version  based  upon  a 
parametric  sorting,  would  be  a  challenge  to  implement, 
and  the  constant  factors  they  would  imply  in  the  run¬ 
ning  time  would  not  be  very  small.  Fortunately,  as 
we  show  in  this  section,  we  can  considerably  simplify 
these  calls  to  parametric  searching.  In  particular,  we 
show  how  to  eliminate  the  (second)  call  based  upon 
sorting  all  together,  and  we  observe  how  one  can  sim¬ 
plify  all  our  calls  to  parametric  searching  through  a 
simple  randomization  technique.  The  resulting  algo¬ 
rithm  will  still  be  guaranteed  to  find  a  best  fc-link  ap¬ 
proximation  to  S,  however,  for  the  randomization  will 
only  impact  the  running  time  of  our  method,  not  its 
correctness.  Nevertheless,  it  will  result  in  a  method 
that  runs  in  O(nlogn)  time  with  very  high  probabil¬ 
ity.  Moreover,  the  (expected)  constant  factors  will  be 
reasonably  small. 

Let  us  begin  by  describing  how  we  can  eliminate 
the  call  to  sorting-based  parametric  searching,  which 
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Figure  6:  An  illustration  of  why  lvis{w)  is  already 
sorted. 


comprises  the  second  call  to  parametric  searching  in  our 
algorithm.  Recall  that  after  we  have  made  our  first  call 
to  (binary-search  based)  parametric  searching,  we  have 
determined  for  every  vertex  v  on  f/[p,,](€)  its  tangent, 
t{v)  on  7r|jp,  ^].  Previously,  we  then  collected,  for  each 
w  on  7r[p,  g],  all  the  vertices  on  f^[p,q](€)  whose  tangent 
is  w  to  the  “left  set”  l{w)  and  the  “right  set”  r{w),  and 
we  sorted  the  vertices  of  l{u))  radially  around  iv  (using 
sorting-based  parametric  searching)  so  that  we  can  then 
do  a  binary  search  in  l{w)  for  each  vertex  in  r(w)  (using 
binary-search  based  parametric  searching).  Observe, 
however,  that  we  do  not  need  all  the  vertices  of  l{w)  to 
perform  this  search;  we  need  only  those  vertices  that 
are  visible  from  w  on  U[p^q]{e*).  Thus,  let  us  use  Ivisiw) 
to  denote  the  set  of  vertices  in  l{w)  that  are  visible 
from  w  on  (j](e*).  If  we  can  determine  these  visible 
vertices,  then,  as  the  following  lemma  shows,  we  can 
sort  them  without  resorting  to  parametric  searching. 

Lemma  5.1:  Sorting  the  vertices  of  Zvis(a:)  r&dinlly 
around  w  as  they  appear  on  f^[p,g](^*)  the  same 

order  as  a  listing  of  the  vertices  of  Ivis  {w)  by  increasing 
X -coordinates. 

Proof:  Suppose,  for  the  sake  of  a  contradiction,  that 
there  are  two  vertices  u  and  z  in  Ivisiw)  such  that  u  is 
before  z  in  a  radial  listing  around  w  as  they  appear  on 
U[p,g]{e*),  but  x{z)  <  x{u).  (See  Figure  6.)  Since  w 
is  below  the  curve  17[p,,](e*),  this  implies  that  C/[p,,](e*) 
intersects  the  segment  Jw.  But  this  contradicts  the 
fact  that  2:  is  visible  from  w\  hence,  this  establishes  the 
lemma.  □ 

Thus,  it  is  sufficient  for  us  to  determine  the  mem¬ 
bers  of  lvis{w),  for  each  w  on  7r[p, g],  since  they  are 
already  given  by  increasing  x-coordinates  (for  that  is 
how  the  vertices  are  ordered  in  S). 


So,  let  us  turn  to  the  problem  of  determining  the 
members  of  Wis{w)  for  some  given  w  on  7r[p,g].  Let 
w'  denote  the  vertex  on  L’[p,g](e)  directly  above  w  (i.e., 
if  ra  =  gi,  then  w'  =  Ui).  A  simple  consequence  of 
the  proof  of  Lemma  5.1  is  that  a  vertex  v  in  l{w)  is 
in  Ivisiw)  if  and  only  if  the  geodesic  path  from  v  to 
w'  is  above  the  line  segment  vw.  Indeed,  it  is  suffi¬ 
cient  that  the  first  edge  e{v)  in  this  path  be  above  the 
segment  vw.  Since  each  such  geodesic  path  remains 
unchanged  for  e  £  [ei ,  £2]  (because  it  is  a  sequence  of 
left  turns  that  occur  at  vertices  of  U[p^q]{e)),  we  can 
determine  e{v)  for  each  such  v  without  knowing  the 
value  of  e* .  Specifically,  we  can  use  the  data  structure 
of  Guibas  and  Hershberger  [27],  as  simplified  by  Her¬ 
shberger  [31],  to  determine  each  e{v)  in  O(logn)  time. 
This  data  structure  is  relatively  simple  to  construct  and 
query,  especially  since  each  U[p^q]{e)  is  a  single  mono¬ 
tone  chain. 

Note  that  each  e{v)  determines  a  critical  iy  value 
such  that  V  is  visible  if  e  >  e,  and  v  is  not  visible  oth¬ 
erwise.  Thus,  once  we  have  determined  all  the  e(t))’s, 
we  can  then  determine  which  x’s  are  visible  from  their 
respective  w  =  t{v)  vertices  simply  by  resolving  all  of 
the  e’s  against  e*  using  the  method  of  Hershberger  and 
Snoeyink  [32],  Guibas  et  al.  [29],  or  Hakimi  and  Schme- 
ichel  [30]  to  drive  a  binary  search.  Since  there  are  0{n) 
such  parameterized  comparisons,  and  each  resolution 
requires  0(n)  time,  the  total  time  needed  to  resolve  all 
of  these  comparisons  is  0(n  log  n).  This  implies  that 
we  can  substitute  the  most  simple  form  of  paramet¬ 
ric  searching  for  the  more-complicated  sorting-based 
method  [13,  14].  The  resulting  algorithm  would  still 
run  in  0(n  log  n)  time.  The  only  slightly  impractical 
steps  in  this  algorithm  are  repeated  computations  of 
medians  (e.g.,  of  the  remaining  unresolved  e’s  in  each 
iteration  of  the  above  binary  search),  for  which  the  best 
deterministic  algorithm  has  a  relatively  large  constant 
(e.g.,  see  [17]). 

But  the  computation  of  a  median  or  weighted  me¬ 
dian  (which  is  the  least-efficient  step  in  the  binary- 
search  based  parametric  searching  method  of  Cole  [13]) 
can  be  significantly  improved  in  practice  through  the 
use  of  randomization  (e.g.,  see  [17]).  The  effect  in  our 
case  is  that  the  resulting  algorithm  will  still  be  guaran¬ 
teed  to  find  a  best  fc-link  approximation,  but  it  will  not 
be  guaranteed  to  run  in  0(n  log  n)  time.  Nevertheless, 
it  will  run  in  0(n  log  n)  time  with  very  high  probabil¬ 
ity,  and  the  (expected)  constants  factors  in  the  running 
time  will  be  very  reasonable.  Thus,  we  suggest  such 
a  use  of  randomization  in  any  implementation  of  our 
method. 
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6  Conclusion  and  Directions  for 
Future  Work 

We  have  given  an  0(n  log  n)-time  method  for  finding 
a  best  fc-link  approximation  to  set  of  n  points  in  the 
plane  under  the  uniform  metric,  and  we  have  even 
given  a  method  for  efficiently  implementing  it  in  prac¬ 
tice.  This  suggests  a  number  of  interesting  directions 
for  future  work.  Some  possibilities  include  examining 
metrics  other  than  the  uniform  metric,  approximating 
with  higher-degree  spline  functions,  and  approximat¬ 
ing  more-general  types  of  geometric  objects  (such  as 
segments  joined  in  a  polygonal  path).  As  with  our  re¬ 
sults,  the  general  goal  should  be  to  find  a  best  lb-piece 
approximation,  since  this  is  the  version  of  the  prob¬ 
lem  driven  by  bounds  placed  upon  the  computational 
resources  needed  to  represent  the  approximation. 
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Applications  of  Weighted  Voronoi  Diagrams  and  Randomization 

to  Variance-Based  /^-Clustering 

(Extended  Abstract) 

Mary  Inaba*,  Naoki  Katoh^  and  Hiroshi  Imai* 


Abstract 


In  this  paper  we  consider  the  fc-clustering  problem  for 
a  set  S  of  n  points  p,-  =  (x,  )  in  the  d-dimensional  space 
with  variance- based  errors  as  clustering  criteria,  moti¬ 
vated  from  the  color  quantization  problem  of  comput¬ 
ing  a  color  lookup  table  for  frame  buffer  display.  As 
the  inter-cluster  criterion  to  minimize,  the  sum  of  intra- 
cluster  errors  over  every  cluster  is  used,  and  as  the  intra- 
cluster  criterion  of  a  cluster  Sj, 

Pi€Sj 

is  considered,  where  ||  •  ||  is  the  L2  norm  and  x{Sj)  is 
the  centroid  of  points  in  Sj,  i.e.,  (l/|5j|)  Sp.-es  ^he 
cases  of  a  =  1,2  correspond  to  the  sum  of  squared  errors 
and  the  all-pairs  sum  of  squared  errors,  respectively. 

The  A:-clustering  problem  under  the  criterion  with 
a  =  1,2  are  treated  in  a  unified  manner  by  charac¬ 
terizing  the  optimum  solution  to  the  /^-clustering  prob¬ 
lem  by  the  ordinary  Euclidean  Voronoi  diagram  and  the 
weighted  Voronoi  diagram  with  both  multiplicative  and 
additive  weights.  With  this  framework,  the  problem 
is  related  to  the  generalized  primary  shutter  function 
for  the  Voronoi  diagrams.  The  primary  shutter  func¬ 
tion  is  shown  to  be  which  implies  that,  for 

fixed  k,  this  clustering  problem  can  be  solved  in  a  poly¬ 
nomial  time.  For  the  problem  with  the  most  typical 
intra-cluster  criterion  of  the  sum  of  squared  errors,  we 
also  present  an  efficient  randomized  algorithm  which, 
roughly  speaking,  finds  an  e-approximate  2-clustering 
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in  0{n{l/eY)  time,  which  is  quite  practical  and  may 
be  used  to  real  large-scale  problems  such  as  the  color 
quantization  problem. 

1  Introduction 

Clustering  is  the  grouping  of  similar  objects  and  a  clus¬ 
tering  of  a  set  is  a  partition  of  its  elements  that  is  cho¬ 
sen  to  minimize  some  measure  of  dissimilarity.  It  is  very 
fundamental  and  used  in  various  fields  in  computer  sci¬ 
ence  such  as  pattern  recognition,  learning  theory,  im¬ 
age  processing  and  computer  graphics.  There  are  vari¬ 
ous  kinds  of  measure  of  dissimilarity,  called  criteria,  in 
compliance  with  the  problem. 

Definition  of  the  ^-clustering  problem:  The  gen¬ 
eral  /s-clustering  problem  can  be  defined  as  follows.  A 
fc-clustering  is  a  partition  of  the  given  set  5  of  n  points 
Pi  =  (iCi)  {i  =  1, . . .  ,n)  in  the  d-dimensional  space  into 
k  disjoint  nonempty  subsets  5i, . . . ,  5*,,  called  clusters. 
A  /c-clustering  is  measured  by  the  following  two  criteria. 
(Intra-cluster  criterion)  For  each  cluster  Sj,  the 
measure  (or  error)  Intra(5'j)  of  Sj,  representing  how 
good  the  cluster  Sj  is,  is  defined  appropriately  by 
applications.  Typical  intra-cluster  criteria  are  the  di¬ 
ameter,  radius,  variance,  variance  multiplied  by  |5j| 
(sum  of  squared  errors)  and  variance  multiplied  by 
(all-pairs  sum  of  squared  errors)  of  point  set  Sj. 
(Inter-cluster  criterion)  The  inter-cluster  criterion 
defines  the  total  cost  of  the  fc-clustering,  which  is  a 
function  of  Intra(5j)  {j  =  l,...,k)  and  is  denoted 
by  Inter(?/i,?/2,...,t/t)  where  yj  -  Intra(5j).  Typi¬ 
cal  function  forms  are  max{  yj  |  j  =  1, . . . ,  A;  }  and 

Ek 

i=l  Vk- 

Then,  the  A:-clustering  problem  is  to  find  a  Ai-clustering 
which  minimizes  the  inter-cluster  criterion: 
min{  Inter(Intra(S'i),.  •  •  ,Intra(5fc))  | 

A:-clustering  (5i, . . . ,  5*)  of  5  } 

Previous  results  concerning  diameter  and  ra¬ 
dius:  In  computational  geometry,  many  results  have 


332 


been  obtained  for  the  clustering  problem.  The  di¬ 
ameter  and  radius  problems  are  rather  well  studied. 
They  include  an  0(n  log  n)-time  algorithm  for  finding 
a  2-clustering  of  n  points  in  the  plane  which  mini¬ 
mizes  the  maximum  diameter  (Asano,  Bhattacharya, 
Keil  and  Yao  [1]),  an  0{n^  \og^  n)-time  algorithm  for 
finding  a  3-clustering  of  planar  point  set  which  mini¬ 
mizes  the  maximum  diameter  (Hagauer  and  Rote  [5]), 
and  an  0(n  log^  n/loglogn)-time  algorithm  for  finding 
a  2-clustering  which  minimizes  the  sum  of  the  two  di¬ 
ameters  (Hershberger  [10]).  When  k  is  regarded  as  a 
variable,  most  ^-clustering  problems  become  NP-hard 
(e.g.,  see  Megiddo  and  Supowit  [14],  Feder  and  Greene 
[4]).  For  fixed  k,  the  fc-clustering  problem  using  the 
diameter  and  radius  as  the  intra-cluster  criterion  and 
a  monotone  function,  including  taking  the  maximum 
and  the  summation,  as  the  inter-cluster  criterion  can 
be  solved  in  a  polynomial  time  (Capoyleas,  Rote  and 
Woeginger  [3]). 

There  are  also  proposed  approximate  algorithms  for 
the  diameter  and  radius  whose  approximation  ratio  is 
theoretically  guciranteed.  Feder  and  Greene  [4]  gave  op¬ 
timal  approximate  algorithms  whose  running  time  is 
0{n\ogk)  for  n  points  in  the  d-dimensional  space  for 
fixed  d,  and  whose  worst-case  ratio  is  2. 

Motivation  for  the  variance- based  clustering:  In 
this  paper,  we  consider  the  fe-clustering  problem  with 
variance-based  meaisures  as  an  intra-cluster  criterion. 
This  is  motivated  from  the  color  quantization  prob¬ 
lem  of  computing  a  color  lookup  table  for  frame  buffer 
display.  Typical  color  quantization  problems  cluster 
hundreds  of  thousands  of  points  in  the  RGB  three- 
dimensional  space  into  k  =  256  clusters.  Since  k  is 
large,  a  top-down  approach  to  recursively  divide  the 
point  set  into  2  clusters  is  mostly  employed.  In  this 
problem,  the  diameter  and  rcidius  Me  not  suited  as  an 
intra-cluster  criterion,  and  the  variance- based  (Wan, 
Wong  and  Prusinkiewicz  [15])  and  ii-based  (median 
cut;  Heckbert  [9])  criteria  are  often  used.  In  [15],  [9], 
the  top-down  approach  is  used  and  in  solving  the  2- 
clustering  problem  both  only  treat  separating  planes 
orthogonal  to  some  coordinate  axis.  These  algorithms 
are  implemented  in  rlequemt  of  Utah  Raster  Toolkit, 
and  ppmquant  of  X11R5  or  tiffmediem  of  Tiff  Soft.  Al¬ 
though  these  implementations  run  rather  fast  in  prac¬ 
tice,  roughly  speaking  in  0{n  log  n)  time,  there  is  no 
theoretical  guarantee  about  how  good  their  solution  k- 
clusterings  are. 

Rigorous  definition  of  the  variance- based  clus¬ 
tering:  Therefore,  it  is  required  to  develop  a  fast  2- 


clustering  algorithm  and  to  determine  the  complexity 
of  the  k-clustering  problem  for  the  variance-based  case. 
Before  describing  the  existing  computational-geometric 
results  concerning  variance-based  case,  let  us  define  the 
variance-based  intra-cluster  criterion  in  a  rigorous  way. 
The  variance  Var(S)  of  points  pi  =  (a:,)  in  S  is  defined 

I*'  Pies 

where  x{S)  is  the  centroid  of  S: 

®('^)  = 

'  '  Pies 

For  a  parameter  a,  define  Var“(5)  by 
Var“(S)  =  |5|“Var(5). 

Var**  is  exactly  the  variance  itself.  Var^  is  represented 
as 

Var^(S)  =  jjaji  -s(5)|p 

Pies 

and  hence  is  the  sum  of  squared  errors  with  respect  to 
the  centroid  of  S.  Var^  is  represented  as 

Var^(5)=  |5]^||xi-4(5)|p 

Pi6S 

=  Y 

Pi, Pies,  i<i 

and  hence  is  the  all-pairs  sum  of  squared  errors  in  5. 
Adopting  Var"  as  the  intra-cluster  metric,  as  a  becomes 
larger,  the  sizes  of  clusters  in  an  optimum  fc-clustering 
becomes  more  balanced. 

Previous  results  on  the  vciriance-based  cluster¬ 
ing:  For  the  variance- based  criteria,  unlike  the  di¬ 
ameter  and  radius,  the  fc-clustering  problem  cidopting 
the  maximum  function  as  the  inter-cluster  criterion  be¬ 
comes  hard  to  solve.  For  this  inter-cluster  criterion 
with  the  all-pairs  sum  of  squared  errors,  only  a  pseudo- 
polynomial  approximation  scheme  is  known  (Hasegawa, 
Imai,  Inaba,  Katoh  and  Nakano  [7]).  Also,  in  applica¬ 
tions  such  as  the  color  quantization  problem,  the  sum¬ 
mation  function  is  adopted  as  an  inter-cluster  crite¬ 
rion  [15].  In  this  paper,  we  consider  only  the  summa¬ 
tion  case,  that  is,  the  A:-clustering  problem  to  minimize 
the  summation  of  variance-based  intra-cluster  costs  over 
clusters. 

For  the  variance-bcised  clustering  problem  with  the 
summation  function  as  an  inter-cluster  metric,  the  fol¬ 
lowing  are  known.  Concerning  Var^ ,  the  sum  of  squared 
errors,  it  is  well  known  that  an  optimum  2-clustering  is 
linearly  separable  and  that  an  optimum  k-clustering  is 
induced  by  the  Voronoi  diagram  generated  by  k  points 
(e.g.,  see  [2,  7, 15]).  Using  this  characterization  together 
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with  standard  computational-geometric  techniques,  the 
2-clustering  problem  with  Var^  as  the  intra-cluster  met¬ 
ric  can  be  solved  in  O(n^)  time  and  0(n)  space,  and  the 
fc-clustering  problem  is  solvable  in  a  polynomial  time 
when  k  is  fixed  [7].  Concerning  Vcir^,  the  all-pairs  sum 
of  squared  errors,  an  optimum  2-clustering  is  circularly 
separable  (Boros  and  Hammer  [2]),  and  a  finer  charac¬ 
terization  by  using  the  higher-order  Voronoi  diagram  is 
given  in  [7].  Using  this  characterization,  the  2-clustering 
problem  with  Var^  as  the  intra-cluster  metric  can  be 
solved  in  time,  and  also  it  is  seen  that  the 

A:-clustering  problem  for  this  case  can  be  solved  in  a 
polynomial  time  when  k  is  fixed  [6], 

There  is  also  proposed  an  approximate  algorithm  for 
the  ^-clustering  problem  with  Var^  as  an  intra-cluster 
metric.  Hasegawa,  Imai,  Inaba,  Katoh  and  Nakano  [7] 
gave  an  0(n*^+^ )-time  algorithm  for  fixed  d  whose  worst- 
case  ratio  is  2.  This  algorithm  solve  the  A:-clustering 
problem  with  constraining  the  representative  point  of 
each  cluster  to  be  one  of  points  in  the  cluster. 

Our  results:  In  this  paper,  the  /i-clustering  problem 
under  the  intra-cluster  criterion  Var"  with  a  =  1, 2  is 
treated  in  a  unified  way  by  characterizing  the  optimum 
solution  to  the  A:-clustering  problem  by  the  ordinary 
Voronoi  diagram  and  the  weighted  Voronoi  diagrams 
with  both  multiplicative  and  additive  weights. 

With  this  framework,  the  problem  is  related  to  the 
generalized  primary  shutter  function  for  the  Voronoi 
diagrams,  which  is  roughly  the  number  of  parti¬ 
tions  of  n  points  in  the  d-dimensional  space  induced 
by  the  Voronoi  diagram  generated  by  k  generator 
points.  The  primary  shutter  function  of  the  Eu¬ 
clidean  Voronoi  diagram  is  shown  to  be  and 

that  for  the  Voronoi  diagram  with  additive  and  multi¬ 
plicative  weights  Based  on  these,  the  k- 

clustering  problem  for  n  points  in  the  d-dimensional 
space  with  a  variance-based  criterion  can  be  solved  in 
Q(^(d+2)fc+i)  'j'jjjg  greatly  improves  the  previous 

bound  0{nP^^^  ^).  We  have  thus  given  a  polynomial¬ 
time  algorithm  for  the  case  of  fixed  k,  but  its  degree  is 
large  even  for  moderate  values  of  d  and  k. 

To  develop  a  practically  useful  2-clustering  algorithm, 
for  the  problem  with  the  most  typical  intra-cluster  crite¬ 
rion  of  the  sum  of  squared  errors,  we  present  an  efficient 
randomized  algorithm  which,  roughly  speaking,  finds  an 
e- approximate  2-clustering  in  0(n(l/e)^)  time,  which  is 
quite  practical  and  may  be  used  to  real  large-scale  prob¬ 
lems  such  as  the  color  quantization  problem.  In  the 
analysis,  a  fact  that  this  intra-cluster  cost  has  its  statis¬ 
tical  meanings  by  definition  is  used.  This  randomized 
algorithm  can  be  easily  generalized  to  the  fc-clustering 


problem. 

2  A  unified  approach  to  the 
vfiriance-based  /c-clustering  by 
weighted  Voronoi  diagrams 

The  variance-based  ^-clustering  problem  is  described  as 
follows: 

k 

inin{  Var“(5'j)  |  ^-clustering  (-Si, . . . ,  Sk)  of  -S  } 
i=i 

In  [7],  a  parametric  characterization  was  given  for  the 
case  of  a  =  2  (all-pairs  case)  by  using  a  general  paramet¬ 
ric  technique  for  minizing  quasiconcave  functions  de¬ 
veloped  by  Katoh  and  Ibaraki  [12],  which  enabled  us 
to  characterize  an  optimal  2-clustering  for  o;  =  2  by 
means  of  higher-order  Voronoi  diagram,  and  to  obtain 
a  pseudo  polynomial  approximation  scheme  for  the  2- 
clustering  problem  for  Var^  and  the  maximum  function 
as  the  inter-cluster  metric. 

In  this  paper,  we  concentrate  on  the  case  where  the 
summation  function  is  adopted  for  the  inter-cluster  cri¬ 
terion,  and  give  a  more  direct  characterization  for  the 
problem  with  a  =  1,2. 

We  may  make  use  of  partitions  of  n  points  induced 
by  weighted  Voronoi  diagrams.  Consider  k  points 
qj  =  (Hj)  in  the  d-dimensional  space  with  multiplicative 
weight  i/j  and  additive  weight  aj  (j  =  1, . . . ,  fc).  Define 
the  Voronoi  region  'VoT{qj)  of  qj  by 
k 

Vor{qj)  =  P|{p  =  (x)  I 
1=1 

^j\\^  -  (^j\?  +  -  M;|P  +  O';  } 

For  any  point  in  Vor(gj),  qj  is  the  closest  point  among 
qi  (/  =  l,...,k)  with  respect  to  the  weighted  distance. 
^or(gj)  {j  —  l,...,k)  partitions  the  space,  which  is 
called  the  weighted  Voronoi  diagram  generated  by  these 
k  points  qj.  When  aj  =  0  and  vj  —  1  (j  =  1,...,A;), 
this  weighted  Voronoi  diagram  reduces  to  the  ordinary 
Euclidean  Voronoi  diagram. 

By  the  Voronoi  diagram  generated  by  these  k 
weighted  points,  n  points  in  the  given  set  S  are  nat¬ 
urally  partitioned  into  k  clusters  (we  here  ignore  the 
case  in  this  definition  where  a  point  in  S  is  equidistant 
from  two  points  among  these  k  weighted  points).  We 
call  this  partition  a  Voronoi  partition  of  n  points  in  S  by 
k  weighted  generators.  Apparently,  not  all  fc-clusterings 
are  Voronoi  partitions.  In  fact,  we  can  characterize  op¬ 
timal  ^-clusterings  by  the  Voronoi  partition.  The  case 
of  a  =  1  is  well  known,  and  we  here  state  only  the 
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theorem. 

Theorem  1  ([2,  7,  15])  Suppose  that  is 

an  optimum  k-clustering  for  the  k-clustering  problem 
with  Var^  (a  =  1)  as  the  intra-cluster  metric.  Then, 
an  optimum  k-clustering  is  a  Voronoi  partition  by  the 
ordinary  Euclidean  Voronoi  diagram  for  k  points  qj  — 
in-)  (m*=x(5;)).  □ 

Now,  we  prove  the  following  theorem  for  the  case  of 
Var^,  i.e.,  all-pairs  sum  of  squared  errors. 

Theorem  2  Suppose  that  (5i,...,5*)  is  an  optimum 
k-clustering  for  the  k-clustering  problem  with  Var^  (a  = 
2)  as  the  intra-cluster  metric.  Then,  an  optimum  k- 
clustering  is  a  Voronoi  partition  by  the  weighted  Voronoi 
diagram  for  k  points  =  (A*j)  (Mj  ~  multi¬ 

plicative  weight  Uj  =  [511  and  additive  weight  aj  defined 

Proof:  First,  observe  the  following  relation. 

=  \Sj\ •  l|x  -  ®(‘5j)|p  +  ||xi  -  x(5j)|P 

Pi€Sj 

where  it  should  be  noted  that  ~  ~ 

Now,  suppose  that,  in  the  weighted  Voronoi  diagram 
above,  a  point  p,  €  S]  is  not  contained  in  Vor(gj) 
for  qj  =  (x(5|)),  and  is  in  Vor(gj-).  Then,  moving 
Pi  from  Sj  to  Sj,,  the  total  cost  is  strictly  reduced 
from  the  above  formula  (note  that  Var^  is  the  all-pairs 
sum  of  squared  errors),  which  contrridicts  the  optimal¬ 
ity  of  (5* , . . . ,  SI).  Hence,  each  pj  €  S]  is  contained  in 
Vor(gj)  for  j  =  1, . . . ,  k,  and  the  theorem  follows.  □ 

By  Theorem  1  and  Theorem  2,  the  variance- based 
fc-clustering  problem  with  a  =  1,  2  can  be  solved  by 
enumerating  all  the  Voronoi  partitions  of  n  points  gen¬ 
erated  by  k  weighted  points,  and  finding  a  partition  with 
minimum  one. 

The  number  of  distinct  Voronoi  partitions  has  strong 
connection  with  the  generalized  primary  shutter  func¬ 
tion  for  fc-label  space  introduced  by  Hasegawa  [6,  8] 
which  has  applications  in  computational  learning  the¬ 
ory.  We  here  analyze  this  number  by  showing  that 
Voronoi  partitions  are  duals  of  arrangements  of  alge¬ 
braic  surfaces  in  the  0(dfc)-dimensional  space. 

Theorem  3  The  number  of  Voronoi  partitions  of  n 
points  by  the  Euclidean  Voronoi  diagram  generated  by 
k  points  in  the  d-dimensional  space  is  0(n‘^^),  and  all 


the  Voronoi  partitions  can  be  enumerated  in  0{n^'^'^^) 
time. 

Proof:  In  the  ordinary  Voronoi  diagram,  all  multiplica¬ 
tive  weights  are  one  and  all  additive  weights  are  zero, 
i.e.,  Vj  =  1,  aj  -  0  {j  -  l,...,k).  Parameters  are  Hj 
(j  =  l,...,k).  Consider  the  (dA;)-dimensional  vector 
space  consisting  of  fj,  with  pi  =  (^j,  >  b'-k)- 

(dfc)-dimensional  space,  we  can  define  an  equivalence 
relation  among  points  such  that  two  points  are  in  the 
equivalence  relation  if  their  corresponding  Voronoi  par¬ 
titions  are  identical.  The  equivalence  relation  produces 
a  subdivision  of  this  space  into  equivalence  classes. 

For  each  pair  of  distinct  and  pij^  among  (ij 
(j  =  l,...,k)  and  each  point  p;  =  (®i)  among  pi 
(i  =  l,...,n),  consider  an  algebraic  surface  in  this  dk- 
dimensional  space  defined  by 

ii^E.  - -  iix.- -  Aij  jr = 0 
where  Xi  is  regarded  as  a  constant  vector.  The  num¬ 
ber  of  such  surfaces  is  nk{k  -  l)/2.  The  arrangement 
of  these  nk{k  -  l)/2  algebraic  surfaces  coincides  with 
the  subdivision  defined  by  the  equivalence  relation  from 
Voronoi  partitions.  The  number  of  Voronoi  partitions 
is  bounded  by  the  combinatorial  complexity  of  the  ar¬ 
rangement  of  nk{k -l)/2  constant-degree  algebraic  sur¬ 
faces,  and  the  theorem  follows.  □ 

A  further  detailed  analysis  about  the  primary  shut¬ 
ter  function  of  the  fc- Voronoi  space  is  done  by  Ishiguro 
[llj.  In  that  paper,  the  linearization  technique  is  ap¬ 
plied  in  the  analysis,  and  an  algorithm  using  hyperplane 
arrangements  is  given  based  on  it.  Also,  an  application 
of  the  primary  shutter  function  to  learning  an  unknown 
fc- Voronoi  space  from  examples  is  shown. 

Theorem  4  The  number  of  Voronoi  partitions  of  n 
points  by  the  weighted  Voronoi  diagram  generated  by  k 
weighted  points  with  a  multiplicative  weight  and  an  ad¬ 
ditive  weight  in  the  d-dimensional  space  is 
and  all  the  Voronoi  partitions  can  be  enumerated  in 
time.  □ 

The  proof  is  similar,  and  is  omitted  in  this  version. 
Combining  these  results,  we  have  the  following. 

Theorem  5  (1)  The  k-clustering  problem  for  Vas^  can 
be  solved  in  0(ra‘^*‘'‘^)  time. 

(2)  The  k-clustering  problem  for  Var^  can  be  solved 
in  time. 

It  should  be  noted  that,  from  the  linear  separability 
or  circular  separability,  of  an  optimum  2-clustering  for 
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Var^  and  Var^,  respectively,  as  sho-wn  by  [2,  7],  the  lb- 
clustering  problem  for  Var'  and  Var^  is  readily  seen  to 
be  solvable  in  and  time, 

respectively,  using  similar  arguments  in  [3]  for  the  diam¬ 
eter  and  radius.  Only  with  the  linear/circular  separa¬ 
bility  for  the  2-clustering,  an  algorithm  of  order 
may  be  best  possible  for  the  A;-clustering  problem.  Our 
algorithms  run  in  time,  and  improve  the 

bound  greatly.  This  becomes  possible  by 
the  fine  characterization  of  optimum  ^-clusterings  by 
the  weighted  Voronoi  diagram,  and  by  evaluating  the 
primary  shatter  function  of  the  weighted  Voronoi  parti¬ 
tions  in  a  tighter  manner. 

3  Randomized  algorithms  for 
the  case  of  the  sum  of  squared 
errors 

The  results  in  the  previous  section  are  interesting  from 
the  theoretical  viewpoint,  and  the  time  complexity  is 
polynomial  when  k  is  considered  as  a  constant.  How¬ 
ever,  even  for  k  =  3,4,5,  its  polynomial  degree  is 
quite  high,  which  makes  it  less  interesting  to  implement 
the  algorithms  for  practical  problems  such  as  the  color 
quantization  problem.  The  fc-clustering  problem  is  NP- 
complete  in  general  when  k  is  regarded  as  a  variable, 
and  in  this  respect  the  results  are  best  possible  we  may 
expect  to  have. 

To  develop  a  practically  useful  algorithm,  utilizing 
randomization  may  be  a  good  candidate,  since  the  intra¬ 
cluster  metric  we  are  using  has  its  intrinsic  statistical 
meanings.  In  this  section,  we  develop  randomized  algo¬ 
rithms  for  the  A:-clustering  problem  with  Var^ ,  the  sum 
of  squared  error,  as  the  intra-cluster  metric. 

In  this  extended  abstract,  we  mainly  consider  the  2- 
clustering  problem  with  Var\  but  most  of  the  follow¬ 
ing  discussions  carry  over  to  the  fc-clustering  problem. 
First,  let  us  consider  how  to  estimate  Var'(5)  for  the 
set  5  of  n  points  pi  =  (a:,)  {i  —  l,...,n)  by  random 
sampling.  Let  T  be  a  set  of  m  points  obtained  by  m 
independent  draws  at  random  from  5.  If  the  original 
point  set  5  are  uniformly  located,  (n/{m  -  l))Var*(T) 
may  be  a  good  estimate  for  Var^(5).  However,  this  is 
not  necessarily  the  case.  For  example,  suppose  that  a 
point  Pi  in  S  is  far  from  the  other  n  -  1  points  in  5, 
and  the  other  n-1  points  are  very  close  to  one  another. 
Then,  Var^(5)  is  nearly  equal  to  the  squared  distance 
between  p,-  and  a  point  in  5  —  {p,},  while  with  high 
probability  Var^(r)  is  almost  zero.  This  indicates  that 
Var^(r)  cannot  necessarily  provide  a  good  estimate  for 


\ar\S). 

On  the  other  hand,  the  centroid  x{T)  of  T  is  close  to 
the  centroid  x(5)  of  S  with  high  probability  by  the  law 
of  large  numbers,  and  we  obtain  the  following  lemma. 

Lemma  1  With  probability  1  —  b, 

\\x{T)-x{S)f 

Proof:  First,  observe  that 

E{x(T))  =  x{S),  E(||x(T)  -  x(5)|n  =  ^Var“(5) 

and  then  apply  the  Markov  inequality  to  obtain  the  fol¬ 
lowing. 

Pr(||x(r)  -  x(5)||2  >  ^Var“(5))  <  5.  □ 


Lemma  2  With  probability  1  —  8, 

^||x.-x(T)||2<(l+^)Vari(5). 

Proof:  Immediate  from  Lemma  1  and  the  following. 
^  ||x.  -x(T)||2  =  Vari(5)-f|5M|x(r)-x(5)||2.  □ 

Pi€S 

Thus,  we  can  estimate  Var^(5)  by  random  sam¬ 
pling.  For  the  2-clustering  problem,  we  have  to  estimate 
Var*(5i)  and  Var^(52)  for  a  2-clustering  (5i,  ^2)  by  es¬ 
timating  the  centroids  of  and  82-  Now,  consider  the 
following  algorithm. 

A  randomized  algorithm  for  the  2-clustering: 

1.  Sample  a  subset  T  of  m  points  from  5  by  m  inde¬ 
pendent  draws  at  random; 

2.  For  every  linearly  separable  2-clustering  (Ti,T2)  of 
T,  execute  the  following; 

Compute  the  centroids  ti  and  t2  of  Ti  and  T2, 
respectively; 

Find  a  2-clustering  (81,82)  of  8  by  dividing  8 
by  the  perpendicular  bisector  of  line  segment 
connecting  tj  and  t2; 

Compute  the  value  of  Var^(5i)-1-Var^(52)  and 
maintain  the  minimum  among  these  values; 

3.  Output  the  2-clustering  of  8  with  minimum  value 
above. 
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The  idea  of  this  randomized  algorithm  is  to  use  all 
pairs  of  centroids  of  linearly  separable  2-clusterings  for 
the  sampled  point  set  T.  Let  {8^,82)  be  an  optimum 
2-clustering  of  8  for  Var\  and  let  Sj  and  be  the 
centroids  of  8^  and  5^,  respectively.  By  considering 
all  linearly  separable  2-clusterings  for  T,  the  algorithm 
handles  the  2-clustering  {Tl.T^)  obtained  by  dividing  T 
by  the  perpendicular  bisector  of  line  segment  connecting 
Sj  and  82-  Then,  from  the  centroids  of  T{  and  T2,  we 
obtain  a  2-clustering  {8[,82)  in  the  algorithm. 

Since  T  is  obtained  from  m  independent  draws, 

E(ir;i)  =  ^I5;i  (j  =  i,2). 

From  Lemma  2,  Var^(5p  can  be  estimated  by  using 
|Tj|.  The  sizes  |Tj|  (j  =  1,2)  are  determined  by  inde¬ 
pendent  Bernoulli  trials,  and  is  dependent  on  the  ratio 
of  |5i|  and  For  the  sampling  number  m,  we  say 

that  8  is  f{m) -balanced  if  there  exists  an  optimum  2- 
clustering  (5*,  5^)  with 

—  min{  |5i|,  IS2I  }  >  /(m), 
n 

and  the  optimum  2-clustering  is  called  an  /(m)- 
balanced  optimum  2-clustering.  We  then  have  the  fol¬ 
lowing. 

Lemma  3  Suppose  there  exists  a  (log^  m) -balanced  op¬ 
timum  2-clustering  {81,82)-  Then,  with  probability 
2 

1 - -j-T-  for  a  constant  (0  <  /?  <  1),  the  follow- 

mP 

ing  holds. 

min{  |r;|,  \T2*\ }>  (1  -  /?)f  min{  18^,  152*1  } 

>  (1  —  /?)  logm. 

771 

Proof:  Set  p,'  =  —  min{|5*|,  |52|}.  For  m  indepen- 

dent  Bernoulli  trials  X\,  X2,  ■  ■  ■ ,  Xm  with  Pr(Wi  =  1)  = 
p' fm  <  Pr(Xi  =  0)  =  1  -  p' /m,  the  Chernoff  bound 
implies,  for  X  =  Xi  -|- - h  Xm, 

Pr(X  <  (1  -  S)p')  <  exp(-M'/3V2). 


From  the  cissumption, 

exp(-/x'/3^/2)  <  exp(-(logm)/?^/2) 


1 


Theorem  6  8uppose  that  the  point  set  8  is  f{m)- 
balanced  with  f{m)  >  logm.  Then,  the  randomized  al¬ 
gorithm  finds  a  2- clustering  whose  total  value  is  within 

a  factor  ofl-\-  - ^r-77 — r  to  the  optimum  value  with 

6{l-p)f{m) 

2  .  .. 


probability  1  —  6 


time. 


Proof:  From  Lemmas  2  and  3,  with  probability  1  -  6  - 

2 

i=i  pi^s'. 

i=i 

holds.  Furthermore,  the  left  hand  side  is  bounded  from 
below  by  Z]j=i  (5j)5  whose  value  is  computed  in 
the  algorithm.  Hence,  the  minimum  value  found  in  the 
algorithm  is  within  the  factor. 

Concerning  the  time  complexity,  all  linearly  separable 
2-clusterings  for  T  can  be  enumerated  in  0{m‘^)  time. 
For  each  2-clustering  (Ti,T2)  of  T,  finding  a  pair  of 
centroids  and  a  2-clustering  of  8  generated  by  the  pair 
together  with  its  objective  function  value  can  be  done 
in  0(n)  time.  Thus  the  theorem  follows.  □ 

We  have  developed  a  randomized  algorithm  only  for 
the  2-clustering  problem  so  far,  but  this  can  be  directly 
generalized  to  the  fc-clustering  problem.  If  there  ex¬ 
ists  a  balanced  optimum  fc-clustering,  similar  bounds 
can  be  obtained.  It  may  be  noted  that  the  technique 
employed  here  has  some  connection  with  the  technique 
used  to  obtain  a  deterministic  approximate  algorithm 
with  worst-case  ratio  bounded  by  2  for  the  fc-clustering 
problem  in  [7]. 

The  above  theorem  assumes  some  balancing  condi¬ 
tion.  In  some  applications,  a  very  small  cluster  is  use¬ 
less  even  if  its  intra-cluster  is  small.  For  example,  when 
we  apply  a  2-clustering  algorithm  recursively  in  a  top- 
down  fashion  to  solve  the  fc-clustering  problem,  a  bal¬ 
ancing  condition  on  2-clusterings  may  be  imposed  to  2- 
clustering  subproblems  so  that  the  sizes  of  subproblems 
may  become  small  quickly  and  the  total  clustering  may 
have  nicer  properties.  In  such  a  case,  the  randomized  cd- 
gorithm  naturally  ignores  such  small-size  cluster.  Also, 
for  the  case  of  finding  a  good  and  balanced  2-clustering, 
we  have  only  to  apply  a  slightly  modified  version  of  the 
randomized  algorithm  directly.  This  is  typical  for  the 
clustering  problem  in  VLSI  layout  design.  See,  for  ex¬ 
ample,  Kernighan  and  Lin  [13].  Generalizing  Theorem 
6  for  such  cases,  we  have  the  following. 

Theorem  7  For  the  problem  of  finding  an  optimum  2- 
clustering  among  {'fm) -balanced  2-clusterings  for  a  con¬ 
stant  7,  the  randomized  algorithm  finds  a  2-clustering 
which  is  almost  at  least  {•ym) -balanced  and  whose  value 
is  within  a  factor  ofl-\-0{l/{6m))  to  the  optimum  value 
of  this  problem  with  probability  1  —  6  for  small  6.  □ 

In  the  proof  of  this  theorem,  we  use  results  concern¬ 
ing  the  e-net  and  e-approximations.  The  details  will  be 
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given  in  the  full  version.  We  here  restate  the  theorem 
in  a  rough  way  so  that  its  essence  can  be  understood 
direclty. 

Corollary  1  When  there  is  an  optimum  2-clustering 
{81,82)  of  8  such  that 

l^ri,  |52l>7|5|  =  n 

for  some  constant  7  (0  <  7  <  1/2),  the  randomized 
algorithm,  which  samples  m  points  from  8,  finds  a  2- 
clustering  whose  expected  value  is  l  +  0(l/m)  with  high 
probability.  O 

Corollary  2  For  the  problem  of  finding  an  optimum 
2-clustering  among  2-clusterings  {81,82)  of  8  such  that 

l^il,  |52|  >  7|5|  =  n 

for  some  constant  7  (0  <  7  <  1/2),  the  randomized 
algorithm,  which  samples  m  points  from  8,  can  be  easily 
modified  to  find  an  approximate  balanced  2-clustenng 
whose  expected  value  is  l+0(l/m)  with  high  probability. 
□ 

On  the  other  hand,  if  very  small  clusters  with  small 
intra-cluster  metric  should  be  found,  we  may  enumer¬ 
ate  such  small  clusters  deterministically  or  in  a  random¬ 
ized  manner,  since  the  number  of  such  small  clusters  is 
relatively  small.  For  the  2-clustering  problem  in  the 
two-dimensional  case,  the  number  of  linearly  separable 
2-clustering  such  that  one  cluster  consists  of  at  most  k' 
points  is  0{k'n)  and  can  be  enumerated  efficiently.  By 
enumerating  A:'-sets  for  an  appropriate  value  of  k',  we 
obtain  the  following  theorem. 

Theorem  8  The  2-clustering  problem  for  n  points  in 
the  plane  with  Var'  as  the  intra-cluster  metric  can  be 
solved  in  0(n^/^(logn)^)  time  with  the  approximation 
ratio  within  a  factor  of  1  -\-  0(1/ log  n)  with  probability 
1  —  0(l/logn). 

Proof:  Set  m  =  n^/^logn.  For  this  m,  by  the  ran¬ 
domized  algorithm,  we  find  a  good  (log  m)^-balanced 
2-clustering.  Also,  by  the  deterministic  algorithm  enu¬ 
merating  (<  logn)-sets,  we  find  a  best  unbalanced 
2-clustering.  Setting  b  =  1/logm  and  /?  to  a  con¬ 
stant,  the  time  complexity  of  the  randomized  algorithm 
is  0(n(n^/^  logra)^)  =  0(n^/^(log7i)^)  and  the  approxi¬ 
mation  ratio  is  bounded  by  1  -|-  0(1/ log  m)  with  proba¬ 
bility  1  — 0(1/ log  m).  The  deterministic  algorithm  runs 
in  0(n(n^/^  logn)(logn)^)  =  0(n®/^(log  n)^)  time.  □ 

It  should  be  noted  that  the  time  complexity  in  this 
theorem  is  subquadratic,  compared  with  the  determin¬ 
istic  quadratic  exact  algorithm. 


4  Concluding  remcu*ks 

We  have  demonstrated  that  optimum  solutions  to  the 
variance- based  fc-clustering  can  be  characterized  by  the 
(weighted)  Voronoi  diagram  generated  by  k  points,  and 
have  evaluated  the  primary  shutter  function  of  the  k- 
Voronoi  space.  This  primary  shutter  function  can  be 
used  in  computational  learning  theory  in  learning  k- 
Voronoi  spaces. 

We  have  then  presented  a  simple  randomized  algo¬ 
rithm  for  the  /:-clustering  problem  with  Var^  as  an 
intra-cluster  metric.  This  algorithm  is  practically  useful 
when  k  is  small  and  balanced  A;-clusterings  are  prefer¬ 
able.  For  example,  for  the  problem  of  finding  an  op¬ 
timum  2-clustering  for  n  planar  points  among  almost 
completely  balanced  2-clusterings,  an  approximate  2- 
clustering  which  is  approximately  balanced  and  whose 
cost  is  within  a  factor  ofl-|-l/3  =  4/3on  the  average  to 
the  optimum  cost  can  be  found  by  sampling  10  points 
from  n  points  and  spending  O(lO^n)  =  0{n)  time  with 
probability  (^“)/2'°  w  0.89,  or  by  sampling  20 

points  and  spending  O(20^n)  =  0{n)  time  with  proba¬ 
bility  Ells  ( i  )/2^“  «  0.9996. 

The  randomized  algorithm,  however,  is  not  so  suit¬ 
able  to  find  a  good  unbalanced  2-clustering.  Also, 
although  the  randomized  algorithm  itself  is  valid  for 
large  k,  the  running  time  becomes  inherently  large  since 
the  primary  shutter  function  for  m  sampled  points  is 
0{m^^‘^^^).  This  implies  that  the  variance-based  k- 
clustering  problem  is  harder  in  such  a  situation  com¬ 
pared  with  the  diameter-  or  radius-based  A:-clustering 
to  which  a  good  and  fast  approximate  algorithm  with 
worst-case  ratio  2  is  known  [4]. 

Therefore,  to  solve  the  variance-based  /:-clustering  for 
large  k  practically,  say  for  k  =  256  of  the  typical  color 
quantization  problem,  we  may  apply  the  randomized  2- 
clustering  algorithm  proposed  in  this  paper  recursively 
in  a  top-down  manner  with  sampling  only  a  small  num¬ 
ber  of  points  at  each  stage  as  mentioned  above.  We  plan 
to  implement  this  approach  and  experimentally  evalu¬ 
ate  its  efficiency. 
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Abstract 

In  the  first  part  of  this  paper,  we  reduce  two  geometric 
optimization  problems  to  convex  programming;  find¬ 
ing  the  largest  axis-aligned  box  in  the  intersection  of  a 
family  of  convex  sets,  and  finding  the  translation  and 
scaling  that  minimizes  the  Hausdorff  distance  between 
two  polytopes.  These  reductions  imply  that  important 
cases  of  these  problems  can  be  solved  in  expected  linear 
time.  In  the  second  part  of  the  paper,  we  use  convex 
programming  to  give  a  new,  short  proof  of  an  interest¬ 
ing  Helly-type  theorem,  first  conjectured  by  Griinbaum 
and  Motzkin. 

1  Introduction 

Linear  programming  is  a  popular  tool  in  computa¬ 
tional  geometry.  A  related  problem  is 

Convex  Programming 

Input:  A  finite  family  H  of  closed  convex  sets  in 
E‘^  and  a  convex  function  f . 

Output:  The  minimum  of  f  over  fj  H ,  the  inter¬ 
section  ofH. 

Remember  that  a  function  f  :  E‘^  — ^  'E-  is  convex 
when  f{Xa  +  (1  -  X)b)  <  Xf(a)  -f  (1  -  A)/(6),  for 
all  a,h  E  E^  and  0  <  A  <  1. 
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The  elements  of  the  family  H  are  called  the  con¬ 
straints.  Like  a  linear  program,  a  convex  program 
has  a  single  global  minimum  which  is  determined 
by  a  subfamily  of  constraints.  And  under  reason¬ 
able  computational  assumptions  on  the  constraints 
and  the  convex  function,  fixed-dimensional  convex 
programming  can  be  solved  in  expected  linear  (in 
|.H'|)  time  by  any  of  the  randomized  combinato¬ 
rial  linear  programming  algorithms  of  [C90],  [S90], 
[MSW92].  These  algorithms  are  combinatorial  in 
the  sense  that  they  operate  by  searching  the  sub¬ 
families  of  constraints  for  one  that  defines  the  min¬ 
imum;  a  more  classic  example  of  a  combinatorial 
LP  algorithm  the  simplex  algorithm. 

We  give  two  new  reductions  from  geometric  op¬ 
timization  problems  of  practical  interest  to  con¬ 
vex  programming.  The  first  problem  is  finding  the 
largest  volume  axis-aligned  box  in  the  intersection 
of  a  family  K  of  convex  sets.  Like  the  ubiqui¬ 
tous  bounding  box,  this  hounded  box  can  be  used  in 
heuristics  to  approximate  a  more  complicated  vol¬ 
ume.  When  the  dimension  is  fixed  and  the  elements 
of  K  are  linear  halfspaces  (or  otherwise  easy  to 
compute  with),  this  reduction  implies  an  expected 
0(n)  time  algorithm,  n  =  \K\.  The  planar  case  of 
this  problem  was  raised  in  the  context  of  a  heuris¬ 
tic  for  packing  clothing  pattern  pieces  [DMR93]. 
They  give  an  0(na(n)logn)  algorithm  to  find  the 
maximum  area  axis-aligned  rectangle  in  any  sim¬ 
ple  polygon  in  E'^.  Our  reduction  may  be  useful 
for  higher  dimensional  packing  heuristics  as  well, 
and  also  when  objects  are  to  be  decomposed  into 
collections  of  axis-aligned  boxes  (eg.  in  ray  tracing, 
[AK89],pp  219-23). 

Our  second  reduction  concerns  some  particu¬ 
lar  cases  of  the  much-studied  problem  of  mini¬ 
mizing  the  Hausdorff  distance  between  two  ob- 


jects  under  a  group  of  transformations.  The  one- 
directional  Hausdorff  distance  from  a  set  to  a 
set  B,  is  the  maximum  distance  from 

any  point  in  A  to  the  nearest  point  in  B.  The 
Hausdorff  distance  between  A  and  B,  H{A,B),  is 
max{H{A,B),H{B,A)}.  The  HausdorflF  distance 
is  used  in  pattern  recognition  and  computer  vision 
as  a  a  measure  of  the  difference  in  shape  between 
the  two  sets;  typically  a  family  P  of  critical  points 
is  extracted  from  an  image  and  compared  with  a 
stored  template  A.  The  problem  is  to  choose  a 
transformation  from  some  family  which,  when  ap¬ 
plied  to  A,  minimizes  H{P,A)  or  H{P,A). 

For  convex  polytopes  A,  B  we  reduce  the  prob¬ 
lem  of  finding  the  translation  and  scaling  which 
minimizes  H{A,B)  to  a  convex  program.  In  the 
plane,  this  convex  program  can  be  solved  in  ex¬ 
pected  0{n)  time,  where  n  is  the  total  number 
of  vertices.  A  similar  convex  program,  motivated 
by  the  scenario  above,  finds  the  translation  which 
minimizes  the  H{P,A),  where  P  is  a  point  set 
and  A  is  a  convex  polytope.  Again  in  the  pla¬ 
nar  case,  this  convex  program  requires  expected 
mm{0(mn),0(nlgn -t- m)})  time,  where  n  =  |P| 
and  A  has  m  vertices. 

There  are  many  results  on  minimizing  the  Haus¬ 
dorff  distance  between  various  objects  under  differ¬ 
ent  groups  of  motions.  Most  of  them  solve  more  dif¬ 
ficult  problems  and  require  more  time.  For  planar 
point  sets  P,  Q,  an  algorithm  of  [HKS91]  finds  the 
translation  that  minimizes  H{P,Q)  in  0(mn(m-l- 
n)lg(mra))  time,  where  n  =  |P|,m  =  \Q\.  Allow¬ 
ing  rotation  as  well,  an  algorithm  of  [CGHKKK94] 
minimizes  P(P,  Q)  in  0(m^n^(m-t-n)  Ig^  mn)  time. 
Algorithms  for  measuring  the  Hausdorff  distance 
for  fixed  polygons,  allowing  no  transformations 
at  all,  require  O(nlgn)  time  for  simple  polygons 
[ABB91],  and  0(n)  for  convex  polygons  [A83].  The 
algorithms  implied  by  our  reductions  are  also  com¬ 
paratively  simple  and  implementable. 

We  turn  from  these  practical  issues  to  a  theoreti¬ 
cal  question  from  combinatorial  geometry.  We  give 
a  new  proof  of  the  theorem  of  [Mo73]: 

Theorem  1.1  Let  Cj  be  the  family  of  all  sets  in 
consisting  of  the  disjoint  union  of  at  most  k 
closed  convex  sets.  Let  I’j  C  be  a  subfamily 
with  the  special  property  that  it  is  closed  under  in¬ 
tersection.  Then  the  intersection  of  I ^  is  nonempty 


if  and  only  if  the  intersection  of  J  is  nonempty,  for 
all  J  CI^  with  \J\  <  k{d-\-l). 

This  is  called  a  Helly-type  theorem  because  of  its 
combinatorial  structure:  the  whole  family  of  sets 
has  nonempty  intersection  if  and  only  if  all  of 
its  constant-size  subfamilies  do.  There  are  many 
Helly-type  theorems;  it  is  appealing  to  think  that 
there  is  some  fundamental  topological  property  un¬ 
derlying  them  all.  This  theorem  is  interesting  be¬ 
cause  it  suggests  this  fundamental  property  might 
be  that  the  intersection  of  every  subfamily  is  some¬ 
how  homologically  of  constant  complexity. 

Griinbaum  and  Motzkin  conjectured  Theorem 
0.1  [GM61],  and  proved  the  case  k  =  2,  using  a 
more  general  axiomatic  structure  in  place  of  con¬ 
vexity.  The  case  fc  =  3  was  proved  by  barman 
[L68].  Morris  settled  the  conjecture  in  his  thesis. 
His  proof,  however,  is  quite  long  (69  pages)  and 
involved,  and  a  better  proof  has  been  called  for 
[E93]. 

Using  convex  programming,  we  give  a  short  and 
insightful  proof.  Our  approach  is  to  introduce  a 
function  /,  and  then  show  that  the  problem  of  min¬ 
imizing  /  over  a  family  belongs  to  the  cleiss  GLP 
(for  Generalized  Linear  Programming).  Informally, 
GLP  is  the  class  of  problems  which  can  be  solved 
by  combinatorial  LP  algorithms.  This  is  already 
interesting,  as  an  example  of  a  problem  which  is 
combinatorially  similar  to  LP  although  geometri¬ 
cally  the  intersection  of  the  constraints  fails  not 
only  to  be  convex,  but  even  to  be  connected.  The¬ 
orem  0.1  follows  by  applying  an  easy  theorem  from 
[A93],  that  there  is  a  Helly-type  theorem  about  the 
constraint  set  of  every  GLP  problem. 

2  Setup 

In  this  section  we  give  some  background  on  con¬ 
vex  programming,  GLP,  and  the  combinatorial  LP 
algorithms. 

First  we  give  the  formal  definition  of  GLP,  us¬ 
ing  an  abstract  framework  due  to  Sharir  and  Welzl 
[SW92].  A  GLP  problem  is  a  pair  {H,w),  where 
i7  is  a  family  of  constraints,  generally  sets,  and 

:  2^  — A  is  a  function  which  takes  a  subfamily 
of  constraints  to  an  element  of  a  totally  ordered 
set  A.  A  has  a  special  maximum  element  fl.  When 
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w{G)  <  Q,  G  is  feasible,  and  otherwise  G  is  infea¬ 
sible. 

In  convex  programming,  for  example,  w{G),  for 
G  C  H,  is  the  minimum  of  /  over  flG')  and  if 
RG  =  0,  then  w{G)  =  In  order  to  ensure 
that  w  is  defined  on  every  subfamily  of  constraints, 
we  may  have  to  enclose  the  problem  in  a  compact 
“bounding  box”.  We  say  that  w  is  the  objective 
function  inducedhy  f. 

{H,  w)  is  a  GLP  problem  if 

1.  For  allFCGCH:  w{F)  <  w{G) 

2.  For  3.11  F  C  G  C  H  such  that  w{F)  =  w{G)  and 
for  each  h  £  H: 

w{F  +  h)  >  w{F)  if  and  only  if  w{G  +  h)  >  w{G) 
(by  F  +  h,  we  mean  F  U  {h}) 

A  basis  is  a  subfamily  G  C  H  such  that  w{G-h)  < 
for  all  h  £  G.  The  combinatorial  dimension 
of  a  GLP  is  the  maximum  cardinality  of  any  feasi¬ 
ble  basis. 

Every  convex  program  meets  Condition  1,  since 
adding  more  constraints  to  a  problem  can  only  in¬ 
crease  the  minimum.  It  is  also  well-known  that 
a  d-dimensional  convex  program  has  combinatorial 
dimension  d;  there  is  a  formal  proof  in  [A93’]. 

But  notice  that  it  is  not  the  case  that  every  con¬ 
vex  program  is  a  GLP  problem,  since  it  may  fail 
to  satisfy  Condition  2.  Condition  2  is  always  sat¬ 
isfied,  however,  when  the  minimum  of  /  over  the 
intersection  of  every  subfamily  G  C  if  is  achieved 
by  exactly  one  point.  This  observation  implies  that 
the  following  cases  of  convex  programming  are  al¬ 
ways  GLP  problems  of  combinatorial  dimension  d. 

A  function  /  :  E‘^  — >  7?.  is  strictly  convex  when 
/(Aa  -f  (1  —  X)b)  <  A/(a)  -|-  (1  —  X)f(b),  for  all 
a.b  £  and  0  <  A  <  1. 

Strictly  convex  programming 

Input:  A  family  H  of  compact  convex  subsets  of 
E'^ ,  and  a  strictly  convex  function  f . 

Output:  The  minimum  of  f  over{^H. 

The  minimum  is  achieved  at  one  unique  point  since 
any  point  x  on  the  the  line  segment  between  any 
two  feasible  points  y,  z  with  /(y)  =  f{z)  has  f(x)  < 

fiy)  =  fiz). 

The  lexicographic  function  f  :  E^  FF  takes  a 
point  to  it’s  coordinates  x  =<  Xi, . . . ,  >,  which 

are  totally  ordered  so  that  x  >  y  if  xi  >  yi,  or  if 
=  Vi  and  X2  >  y2,  and  so  on. 


Lexicographic  convex  programming 

Input:  A  family  H  of  compact  convex  subsets  of 
E‘^ ,  and  the  lexicographic  function  f . 

Output:  The  minimum  of  f  over  r\H. 

The  minimum  is  certainly  achieved  at  a  single 
point,  since  each  point  has  a  unique  value.  But 
this  is  not,  strictly  speaking,  a  convex  program¬ 
ming  problem,  since  /  is  not  a  function  into  F. 
We  observe,  however,  that  /  is  related  to  the  lin¬ 
ear  function 

y,(x)  =  xi  -I-  ex2  -f  e^xs  -f  . . .  -|-  e'^'-^Xd 

for  infinitessimally  small  e.  To  make  this  relation¬ 
ship  precise,  we  adopt  the  terminology  of  [M94], 
and  say  that  an  objective  function  u  is  a  refinement 
of  a  function  w  when,  for  F,G  C  H,  w{G)  >  w{F) 
implies  v{G)  >  v{F). 

Let  w  and  be  the  objective  functions  induced 
by  /  and  y^,  respectively.  For  any  finite  family  H 
of  constraints,  there  is  some  e  small  enough  so  that 
V(  is  a  refinement  of  w.  Note  that  there  may  be 
no  e  small  enough  that  v^{F)  =  v^iG)  whenever 
w{F)  =  w{G),  ais  illustrated  in  figure  ??.  Finally, 


Figure  1:  Minima  are  different  under 

we  observe  that  if  a  function  w  has  some  refine¬ 
ment  V  such  that  {H,  v)  meets  Condition  1  and  has 
combinatorial  dimension  d,  then  so  does  {H,w). 
So  Lexicographic  Convex  Programming  is  a  GLP 
problem  of  combinatorial  dimension  d,  since  {H,  vj 
is  a  convex  program  and  so  meets  Condition  1  and 
has  combinatorial  dimension  d,  and  (if,  w)  meets 
Condition  2  as  well. 

The  computational  requirement  under  which  any 
of  the  combinatorial  LP  algorithms  can  be  applied 
to  any  GLP  problem  is  that  there  is  a  subroutine 
available  for  the  following  problem: 

Basis  computation 
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Input:  A  basis  G  and  a  constraint  h. 

Output:  A  basis  G'  C  G  +  h  such  that  w{G')  = 
w[G  +  /i) . 

This  operation  corresponds  to  a  pivot  step  in  the 
simplex  algorithm.  In  d-dimensional  convex  pro¬ 
gramming,  a  basis  computation  minimizes  /  over 
f|G,  where  |G|  <  d  A  1.  When  a  basis  compu¬ 
tation  can  be  done  in  constant  time,  then  any  of 
[C90],  [S90],  [MSW92]  require  expected  0(n)  time, 
where  n  =  \H\.  All  of  these  algorithms  have  been 
implemented  for  LP,  and  the  algorithm  in  [S90]  has 
also  been  applied  to  the  particular  convex  program 
of  finding  the  smallest  ball  enclosing  a  family  of 
points  in  [W91]. 

3  Bounded  boxes 

In  this  section  we  prove 

Theorem  3.1  Finding  the  largest  volume  axis- 
aligned  box  in  the  intersection  of  a  family  K  of  n 
convex  bodies  in  E‘^  is  a  strictly  convex  program  in 
E^'^  with  2‘^n  constraints. 

Proof  sketch:  We  parameterize  an  axis-aligned 
box  by  a  pair  of  vectors  x,  a  G  EA.,  where  Xi, . . . ,  x^ 
are  the  coefficients  of  the  lexicographically  mini¬ 
mum  vertex  of  the  box,  and  ai , . . . ,  are  positive 
offsets  in  each  coordinate  direction.  This  parame- 
teriztion  defines  a  space  of  boxes,  in  which  we  will 
construct  a  convex  program. 

For  each  convex  body  C  £  K,  we  will  define  2*^ 
constraints,  one  for  each  box  vertex.  Note  that  a 
box  is  contained  in  a  convex  body  C  if  and  only  if 
all  of  its  vertices  are.  Let  us  label  the  vertices  with 
0-1  vectors  in  the  natural  way,  so  that  (0, . . . ,  0)  is 
the  lexicographic  minimum  corner  of  the  box.  The 
set  of  boxes  for  which  vertex  u  is  contained  in  C  is 
/i  =  {x,a|x-|-(u(8)a)GC}  (here  (g)  is  coordinate- 
wise  multiplication,  and  -I-  is  translation).  This  is 
convex. 

To  prevent  the  largest  volume  box  determined 
any  subproblem  from  being  unbounded,  we  require 
the  bounded  box  to  be  contained  in  a  very  large 
bounding  box  which  is  guranteed  to  contain  f]  K. 
This  adds  one  more  convex  constraint  to  the  prob¬ 
lem  in  the  space  of  boxes. 

It  remains  to  show  that  maximizing  the  volume 
of  the  box  corresponds  to  minimizing  some  convex 


function  over  fl-ff-  The  volume  of  a  box  (x,a), 
negated,  is  given  by 

d 

5(a)  =  -II®'' 

t  =  l 

with  all  the  a,-  constrained  to  be  positive.  This  is 
not  a  convex  function,  but 

d  d 

f{a)  =  -log(J|a.)  =  -^log(a,) 

1=1  i=l 

is  a  strictly  convex  function.  Minimizing  /  over 
f\H  is  &  strictly  convex  programming  problem  in 

Eid 

a 

When  the  elements  of  K  are  linear  halfspaces,  then 
the  constraints  are  as  well.  When  the  elements  of 
K  are  of  constant  complexity,  so  that  a  basis  com¬ 
putation  requires  0(1)  time  for  fixed  d,  the  largest 
volume  axis-aligned  box  can  be  found  in  expected 
0(n)  time.  Note  that  /  and  g  are  minimized  at 
the  same  point,  so  the  basis  computation  may  be 
implemented  using  g. 

4  HausdorfF  distance 

Now  we  consider  the  problem  of  finding  the  trans¬ 
lation  and  scaling  which  minimize  the  Hausdorff 
distance  between  two  convex  polytopes  A  and  B  in 
E'^.  The  boundary  of  the  set  of  points  at  distance 
A  from  A  (that  is,  the  Minkowski  sum  of  A  with 
the  closed  disk  of  radius  A  centered  at  the  origin) 
is  called  the  X-offset  surface.  H{B,A)  <  A  when 


Figure  2:  Offset  surface 

every  vertex  of  B  lies  within  the  A-offset  surface  of 
A. 

We  can  think  of  the  scaling  and  translation 
transformations  as  being  applied  to  A  alone.  We 
define  a  (d  4-  2)-dimensional  transformation  space 
in  which  the  coordinates  of  each  point  represent  a 
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d-dimensional  translation  vector  r  to  be  applied  to 
A,  a  scale  factor  a  to  be  applied  to  A,  and  an  offset 
distance  A.  For  any  point  b  E  B,  let  be  the 

subset  of  transformation  space  such  that  h  is  within 
the  A-offset  surface  of  the  homothet  of  A  scaled  by 
cr  and  translated  by  r.  Similarly,  let  be  the 

set  of  translations  and  scalings  of  A  which  put  a 
point  a  E  A  inside  the  A-offset  surface  of  B. 

Theorem  4.1  The  scaling  and  translation  that 
minimizes  the  Hausdorff  distance  between  two  poly¬ 
topes  in  can  be  found  by  a  lexicographic  convex 
program  in  . 

Proof:  Let  F (A)  and  V (B)  be  the  vertex  sets  of 
A  and  B,  respectively.  Consider  a  vertex  b  E  V{B). 
Fixing  CT  =  1  and  A  =  0,  we  find  that  the  set  of 
translations  of  A  which  cover  6,  ,  is  itself  a 

translate  of  the  convex  set  —A.  Allowing  a  to  vary, 
we  find  that  is  a  cone  over  —A,  also  convex, 

into  the  direction  of  the  a  coordinate.  Finally,  we 
allow  A  to  vary  as  well.  Notice  that,  as  A  varies, 
the  set  of  disks  of  radius  A  centered  at  the  origin 
forms  a  convex  cone,  C.  i®  Minkowski 

sum  of  with  C.  This  is  convex  because  the 

Minkowski  sum  of  convex  bodies  is  convex.  Each 
vertex  of  B  produces  one  such  convex  constraint. 

Now  consider  a  vertex  a  E  V{A).  Again 
is  convex,  this  time  a  translate  of  —B.  As  a  varies, 
A  scales,  and  vertex  a  moves  in  some  direction  v, 
along  line  a  +  av.  So  o)  i®  ^  convex  cylinder 
over  —B,  and  ^  ^®  ^h®  Minkowski  sum  of  the 

cylinder  with  C. 

We  use  the  objective  function  given  by  the  lexi¬ 
cographic  function  /  on  the  transformation  space, 
where  A  is  the  most  significant  coordinate,  followed 
by  a  and  then  the  d  coordinates  of  the  transla¬ 
tion  r.  The  minimum  point  in  the  intersection  of 
the  constraints  ^^nd  with  respect  to 

/,  represents  a  scaling  and  translation  which  min¬ 
imizes  the  Hausdorff  distance  between  A  and  B. 

□ 

In  the  plane,  the  most  important  case  for  ex¬ 
isting  applications,  we  can  implement  this  convex 
program  so  that 

Theorem  4.2  The  scaling  and  translation  which 
minimizes  the  Hausdorff  distance  between  two  poly¬ 
gons  in  the  plane  can  be  found  in  expected  0{n) 
time,  where  n  =  \V (A)|  -|-  \  V (5)|. 


Proof:  We  redefine  each  polygon  as  the  intersec¬ 
tion  of  pieces  of  constant  complexity,  and  associate 
each  piece  with  a  subset  of  the  vertices  of  the  other 
polygon.  Each  piece  is  the  infinite  wedge  formed 
by  a  vertex  of  one  of  the  polygons  and  the  rays  sup¬ 
porting  the  adjacent  sides,  which  we  shall  call  an 
angle.  If  every  vertex  of  A  is  within  the  A-offset  sur¬ 
face  of  every  angle  from  B,  and  visa  versa,  H {A,  B) 
is  no  greater  than  A.  This  gives  a  GLP  with  O(n^) 
constraints,  pairing  every  vertex  of  A  with  every 
angle  from  B,  and  visa  versa. 

We  get  a  linear  number  of  constraints  by  noting 
that  for  every  angle  a  from  B,  all  of  A  is  within  the 
A-offset  surface  of  a  if  every  vertex  from  a  critical 
subset  of  V (A)  is.  The  faces  of  B  divide  the  cir- 


Figure  3:  Intervals  on  the  circle  of  normals 

cle  of  normal  directions  into  a  family  of  closed 
intervals,  each  interval  corresponding  to  an  angle. 
A  vertex  u  of  A  is  critical  for  an  angle  a  if  u  is 
extremal  in  A  for  any  direction  in  the  interval  in¬ 
duced  by  a.  The  face  normals  of  A  also  divide  the 
circle  of  normal  directions  into  a  family  of  closed 
intervals,  each  corresponding  to  the  set  of  direc¬ 
tions  in  which  a  particular  vertex  of  A  is  extremal. 
When  the  interval  corresponding  to  a  vertex  v  in¬ 
tersects  the  interval  corresponding  to  an  angle  a, 
V  is  extremal  for  a.  The  critical  subsets  for  every 
angle  can  be  found  by  merging  and  Is  in  linear 
time.  Each  of  the  n  intersections  of  an  interval  in 
I A  with  an  interval  in  Ib  gives  a  vertex-angle  pair 
which  produces  a  constraint.  We  also  construct  the 
n  constraints  induced  by  angles  of  A  and  vertices 
of  5. 

The  lexicographic  minimum  point  in  the  inter¬ 
section  of  any  four  constraints  can  be  found  in  con¬ 
stant  time,  so  we  get  an  expected  0(n)  time  algo¬ 
rithm. 

□ 
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We  now  turn  our  attention  to  the  problem  of 
minimizing  the  one-directional  Hausdorff  distance 
H (P,  A)  from  a  set  of  P  points  to  a  convex  poly¬ 
tope  A.  We  can  always  scale  A  so  that  it  is  large 
enough  to  cover  all  the  points,  making  H{P,A) 
zero.  When  the  only  transformation  allowed  is 
translation,  however,  we  find 

Theorem  4.3  The  translation  that  minimizes  the 
one- directional  Hausdorff  distance  from  a  set  P 
of  points  to  a  convex  polytope  A  in  E‘^  can  be 
found  by  a  lexicographic  convex  program  in  . 
For  d  =  2,  we  can  solve  the  problem  in  expected 
mm{0(mn),  0(n Ig  n -f  m)})  time,  where  n  =  (P] 
and  m  =  |F(A)|. 

Proof  sketch:  If  the  0{mn)  term  is  less  than 
the  0(nlgn  -f-  m)  term  (few  vertices  and  many 
points),  we  construct  a  convex  program  in  which 
each  point  in  P  produces  n  constraints  of  constant 
complexity,  similar  to  those  in  the  previous  reduc¬ 
tion.  If  not,  we  compute  the  convex  hull  of  P  and 
use  a  one-directional,  translation-only  modification 
of  the  previous  construction. 

Note  that  the  Hausdorff  distance  used  in  this 
section  can  be  derived  from  any  metric  on  E'^,  not 
just  L^. 

5  A  new  proof  of  an  interesting 
Helly-type  theorem 

In  this  section  we  will  use  an  easy  but  powerful  the¬ 
orem  from  [A93] ,  which  gives  us  a  simple  technique 
for  proving  Helly-type  theorems. 

Theorem  5.1  Let  {H,w)  be  a  GLP  problem  with 
combinatorial  dimension  d.  The  intersection  of  H 
is  nonempty  if  and  only  if  the  intersection  of  every 
G  C  H  with  |G|  <  d-\-l  is  nonempty. 

Let  be  the  family  of  all  sets  in  consisting 
of  the  disjoint  union  of  at  most  k  closed  convex 
sets.  A  family  I  of  sets  is  intersectional  if,  for  ev¬ 
ery  P  C  7,  Pi  P  G  I.  Gj  is  not  intersectional.  But 
consider  some  subfamily  Ij  C  C\  which  is  intersec¬ 
tional.  This  may  “just  happen”  to  be  true,  or  7^ 
may  be  intersectional  for  some  geometric  reason. 
For  example  consider  of  any  family  J  of  sets  like 
the  one  in  figure  4  where  each  set  is  a  pair  of  balls 
of  diameter  5,  separated  by  a  distance  of  at  least  8, 


Figure  4:  Generating  family  of  7^ 


kind  of  like  dumbbells.  The  family  formed  by  tak¬ 
ing  the  intersection  of  every  subfamily  of  J  forms 
an  intersectional  family  Ij,  since  every  intersection 
consists  of  at  most  two  convex  components. 

Morris  [Mo73]  proved  the  following 

Theorem  5.2  Any  intersectional  family  7^  C  G^ 
has  Helly  number  k{d  -I-  1). 

We  use  Theorem  4.1  to  give  a  new  short  and  intu¬ 
itive  proof.  Given  any  finite  family  P  C  7*,  we  con¬ 
struct  a  GLP  problem  with  P  as  the  constraints. 
Let  /  be  the  lexicographic  objective  function  on 
E‘^,  and  define  w{G)  =  min{/(x)  |  x  E  PlG},  for 
all  G  C  P.  Since  minima  are  identified  with  points, 
we  will  speak  of  w(G)  as  a  point  and  ignore  the  fine 
distinction  between  the  point  x  itself  and  the  value 
fix). 

Theorem  5.3  Finding  w{H)  is  a  GLP  problem  of 
combinatorial  dimension  k{d  -4- 1)  —  1. 

Proof:  It  is  easy  to  see  that  the  problem  satis¬ 
fies  Condition  1,  since  adding  constraints  to  a  sub¬ 
problem  can  only  increase  the  minimum.  And  is 
satisfies  Condition  2  since  every  value  of  wiG)  is 
identified  with  a  unique  point,  which  is  either  in  or 
out  of  h. 

Recall  that  the  combinatorial  dimension  is  the 
largest  cardinality  of  any  basis  B  such  that  f|-B 
is  nonempty.  We  will  count  the  constraints  in  any 
basis  B  by  carefully  removing  selected  constraints 
one  by  one,  while  building  up  a  subfamily  S  of 
“sacred”  constraints  which  may  not  be  removed  in 
later  steps. 

We  will  maintain  two  invariants.  The  first  is  that 
w{B  —  h)  <  w{B)  for  all  h  E  B  —  S.  The  second 
invariant  is  that  for  all  h  E  B  —  S,  the  minimum 
point  w{B  —  h)  lies  in  a  different  convex  component 
of  n(.S  —  h)  from  the  point  w{B). 

First  we  choose  the  subfamily  S  so  that  the  in¬ 
variants  are  true  initially.  Since  0)  there 
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is  a  minimum  point  w(B)  in  some  convex  compo¬ 
nent  of  n  -S-  Each  h  E  B  is  the  disjoint  union 
of  convex  sets;  for  each  h,  the  point  w{B)  is  con¬ 
tained  in  exactly  one  of  them.  Call  this  convex  set 
Ch,  and  let  C  =  {Ch  \  h  £  B}.  The  pair  {C,w) 
is  a  lexicographic  convex  programming  problem,  a 
GLP  problem  of  combinatorial  dimension  d,  with 
w{C)  =  w{B).  So  C  must  contain  a  basis  Be  with 
\Bc\  <  d.  We  set  5  =  {/i  G  B  I  C/i  G  Be]- 

How  does  this  ensure  the  invariants?  Since  B 
is  a  basis,  the  first  invariant  holds  for  any  subset 
S.  The  second  invariant  holds  because  all  the  con¬ 
straints  which  contributed  a  convex  component  to 
Be  are  in  5,  and  for  any  h  E  B  —  S,  w{B  —  h)  < 
w{B)  =  w{Be)-  That  is,  since  the  point  w{B)  is 
the  lowest  point  in  H-Bc,  and  w{B  —  h)  is  lower 
than  w{B),  the  point  w{B  —  h)  cannot  be  in  fl  Be, 
and  hence  must  be  in  a  different  convex  component 

ofn(5-/i). 

Now  we  turn  our  attention  to  selecting  a  con¬ 
straint  to  remove  from  B.  We  use  the  fact  that  all 
the  points  w{B  —  h)  are  distinct,  for  all  h  G  B  —  5. 
This  is  true  because  the  point  w{B  —  h)  ^  h,  so 
that  for  any  other  h'  £  B,  since  h  G  {B  —  h'), 
w{B  —  h)  ^  fj(5  —  h').  Since  the  w(B  —  h)  are 
distinct,  there  is  some  E.  B  —  S  such  that 

w{B  —  hmax)  >  w{B  —  h)  for  all  other  h  £  B  —  S. 

So  consider  removing  hmax  from  B.  Since  w{B  — 
h)  <  w(B  —  hmax),  for  any  other  h  £  B  —  S,  cer¬ 
tainly  w{B  -  h  -  hmax)  <  w{B  -  hmax)-  So  the 
first  invariant  is  maintained  for  B  —  hmax  and  S. 
To  re-establish  the  second  invariant,  we  have  to  add 
more  elements  to  S.  We  do  this  in  the  same  way  as 
before,  by  finding  the  at  most  d  constraints  which 
determine  the  minimum  of  the  convex  component 
containing  w{B  —  hmax)-  We  add  these  constraints 
to  5,  and  set  B  =  B  —  hmax- 

We  iterate  this  process,  selecting  constraints  to 
remove  from  B  and  adding  constraints  to  S,  un¬ 
til  H  —  5  is  empty,  that  is,  B  =  S.  We  now  show 
that  each  removed  constraint  h  accounts  for  at  least 
one  convex  component  Ch  in  H  S.  Removing  h 
from  B  caused  a  new  minimum  point  w{B  —  h) 
to  be  created.  This  point  was  the  minimum  point 
in  some  convex  component  Ch  of  fj(-B  ~  h).  We 
added  the  constraints  determining  w(B  —  h)  to  S, 
so  w{B  —  h)  has  to  remain  the  minimum  point  in  its 
convex  component,  throughout  the  rest  of  the  pro¬ 
cess.  This  ensures  that,  although  Ch  may  later  be¬ 


come  part  of  some  larger  component,  it  will  never 
become  part  of  a  larger  component  with  a  lower 
minimum  point.  Each  subsequent  component  cre¬ 
ated  by  the  removal  of  another  constraint  from  B 
will  in  fact  have  a  lower  minimum  point,  so  the 
component  containing  w{B  —  h)  must  remain  dis¬ 
tinct  from  all  later  components.  Thus  every  re¬ 
moved  constraint  h  will  account  for  at  least  one 
distinct  component  in  fj  S. 

Since  is  an  intersectional  family,  no  subfamily 
of  constraints  can  have  more  than  k  convex  com¬ 
ponents  in  its  intersection.  Since  fj  B  was  initially 
nonempty,  we  started  with  at  lea.st  one  convex  com¬ 
ponent,  and  at  most  d  constraints  in  S.  No  more 
than  A:  —  1  constraints  were  removed,  and  each  con¬ 
straint  removed  added  at  most  d  constraints  to  S. 
So  the  total  size  of  \B\  <  (A:  — 1)-|-|5|  <  A:(d-f  1)  — 1. 

□ 

Theorems  4.1  and  4.3  together  imply  Theorem 
4.2. 
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Abstract 

Let  JF  be  a  collection  of  n  d-variate,  possibly  par¬ 
tially  defined,  functions,  all  algebraic  of  some  con¬ 
stant  maximum  degree.  We  present  a  randomized 
algorithm  that  computes  the  vertices,  edges,  and  2- 
faces  of  the  lower  envelope  (i.e.,  pointwise  minimum) 
of  T  in  expected  time  for  any  e  >  0.  For 

d  =  3,  by  combining  this  algorithm  with  the  point  lo¬ 
cation  technique  of  Preparata  and  Tamassia,  we  can 
compute,  in  randomized  expected  time  0(n^+^),  for 
any  e  >  0,  a  data  structure  of  size  0(n^+^)  that, 
given  any  query  point  q,  can  determine  in  0(log^  n) 
time  whether  q  lies  above,  below  or  on  the  envelope. 
As  a  consequence,  we  obtain  improved  algorithmic 
solutions  to  many  problems  in  computational  geom¬ 
etry,  including  (a)  computing  the  width  of  a  point 
set  in  3-space,  (b)  computing  the  biggest  stick  in 
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a  simple  polygon  in  the  plane,  and  (c)  computing 
the  smallest-width  annulus  covering  a  planar  point 
set.  The  solutions  to  these  problems  run  in  time 
Q(j^i7/ii+e),  f-Qj.  £  >  0,  improving  previous  so¬ 
lutions  that  run  in  time  We  also  present 

data  structures  for  (i)  performing  nearest-neighbor 
and  related  queries  for  fairly  general  collections  of 
objects  in  3-space  and  for  collections  of  moving  ob¬ 
jects  in  the  plane,  and  (ii)  performing  ray-shooting 
and  related  queries  among  n  spheres  or  more  gen¬ 
eral  objects  in  3-space.  Both  of  these  data  structures 
require  storage  and  preprocessing  time,  for 

any  e  >  0,  and  support  polylogarithmic-time  queries. 
These  structures  improve  previous  solutions  to  these 
problems. 

1  Introduction 

Let  T  —  {/i,  . . .  ,  /„}  be  a  collection  of  n  d-variate, 
possibly  partially  defined,  functions,  all  algebraic  of 
some  constant  maximum  degree  h  (and  if  they  are 
partially  defined,  their  domains  of  definition  are  also 
described  each  by  a  constant  number  of  polynomial 
equalities  and  inequalities  of  maximum  degree  b). 
Abusing  the  notation  slightly,  we  will  not  distinguish 
between  a  function  and  its  graph.  The  lower  enve¬ 
lope  Ejr  of  T  is  defined  as  Eyr{-x.)  =  min/i(x),  where 
the  minimum  is  taken  over  all  functions  of  E  that  are 
defined  at  x.  The  minimization  diagram  Mjr  of  E  is 
the  decomposition  of  into  maximal  connected  re¬ 
gions  (of  any  dimension),  called  cells,  so  that  within 
each  cell  the  same  subset  of  functions  appears  on  the 
envelope  Eyr.  (A  more  detailed  definition,  treating 
also  the  case  of  partially-defined  functions,  is  given 
in  [28].)  The  combinatorial  complexity  of  Myr  and  of 
Ejr  is  the  number  of  faces  of  all  dimensions  in  Mjr. 

Recently,  there  has  been  significant  progress  in  the 
analysis  of  the  combinatorial  complexity  of  lower  en¬ 
velopes  of  multivariate  functions  [19,  28].  In  partic¬ 
ular,  it  was  shown  in  [28]  that  the  maximum  com- 
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plexity  of  Mj^  is  0(n'^+^),  for  any  £  >  0,  where  the 
constant  of  proportionality  depends  on  £,  d  and  b. 
This  result  almost  settles  a  major  open  problem  and 
has  already  led  to  many  applications  [6, 19,  28].  How¬ 
ever,  less  progress  has  been  made  on  the  correspond¬ 
ing  algorithmic  problem,  which  calls  for  the  efficient 
construction  of  the  lower  envelope  of  such  a  collec¬ 
tion  T,  in  time  for  any  £  >  0.  The  ideal 

output  of  an  algorithm  that  computes  the  envelope 
is  a  data  structure  of  size  0(n^+®),  which  can  return, 
for  a  given  point  x  £  the  identity  of  the  func- 
tion(s)  attaining  the  envelope  at  x  in  logarithmic  (or 
polylogarithmic)  time.  Weaker  solutions  might  pro¬ 
vide  just  an  enumeration  of  the  cells  of  Mjr,  each  cell 
augmented  with  its  description  as  a  semialgebraic  set, 
and/or  with  a  representative  point  lying  in  it,  and 
possibly  include  explicit  adjacency  structure,  repre¬ 
senting  all  pairs  of  cells  where  one  cell  is  contained  in 
the  closure  of  the  other. 

The  paper  [28]  presents  an  algorithm  for  computing 
the  lower  envelope  of  bivariate  functions,  having  the 
properties  listed  above,  which  runs  in  time 
for  any  £  >  0.  Other  algorithms  with  a  similar  per¬ 
formance  are  given  in  [9,  16].  The  simplest  solution 
to  this  problem,  involving  a  deterministic  divide-and- 
conquer  algorithm,  was  recently  presented  in  [3].  All 
these  algorithms  facilitate  point  location  queries  of 
the  sort  described  above.  Unfortunately,  these  meth¬ 
ods  fail  in  higher  dimensions.  For  example,  the  tech¬ 
nique  of  [28]  relies  on  the  existence  of  a  vertical  de¬ 
composition  of  the  minimization  diagram  Mr  of  a 
sample  R  oi  r  functions  of  R,  into  a  small  number 
(that  is,  0(r‘^+^))  of  cells  of  constant  descriptional 
complexity.  Such  decompositions  exist  (and  are  easy 
to  compute)  for  d  =  2,  but  their  existence  in  higher 
dimensions  is  still  an  open  problem. 

In  this  paper  we  present  an  efficient  algorithm  for 
computing  the  vertices,  edges,  and  two-dimensional 
faces  of  the  lower  envelopes  of  d-variate  functions 
having  the  properties  assumed  above.  We  can  use 
this  algorithm  to  compute  the  entire  lower  envelope 
of  a  collection  of  tri-variate  functions,  which  has  all 
the  desired  characteristics;  in  particular,  it  supports 
queries  for  determining  whether  a  given  point  lies 
above  or  below  the  envelope.  The  algorithm  by¬ 
passes  the  problem  of  having  to  construct  small-size 
vertical  decomposition  by  applying  the  technique  of 
Preparata  and  Tamassia  [26]  for  point  location  in  cer¬ 
tain  types  of  3-dimensional  subdivisions.  This  allows 
us  to  use  a  coarser  decomposition  of  the  minimization 
diagram,  whose  size  is  close  to  cubic. 

Several  recent  papers  [4,  11,  23]  have  studied  a  va¬ 
riety  of  geometric  problems  whose  solution  calls  for 
the  construction  of,  and  searching  in,  lower  or  up¬ 


per  envelopes  in  4-space.  These  applications  fall  into 
two  main  categories:  preprocess-for-queries  problems, 
which  call  for  the  construction  of  such  an  envelope, 
to  be  queried  repeatedly  later,  and  batched  problems, 
where  all  the  queries  are  known  in  advance,  and  the 
goal  is  to  produce  the  answers  to  all  of  them  effi¬ 
ciently.  We  will  present  some  of  these  applications, 
as  listed  in  the  abstract,  where  our  new  algorithm  for 
computing  lower  envelopes  in  four  dimensions  leads 
to  improved  solutions. 

The  paper  is  organized  as  follows.  In  Section  2 
we  present  a  general  efficient  randomized  technique, 
which  we  believe  to  be  of  independent  interest,  for 
computing  all  the  0,  1,  and  2-dimensional  features 
of  lower  envelopes  in  any  dimension.  Then,  in  Sec¬ 
tion  3,  we  apply  this  algorithm  to  the  case  of  tri¬ 
variate  functions.  This  gives  us  an  initial  representa¬ 
tion  of  the  lower  envelope  of  such  a  collection,  which 
we  then  augment  by  additional  features,  resulting  in 
a  representation  suitable  for  the  application  of  the 
Preparata- Tamassia  technique.  In  Sections  4  and  5 
we  present  applications  of  our  result  to  a  variety  of 
problems  in  computational  geometry,  as  listed  in  the 
abstract. 

2  Lower  Envelopes  in  Arbitrary  Di¬ 
mension 

In  this  section  we  present  a  randomized  technique  for 
computing  a  partial  description  of  lower  envelopes  of 
d-variate  functions,  for  d  >  2,  whose  expected  run¬ 
ning  time  is  0(n‘^+^),  for  any  £  >  0.  This  technique 
only  computes  the  vertices,  edges,  and  2-faces  of  the 
envelope,  which  is  sufficient  for  the  construction  of 
envelopes  in  4-space,  as  will  be  explained  in  Section 

3  below. 

Let  T  —  {fi,  ■  ■  ■ ,  fn}  be  a  collection  of  (partial) 
d- variate  functions  in  satisfying  the  conditions 

described  in  the  introduction.  We  assume  that  the 
functions  of  R  are  in  general  position  (see  [28]  for 
more  details  and  for  a  discussion  of  this  assumption). 
To  compute  the  lower  envelope  Ej:  of  J- in  the  above 
sense,  we  proceed  as  follows.  Let  E  be  the  family 
of  all  (d  -  l)-subsets  of  T.  We  fix  a  subset  a  — 
e  E,  and  let 

m  =  {x  G  I  /i(x)  =  ■  •  •  =  /d-i(x)}  . 

Since  the  /j’s  are  assumed  to  be  in  general  position, 
is  a  two-dimensional  surface  (or  a  surface  patch). 
For  the  sake  of  simplicity,  we  assume  that  H'"  is  con¬ 
nected  and  a;  10:2 -monotone  (i.e.,  any  (d  —  2)-flat  or¬ 
thogonal  to  the  a;ia;2-plane  meets  H^  in  at  most  one 
point);  otherwise  we  decompose  it  into  a  constant 
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number  of  connected  portions  so  that  each  portion  is 
an  a;ia;2-monotone  surface  patch,  and  work  with  each 
patch  separately.  For  each  i>  d,  let 

7.  =  {x  e  n-  I  /,(x)  =  /i(x)  =  ■■■==  /d_i(x)}  ; 

7i  is  a  1-dimensional  curve,  which  partitions  into 
two  (not  necessarily  connected)  regions,  K^,  , 

where 

K+  =  {x  G  n'"  I  /i(x)  >  /i(x)  =  •  ■  ■  =  /d-i(x)} 
K-  =  {xGm  |/,(x)  </i(x)  =  ..--/d_i(x)}. 

Then  the  intersection  =  Hod  Kt  is  the  portion 
of  11'^  over  which  the  envelope  Ejr  is  attained  by  the 
functions  of  a.  The  algorithm  will  compute  the  re¬ 
gions  over  all  choices  of  {d  —  l)-tuples  a  of  func¬ 
tions,  thereby  yielding  all  the  vertices,  edges  and  2- 
faces  of  Myr  (because  of  the  general  position  assump¬ 
tion,  any  such  feature  must  show  up  as  a  feature  of 
at  least  one  of  the  regions  Q^). 

We  compute  using  a  randomized  incremental 
approach,  similar  to  the  ones  described  in  [12,  16, 
24,  25,  29].  Since  the  basic  idea  is  by  now  fairly 
standard,  we  only  give  a  brief  overview  of  the  al¬ 
gorithm,  and  refer  the  reader  to  [12,  29]  for  details. 
Let  \  d  <  i  <  n}.  We  first  compute  the 

set  (We  need  to  assume  an  appropriate  model 
of  computation,  in  which  any  of  the  various  prim¬ 
itive  operations  required  by  the  algorithm  can  be 
performed  in  constant  time.  For  example,  we  can 
assume  the  model  used  in  computational  real  alge¬ 
braic  geometry  [27],  where  each  algebraic  operation, 
involving  a  constant  number  of  polynomials  of  con¬ 
stant  maximum  degree,  can  be  performed  exactly,  us¬ 
ing  rational  arithmetic,  in  constant  time.)  Next,  we 
add  the  curves  7^  one  by  one  in  a  random  order,  and 
maintain  the  intersection  of  the  regions  Kf  for  the 
curves  added  so  far.  Let  {')d,ld+i,  ■■■  ,ln)  be  the 
(random)  insertion  sequence,  and  let  denote  the 
intersection  of  ,  . . .  ,  .  We  construct  and  main¬ 

tain  the  ‘vertical  decomposition’  Qf  of  Qf ,  defined  as 
the  partitioning  of  each  2-face  (t>  of  Qf  into  ‘pseudo¬ 
trapezoids’,  obtained  by  drawing,  from  each  vertex 
of  d  and  from  each  locally  xi-extreme  point  on  dcf),  a 
curve  within  0  orthogonal  to  the  a^i-axis,  and  extend 
it  (in  both  directions  if  necessary)  until  it  hits  dcj) 
again.  Each  pseudo-trapezoid  is  defined  by  at  most 
four  curves  of  r®";  conversely,  any  four  or  fewer  curves 
of  F*”  define  a  constant  number  of  pseudo-trapezoidal 
cells,  namely,  those  formed  along  11'^  when  only  these 
curves  are  inserted.  (Note  that  this  construction  is 
well-defined  since  11'^  is  an  a;iX2-monotone  surface.) 
In  the  (i  -t-  l)**-step  we  add  and  compute  Qf+i 
from  Qf ,  using  the  technique  described  in  [12], 


The  analysis  of  the  expected  running  time  of  the 
algorithm  proceeds  along  the  same  lines  as  described 
in  [12,  29].  We  define  the  weight,  w{t),  of  a  pseudo¬ 
trapezoid  T,  defined  by  the  arcs  of  r°',  to  be  the  num¬ 
ber  of  functions  /j,  hv  i  =  d,d  +  1, ...  ,n  (excluding 
the  up  to  4  functions  whose  intersections  with  XI'”  de¬ 
fine  t),  such  that  /^(x)  <  /i(x)  =  ■  •  •  =  /d-i(x)  for 
some  point  x  G  r. 

As  shown  in  [12],  the  cost  of  the  above  procedure 
is  proportional  to  the  number  of  pseudo-trapezoids 
that  are  created  during  the  execution  of  the  algo¬ 
rithm,  plus  the  sum  of  their  weights,  plus  an  over¬ 
head  term  of  0{n‘^),  needed  to  prepare  the  collec¬ 
tions  of  curves  7^  over  all  2-dimensional  intersection 
manifolds  IT'’'.  The  analysis  given  below  deals  only 
with  pseudo-trapezoids  that  are  defined  by  exactlj^ 
four  such  functions  (plus  the  d—\  functions  defining 
n'^),  and  easy  and  obvious  modifications  are  neces¬ 
sary  to  handle  all  other  pseudo-trapezoids. 

Let  T*”  denote  the  set  of  pseudo-trapezoids  (or 
‘cells’  for  brevity)  defined  by  four  arcs  of  F'^  (again, 
see  [12]  for  details),  and  let  T  =  Ucres^'^-  Each  cell 
in  T  is  defined  byd-l-|-4  =  d-t-3  functions  of  E. 
In  what  follows  we  implicitly  assume  that  the  speci¬ 
fication  of  a  cell  T  £  T  includes  the  d  +  3  functions 
defining  r,  where  there  is  a  clear  distinction  between 
the  first  d  —  1  functions  (constituting  the  set  a),  and 
the  last  four  functions  (defining  the  four  curves  that 
generate  t  along  n”^).  For  an  integer  A:  >  0  and  a 
subset  i?  C  JF,  let  Tk{R)  C  T  (resp.  T<k{R)  Q  T) 
denote  the  set  of  cells,  defined  by  d  -f-  3  functions  of 
R,  as  above,  with  weight  k  (resp.  at  most  k).  Let 
NkiR)  -  |T*.(i?)|  and  Nk{r)  =  m&XRNk{R),  where 
the  maximum  is  taken  over  all  subsets  i?  of  .F  of  size 
r.  Similarly,  we  define  N<k{R)  and  N<k{r).  Since 
each  cell  of  To{R)  lies  on  the  lower  envelope  of  R,  it 
follows  that  No{r)  =  Adapting  the  analysis 

technique  of  Clarkson  and  Shor  [15],  we  have 

Lemma  2.1  The  probability  that  a  cell  t  G  Tk{J-) 
is  created  during  the  incremental  construction  of  Q^ , 
where  a  £  E  is  the  tuple  for  which  contains  t,  is 

i/er). 

Lemma  2.2  For  any  0  <  k  <  n  and  for  any  e  >  0, 
N<k{n)  =  Oi{k  +  lf-^n^+^). 

For  a  cell  t  £  T,  let  Ar  be  the  event  that  t  £  Qf , 
for  the  tuple  cr  G  E  for  which  11°'  contains  t,  and 
for  some  d  <  i  <  n.  The  expected  running  time 
of  the  algorithm,  over  all  choices  of  (d  —  l)-tuples  of 
functions,  is  thus  proportional  to 

^(w(r)  -b  1)  ■  Pr[A^]  -b  0(n‘‘') 

rer 


.350 


=  E  E  {k+l)-Pr[A,]  +  Oin‘^) 

k>OreTk(T) 

fc=0  V  4  J 

where  the  last  inequality  follows  from  Lemma  2.1. 
Since  Nk{T)  =  N<k{^)  -  fV<(fc_i)(J'),  we  obtain, 
using  Lemma  2.2, 


-d-3 


E 

fc=0 


(fc+l)jVfc(^) 

('^r) 

hi  (fc  +  4)(fc  +  3)(fc  +  2) 

n  — d— 4 

0(n^+"+  5] 


k=l 

-d-4 


{k  +  4){k  +  3)ik  +  2)ik  +  l). 


fc=l 


We  thus  obtain: 


Theorem  2.3  The  vertices,  edges,  and  2-faces  of  the 
lower  envelope  ofn  (partial)  d-variate  functions,  sat¬ 
isfying  the  conditions  stated  in  the  introduction,  can 
be  computed  in  randomized  expected  time 
for  any  e  >  0. 


3  Envelopes  in  Four  Dimensions 

We  next  apply  the  results  of  the  preceding  section 
to  obtain  an  efficient  algorithm  for  constructing  the 
lower  envelope  of  a  collection  of  n  tri-variate  func¬ 
tions,  satisfying  the  assumptions  made  above,  in  the 
following  strong  sense:  One  can  preprocess  !F,  in  ran¬ 
domized  expected  time  for  any  e  >  0,  into  a 

data  structure  of  size  0(n^+^)  that  supports  queries 
of  the  form:  given  a  point  in  G  M^,  compute  the  func- 
tion(s)  attaining  Ejr  at  w;  each  query  can  be  an¬ 
swered  in  0(log^  n)  time. 

We  first  apply  the  algorithm  summarized  in  Theo¬ 
rem  2.3,  to  compute  the  vertices,  edges,  and  2-faces 
of  the  minimization  diagram  M^r  of  E.  We  next  par¬ 
tition  the  cells  of  Mjr  into  ‘monotone’  subcells,  in 
the  sense  of  Lemma  3.1  below,  to  obtain  a  refinement 
M'jr  of  Afjr.  We  then  use  the  point-location  technique 
of  Preparata  and  Tamassia  [26]  to  produce  the  data 
structure  representing  the  lower  envelope  in  the  above 
manner. 

We  define  and  construct  as  follows.  We  mark, 
along  each  2-face  F  of  Mjr,  the  locus  7f  of  all  points 
of  F  which  are  either  singular  or  have  a  vertical 
tangency  (in  the  z-direction).  The  arcs  -fp,  for  all 


2-faces  F,  lie  along  O(n^)  curves  in  each  be¬ 
ing  the  xi/z-projection  of  the  locus  of  all  singular 
points  or  points  with  ^-vertical  tangency  along  some 
2-manifold  fi  —  fj,  for  a  pair  of  indices  i  ^  j,  or 
along  the  boundary  of  some  fi.  We  consider  below 
only  the  former  case;  the  latter  case  can  be  han¬ 
dled  in  almost  the  same  (and,  actually,  simpler)  man¬ 
ner.  Let  S  be  one  of  these  curves.  We  consider  the 
2-dimensional  surface  Vs,  within  the  xyz-spa.ce,  ob¬ 
tained  as  the  union  of  all  lines  passing  through  points 
of  6  and  parallel  to  the  z-axis;  let  ,  Vf~  denote  the 
portions  of  Vs  that  lie,  respectively,  above  and  below 
8.  (Note  that  Vs  may  have  self-intersections  along 
some  vertical  lines,  along  which  V^  and  Vf~  are  not 
well-defined;  we  omit  here  details  of  the  (rather  easy) 
handling  of  such  cases.) 

Let  ^0  be  the  portion  of  8  over  which  the  functions 
fi  and  fj  attain  the  envelope  Fp.  Clearly,  80  is  the 
union  of  all  arcs  that  are  contained  in  8,  and  each 
such  edge  of  Fp  is  contained  in  some  connected  com¬ 
ponent  of  ^0)  so  the  number  of  connected  components 
in  5o>  over  all  intersection  curves,  is  0{n^'^^).  Let  w 
be  a  point  in  (5o-  Then  the  cell  c  of  Mp  lying  imme¬ 
diately  above  w  in  z-direction  is  such  that  within  c 
the  envelope  Ff  is  attained  by  either  fi  or  fj .  Thus 
the  upward-directed  ^-vertical  ray  emanating  from  w 
leaves  c  (if  at  all)  at  a  point  that  lies  on  a  2-manifold 
of  the  form  fi  =  fk  or  fj  =  fk-  For  fixed  i  and  j, 
there  are  only  0{n)  possible  2-manifolds  of  this  kind. 
We  compute  the  lower  envelope  of  these  0(n) 
2-manifolds,  restricted  to  V^ .  It  is  easily  seen  that 
the  complexity  of  E^^')  is  0(A,(n)),  for  some  constant 
q  depending  on  the  maximum  degree  and  shape  of 
these  2-manifolds;  A5(n)  is  the  maximum  length  of 
Davenport-Schinzel  sequences  of  order  q,  and  is  close 
to  linear  for  any  fixed  q  [3] .  We  next  take  the  portions 
of  the  graph  of  E^^^  that  lie  over  80,  and  “etch”  them 
along  the  corresponding  2-faces  of  Ep.  We  apply  a 
fully  symmetric  procedure  within  for  all  curves 
8.  The  overall  combinatorial  complexity  of  all  the 
added  curves  is  thus  0{n^Xq{n)  n^+^)  =  0{n^'^^), 

and  they  can  be  computed  in  0(n^‘*'^)  time,  as  is  eas¬ 
ily  verified. 

Let  M'jr  denote  the  refined  cell  decomposition  of 
obtained  by  adding  to  Mp  the  arcs  7^,  the  etched 
arcs  of  the  upper  and  lower  envelopes  in  the  vertical 
manifolds  Vs,  and  the  a:-vertical  walls  (i.e.,  union  of  all 
^-vertical  segments)  contained  in  Vs  and  connecting 
the  arcs  7^  to  the  etched  arcs.  As  just  argued,  the 
combinatorial  complexity  of  is  still  0(n^'^^),  and 
M'jr  has  the  following  crucial  property: 

Lemma  3.1  For  each  3- cell  c  of  M'jr,  every  con¬ 
nected  component  of  a  cross  section  of  c  by  a  plane 
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parallel  to  the  xz-plane  is  x-monotone. 

Let  M'jr{yo)  denote  the  cross  section  of  by  the 
plane  y  =  j/o-  Lemma  3.1  implies  that  M'jr{yo)  is  an 
x-monotone  planar  subdivision,  for  each  yo-  Hence,  if 
we  orient  the  edges  of  in  the  positive  x-direction, 
add  a  pair  of  nominal  points  s,t  at  x  =  -oo  and 
at  X  =  -1-00,  respectively,  and  connect  them  to  the 
appropriate  unbounded  edges  of  Af^(yo),  this  map 
becomes  a  planar  st-graph,  in  the  notation  of  [26]. 
(Note  that  the  vertical  decomposition  that  generates 
M'jr  from  Mjr  may  create  vertical  edges;  if  these 
edges  are  oriented  consistently,  say  in  the  upward  2- 
direction,  then  A/^  remains  an  st-graph.)  We  denote 
this  st-graph  also  by  A'/^(yo).  It  is  easy  to  verify  the 
following  statement. 

Lemma  3.2  Let  I  be  an  open  interval  of  the  y-axis 
that  does  not  contain  the  y-coordinate  of  any  vertex 
of  M'yr,  or  of  any  local  y-extremum  along  any  edge  of 
M'jr-  Then,  for  each  y\,y2  €  I,  the  st-graphs  M'yr{yi), 
are  isomorphic,  as  labeled  embedded  planar 
st-graphs. 

We  are  now  in  a  position  to  apply  the  point  lo¬ 
cation  technique  of  Preparata  and  Tamassia  [26]. 
They  show  that,  given  a  3-dimen.sional  subdivision  of 
combinatorial  complexity  N,  it  can  be  preprocessed 
in  time  0{N  log^  N)  into  a  data  structure  of  size 
0{N  log^  N),  which  supports  0(log^  iV)-time  point- 
location  queries  in  the  given  subdivision.  However, 
for  this  to  hold,  the  subdivision  must  have  the  prop¬ 
erties  that  (a)  (the  1-skeleton  of)  each  cross  section 
of  the  subdivision  by  a  plane  parallel  to  the  x^-plane 
is  a  planar  st-graph,  whose  faces  are  all  monotone  in 
the  x-direction  and  whose  edges  are  all  oriented  in 
the  positive  x-direction  (or,  for  vertical  edges,  in  the 
positive  x-direction),  and  (b)  at  each  point  where  the 
cross  section,  as  an  st-graph,  changes,  the  graph  can 
be  updated  by  a  constant  number  of  operations  from 
the  following  collection:  (1)  insertion  of  a  vertex  in 
the  middle  of  an  edge,  or  the  complementary  opera¬ 
tion  of  deleting  a  vertex  of  in-degree  and  out-degree 
1  and  replacing  the  two  incident  edges  by  one,  while 
maintaining  x-monotonicity  of  the  incident  faces;  (2) 
insertion  of  an  edge  partitioning  an  x-monotone  face 
into  two  x-monotone  subfaces,  or,  conversely,  deletion 
of  an  edge  and  merging  its  two  adjacent  faces,  pro¬ 
vided  their  union  remains  x-monotone;  and  (3)  merg¬ 
ing  two  adjacent  vertices  into  one  vertex  (collapsing 
the  edge  connecting  them),  or  splitting  a  vertex  into 
tw'o  vertices  (forming  a  new  edge  between  these  ver¬ 
tices),  again  maintaining  x-monotonicity. 

It  is  easily  verified  that,  indeed,  each  change  oc¬ 
curring  in  the  structure  of  the  cross  section  Myr{-), 


at  any  of  the  critical  y  values  in  Lemma  3.2,  can  be 
expressed  as  a  constant  number  of  operations  of  the 
types  mentioned  above. In  summary,  we  thus  obtain 
the  following  main  result  of  the  paper: 

Theorem  3.3  Let  J-  be  a  given  collection  of  n  tri¬ 
variate,  possibly  partially  defined,  functions,  all  al¬ 
gebraic  of  constant  maximum  degree,  and  whose  do¬ 
mains  of  definition  (if  they  are  partially  defined)  are 
each  defined  by  a  constant  number  of  algebraic  equal¬ 
ities  and  inequalities  of  constant  maximum  degree. 
Then,  for  any  e  >  0,  the  lower  envelope  Ejr  of  IF  can 
be  computed  in  randomized  expected  time  0(n^+'), 
and  stored  in  a  data  structure  of  size  0(n^+^),  so 
that,  given  any  query  point  re  G  we  can  compute 
Eyr[w),  as  well  as  the  function(s)  attaining  Ejr  at  w, 
in  0(log^  n)  time. 

4  Applications:  Query  Problems 

In  this  section  we  apply  Theorem  3.3  to  two  problems, 
involving  preprocessing  and  querying  certain  collec¬ 
tions  of  objects  in  3-space. 

4.1  Ray  shooting  amidst  spheres 

Given  a  collection  5  of  n  spheres  in  3-space,  we  wish 
to  preprocess  S  into  a  data  structure  that  supports 
ray-shooting  queries,  each  seeking  the  first  sphere, 
if  any,  met  by  a  query  ray.  This  problem  has  re¬ 
cently  been  studied  in  [2,  5,  23].  The  first  two  pa¬ 
pers  present  a  data  structure  that  requires  0(n^+^) 
storage  and  preprocessing,  for  any  e  >  0,  and  an¬ 
swers  a  query  in  time  0(log^  n).  The  third  paper  [23] 
gives  a  rather  elaborate  and  improved  solution  that 
requires  C>(n^+^)  preprocessing  and  storage,  for  any 
£  >  0,  and  answers  a  query  in  0{n^)  time.  These 
algorithms  use  the  technique  described  in  [1],  and  re¬ 
duce  the  problem  to  that  of  determining  whether  a 
query  segment  e  intersects  any  sphere  of  <5.  Then, 
using  a  multi-level  data  structure,  they  reduce  the 
problem  to  that  of  detecting  whether  the  line  A  con¬ 
taining  the  segment  e  intersects  any  sphere  of  S.  As 
observed  in  [5,  23],  this  problem  can  be  further  re¬ 
duced  to  point  location  in  the  upper  envelope  of  a 
set  of  certain  tri-variate  functions,  as  follows.  Let  tt 
be  the  plane  passing  through  A  and  orthogonal  to  the 
vertical  plane  V  passing  through  A.  Let  7r+  (resp.  7r~) 
be  the  half-space  lying  above  (resp.  below)  tt.  Let  S 
be  a  sphere  whose  center  lies  in  7r+  and  which  inter¬ 
sects  ty  in  a  disc  D.  Then  the  center  of  D  lies  above 
A,  so  either  A  intersects  S  or  passes  below  S,  in  the 
sense  that  A  and  S  are  disjoint  and  there  is  a  point 
on  A  that  lies  vertically  below  a  point  in  3.  A  similar 
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property  holds  if  the  center  of  S  lies  in  7r“ .  We  pre- 
process  the  centers  of  spheres  of  S  into  a  half-space 
range-searching  data  structure,  of  size  0(n^+^),  for 
a  query  A,  we  can  decompose  S  into  0(1)  canonical 
subsets,  so  that,  within  each  subset,  either  the  cen¬ 
ters  of  all  spheres  lie  in  7r+,  or  they  all  lie  in  tt”.  Let 
us  consider  the  case  when  the  centers  of  all  spheres 
lie  in  tt"*". 

Hence,  we  need  to  solve  the  following  subproblem: 
Given  a  set  <S  of  n  spheres  in  3-space,  and  given  a 
query  line  A  with  the  property  that  for  each  sphere 
S  £  S,  either  A  intersects  S  or  there  is  no  point  of 
S  lying  vertically  below  A,  determine  whether  A  in¬ 
tersects  any  sphere  of  <S.  We  reduce  this  problem 
to  point  location  in  the  lower  envelope  of  certain  tri¬ 
variate  functions,  as  follows.  We  can  parametrize  a 
line  A  in  3-space  by  four  parameters 
so  that  the  equations  defining  A  are  ?/  =  +  6, 

z  =  -b  ^4.  (We  assume  here  that  A  is  not  parallel 
to  the  y2:-plane;  such  lines  can  be  handled  in  a  dif¬ 
ferent,  and  much  simpler  manner.)  For  each  sphere 
S  €  S,  define  a  function  ^4  =  Fs(^i,^2)^3)i  so  that 
the  line  is  tangent  to  5  from  below 

{Fs  is  only  partially  defined,  and  we  put  Fs  =  +00 
when  it  is  undefined;  it  is  easily  checked  that  Fs  is 
algebraic  of  bounded  degree,  and  that  its  boundary 
is  also  algebraic  of  bounded  degree).  Let  $  be  the 
lower  envelope  of  the  functions  Fs,  for  S  G.  S.  Then  a 
query  line  A(4i,  ^2, 6:^4)  having  the  above  properties 
misses  all  spheres  of  S  if  and  only  if  ^4  <  ^(^i,^2)^3)- 
Thus,  by  Theorem  3.3,  one  can  answer  such  queries 
in  time  O(log^n),  using  0(n^+^)  preprocessing  time 
and  storage,  for  any  £  >  0.  Plugging  the  half-space 
range  searching  data  structure  and  the  point  location 
data  structure  into  the  multi-level  data  structure  de¬ 
scribed  in  [5,  23],  we  obtain  a  data  structure  for  the 
segment-emptiness  problem.  A  closer  analysis  of  this 
data  structure  shows  that  the  preprocessing  time  and 
storage  of  the  overall  data  structure  is  still 
for  any  e  >  0,  and  that  the  query  time  is  O(log^n). 
Finally,  plugging  this  data  structure  for  the  segment- 
emptiness  query  into  the  general  ray-shooting  tech¬ 
nique  by  Agarwal  and  Matousek  [1],  we  obtain  a  fi¬ 
nal  data  structure,  still  requiring  near-cubic  storage 
and  preprocessing,  using  which  one  can  answer  a  ray¬ 
shooting  query  in  time  0{\og^n).  That  is,  we  have 
shown: 

Theorem  4.1  A  set  S  of  n  spheres  in  can  be  pre- 
processed  in  randomized  expected  time  into 

a  data  structure  of  size  0(n^+^),  for  any  e  >  0,  so 
that  a  ray  shooting  query  can  be  answered  in  time 
0(log‘‘n). 


4.2  Nearest  neighbor  queries 

Let  S  =  {Si,  ...,Sn}  be  a  collection  of  n  pairwise- 
disjoint  convex  objects  (‘sites’)  in  3-space,  each  hav¬ 
ing  constant  description  complexity,  namely,  each  de¬ 
fined  by  a  constant  number  of  algebraic  equalities  and 
inequalities  of  constant  maximum  degree.  We  wish  to 
compute  the  Voronoi  diagram  Vor{S)  of  S,  and  pre- 
process  it  for  efficient  point  location  queries.  That  is, 
each  query  specifies  a  point  w  in  3-space  and  seeks 
the  site  of  S  nearest  to  w  (say,  under  the  Euclidean 
distance).  This  is  a  generalization  to  3-space  of  the 
classical  Post  Office  Problem. 

As  observed  in  [18],  the  problem  is  equivalent  to 
the  computation  of  the  following  lower  envelope  in  4- 
space.  For  each  S  G  S,  define  Fs{x,y,z)  to  be  the 
Euclidean  distance  from  {x,y,z)  to  S,  and  let  4>  be 
the  lower  envelope  of  these  functions.  Then,  given  a 
query  point  {x,y,z),  the  site(s)  S  G  S  nearest  to  w 
are  those  for  which  Fs  attains  $  at  {x,y,z).  Thus, 
by  Theorem  3.3,  this  can  be  done  using  0(n^+^)  pre¬ 
processing  time  and  storage,  for  any  e  >  0,  and  each 
query  can  be  answered  in  0(log^  n)  time.  (Note  that 
Theorem  3.3  is  indeed  applicable  here,  because  the 
functions  Fs  are  all  (piecewise)  algebraic  of  constant 
maximum  degree,  as  is  easy  to  verify  from  the  condi¬ 
tions  assumed  above.) 

This  is  a  fairly  general  framework,  and  admits  nu¬ 
merous  generalizations,  e.g.,  we  may  replace  the  Eu¬ 
clidean  distance  by  other  distances,  perform  queries 
with  objects  other  than  points  (as  long  as  the  location 
of  a  query  object  can  be  specified  by  only  3  real  pa¬ 
rameters;  this  is  the  case,  e.g.,  if  the  query  objects  are 
translates  of  some  rigid  convex  object),  etc.  An  inter¬ 
esting  generalization  is  to  dynamic  nearest-neighbor 
queries  in  the  plane,  where  each  object  of  S  moves 
along  some  given  trajectory,  and  each  query  {x,y,t) 
asks  for  the  object  of  S  nearest  to  the  point  {x,y) 
at  time  t.  Using  the  same  approach  as  above,  this 
can  be  done,  under  appropriate  assumptions  on  the 
shape  and  motion  of  the  objects  of  S,  with  0(71^"*"^) 
preprocessing  time  and  storage,  for  any  e  >  0,  and 
0(log^  n)  query  time. 

Remark;  There  is  a  prevailing  conjecture  that  the 
complexity  of  generalized  Voronoi  diagrams  in  3- 
space,  and  of  dynamic  Voronoi  diagrams  in  the  plane, 
is  only  near-quadratic,  under  reasonable  assumptions 
concerning  the  distance  function  and  the  shape  of  the 
sites  (and  of  their  motions).  This  was  indeed  proved 
recently  in  [14]  for  the  case  where  the  sites  are  lines 
in  3-space  and  the  distance  function  is  induced  by  a 
convex  polyhedron.  If  this  conjecture  is  established, 
then  the  above  algorithm,  of  near-cubic  cost,  is  far 
from  being  optimal,  and  will  need  to  be  improved 
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considerably. 

5  Applications:  Batched  Problems 

We  next  consider  batched  applications.  These  appli¬ 
cations  solve  a  variety  of  rather  unrelated  problems, 
but  they  all  involve  an  almost  identical  subproblem, 
where  lower  (or  upper)  envelopes  in  4-space  play  a 
role.  To  avoid  repetition,  we  describe  in  detail  only 
one  application,  and  then  state  the  improved  bounds 
for  the  other  applications  without  proof,  referring  the 
reader  to  the  relevant  literature. 

The  following  applications  are  based  on  the  para¬ 
metric  search  technique  of  Megiddo  [22],  which,  in 
our  case,  requires  a  parallel  algorithm  for  computing 
the  lower  envelope  of  a  set  of  surfaces.  However,  the 
algorithm  presented  above  is  highly  unparallelizable, 
because  it  uses  an  incremental  insertion  procedure  of 
regions  into  2-D  arrangements,  and  later  uses  a  plane- 
sweep  in  3-space.  To  finesse  this  difficulty,  we  apply 
the  parametric  search  technique  with  an  additional 
twist  that  avoids  the  need  to  use  a  parallel  algorithm. 

5.1  Width  in  3-space 

Let  5  be  a  set  of  n  points  in  3-space.  The  width  of  S  is 
the  smallest  distance  between  a  pair  of  parallel  planes 
so  that  the  closed  slab  between  the  planes  contains  S. 
In  two  dimensions,  the  problem  can  be  easily  solved  in 
O(nlogn)  time.  An  0(n^)-time  algorithm  for  three 
dimensions  was  presented  in  [21].  Recently,  Chazelle 
et  al.  [11]  presented  an  0(n®/®+^)-time  algorithm,  for 
any  s  >  0.  In  this  subsection,  we  present  an  improved 
randomized  algorithm,  whose  expected  running  time 
is 

Clearly,  it  suffices  to  compute  the  width  of  the  con¬ 
vex  hull  of  5,  so  assume  that  the  points  of  S  are  in 
convex  position,  and  that  the  convex  hull  P  of  5  is 
given.  It  is  known  that  any  two  planes  defining  the 
width  of  S  are  such  that  either  one  touches  a  face 
of  V  and  one  touches  a  vertex  of  V,  or  each  of  them 
touches  an  edge  of  V;  see  [11,  21].  The  first  case  can 
be  handled  in  0(nlog7i)  time  [11,  21].  The  difficult 
case  is  to  compute  the  smallest  distance  between  a 
pair  of  parallel  planes  supporting  V  at  two  ‘antipo¬ 
dal’  edges,  because,  in  the  worst  case,  there  can  be 
0(n^)  such  pairs  of  edges.  Chazelle  et  al.  [11]  pre¬ 
sented  an  0(n®/®+^)-time  algorithm  to  find  the  small¬ 
est  distance,  using  the  parametric  search  technique. 
We  will  present  a  randomized  algorithm,  whose  ex¬ 
pected  running  time  is  for  any  e  >  0, 

using  a  somewhat  different  approach. 

Let  A4  denote  the  Gaussian  diagram  (or  normal 
diagram)  of  V.  Ad  is  a  spherical  map  on  the  unit 


sphere  C  M®.  The  vertices  of  M  are  points  on  §^, 
each  being  the  outward  normal  of  a  face  of  V,  the 
edges  of  M  are  great  circular  arcs,  each  being  the 
locus  of  the  outward  normal  directions  of  all  planes 
supporting  V  at  some  fixed  edge,  and  the  faces  of 
M  are  regions,  each  being  the  locus  of  the  outward 
normal  directions  of  all  planes  supporting  P  at  a  ver¬ 
tex.  Ad  can  be  computed  in  linear  time  from  V.  Let 
Ad'  denote  the  spherical  map  Ad  reflected  through 
the  origin,  and  consider  the  superposition  of  Ad  and 
Ad'.  It  suffices  to  consider  the  top  parts  of  Ad  and 
Ad',  i.e.,  their  portions  within  the  hemisphere  2:  >  0. 
Each  intersection  point  between  an  edge  of  Ad  and  an 
edge  of  Ad'  gives  us  a  direction  u  for  which  there  exist 
two  parallel  planes  orthogonal  to  u  and  supporting  V 
at  two  so-called  antipodal  edges.  Thus  the  problem 
reduces  to  that  of  finding  an  intersection  point  for 
which  the  distance  between  the  corresponding  paral¬ 
lel  supporting  planes  is  minimized. 

We  centrally  project  the  edges  of  (the  top  portions 
of)  Ad  and  Ad'  onto  the  plane  2  =  1,  Each  edge 
projects  to  a  line  segment  or  a  ray.  Let  S  and  5'  be  the 
resulting  set  of  segments  and  rays  in  the  plane.  Using 
the  algorithm  described  in  [10],  we  can  decompose 
£  X  £',  in  time  O(nlog^n)  into  a  family  of  ‘canonical 
subsets’ 

J^={(£u£[),iS2,£^),...,i£t,£i)},  (1) 

such  that 

(i)  £,C£  and  T'  C  £'; 

t 

(>i)  =  C)(nlog^n); 

i=l 

(hi)  each  segment  in  £i  intersects  every  segment  of 
£■;  and 

(iv)  for  every  pair  e,  e'  of  intersecting  segments  in  f  x 
£',  there  is  an  i  such  that  e  e  £i  and  e'  G  T'. 

By  property  (iv),  it  suffices  to  consider  each  pair 
i£i,£l)  separately.  Let  Hi  (resp.  £')  denote  the  set  of 
lines  containing  the  edges  of  V  corresponding  to  the 
edges  of  £i  (resp.  £').  By  construction,  all  lines  of  £, 
lie  above  all  lines  of  £'.  Property  (iii)  implies  that 
every  pair  of  parallel  planes  tt,  tt',  where  n  contains  a 
line  A  of  £;  and  tt'  contains  a  line  A'  of  £',  are  support¬ 
ing  planes  of  V  as  well.  Hence,  we  want  to  compute 
the  smallest  distance  between  two  such  planes.  Since 
the  distance  between  tt  and  tt',  as  above,  is  equal  to 
the  distance  between  A  and  A',  it  follows  that  this 
problem  is  equivalent  to  that  of  computing  the  clos¬ 
est  pair  of  lines  in  x  £'.  Hence,  we  need  to  solve 
the  following  problem;  Given  a  set  £  of  m  ‘red’  lines 
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and  another  set  L'  of  n  ‘blue’  lines  in  such  that  all 
red  lines  lie  above  all  blue  lines,  compute  the  closest 
pair  of  lines  in  £  x  For  any  pair  of  lines  i,  e 
let  d{£,i')  be  the  Euclidean  distance  between  them, 
and  let  d{C,C')  =  min^g£,  ^'^c'  d{£,i'). 

We  first  describe  a  simple  randomized  divide-and- 
conquer  algorithm  for  computing  d{C,C'),  whose  ex¬ 
pected  running  time  is  -|-  mlog^n).  If  m  = 

0(1),  we  compute  d{£,£')  for  all  pairs  £  &  C,  £'  &  £', 
and  choose  the  minimum  distance.  Otherwise,  the 
algorithm  performs  the  following  steps: 

(i)  Choose  a  random  subset  i?i  C  £  of  m/2  red 
lines;  each  subset  of  size  m/2  is  chosen  with  equal 
probability. 

(ii)  Solve  the  problem  recursively  for  (i?i,£').  Let 

6i=d{Ri,C'). 

(hi)  Compute  the  set  R2  =  {£  G  £  \  Ri  \  d{£,  £')  < 

(iv)  Compute  d{£,£')  for  all  pairs  £  G  R2,  £'  G  C , 
and  output  the  minimum  distance  (or  output  61 
if  i?2  is  empty). 

For  a  line  £'  G  £',  let  R^^"*  =  {^  6  £  |  d{£,£')  <  5i}, 
so  that  Using  a  standard  prob¬ 

abilistic  argument,  we  can  show  that  the  expected 
size  of  R^^''>  is  0(1),  for  each  £'  G  C .  Therefore  the 
expected  size  of  R2  is  0(n).  Consequently,  the  ex¬ 
pected  running  time  of  Step  (iv)  is  0{'n?).  The  only 
non-trivial  step  in  the  above  algorithm  is  Step  (iii). 
We  compute  R2  as  follows.  We  map  each  line  £  G  C 
to  a  point  'tl){£)  =  (ai, 02, 03, 04)  in  where  y  = 
aiX  4-  02  is  the  equation  of  the  a^y-projection  of  £, 
and  z  =  a^u  -|-  04  is  the  equation  of  £  in  the  ver¬ 
tical  plane  y  =  aix  +  02  (where  u  denotes  the  axis 
orthogonal  to  the  z-axis).  We  can  also  map  a  line 
£'  to  a  surface  7  =  'y{£')  which  is  the  locus  of  all 
points  ip{£)  such  that  d{£,£')  =  61  and  £  lies  above 
£! .  Removal  of  7  partitions  R^  into  two  subsets,  one, 
denoted  by  7“,  consisting  of  points  for  which  the  cor¬ 
responding  lines  either  lie  below  or  lie  above  £!  and 
d{£,£')  <  61,  and  the  other  subset,  7+,  consisting 
of  points  whose  corresponding  lines  lie  above  £'  and 
d{£,£')  >  61.  Our  choice  of  parameters  ensures  that 
'){£')  is  a:ia;2a;3-monotone,  i.e.,  any  line  parallel  to  the 
a;4-axis  intersects  '){£')  in  at  most  one  point.  Hence, 
'y{£')  is  the  graph  of  a  function  =  fe'{xi,X2,X3). 
Let 

F{xi,X2,X3)  =  m&xfe'{xi,X2,X3) 

be  the  upper  envelope  of  the  set  {/(,'  |  £'  S  £'}.  For 
a  line  £  G  £  with  ^p{£)  =  (01,02,03,04),  we  have 
04  >  £(01,02,03)  if  and  only  if  d{{£},£')  >  (5i.  The 


problem  of  computing  R2  thus  reduces  to  computing 
the  set  of  points  '4’{£),  for  f  G  £  \  i?i,  that  lie  below 
the  upper  envelope  F.  By  Theorem  3.3,  this  can  be 
accomplished  in  time  4-  m  log^  n),  for  any  e  > 

0.  The  total  time  spent  in  Steps  (iii)  and  (iv)  is  thus 
0(n^+^4-m  log^  n).  Let  T(m,  n)  denote  the  maximum 
expected  running  time  of  the  algorithm.  Then 

r(m, n)  <T(J^,n)  +  O 4-  m log^  n) . 

The  solution  to  this  recurrence  is  easily  seen  to  be 
0(n®+^  logm  4-  m  log^  n)  =  0(n^+^  4-  m  log^  n),  for 
any  s'  >  e.  Hence,  we  can  conclude: 

Lemma  5.1  Given  a  set  £  of  m  lines  and  another 
set  £'  of  n  lines  in  R^,  such  that  all  lines  in  £  lie 
above  all  lines  of  £' ,  the  closest  pair  of  lines  in  £x  £' 
can  be  computed  in  time  4-mlog^n),  for  any 

6  >  0. 

Next,  we  describe  another  algorithm  for  computing 
d{£,£'),  which  uses  the  above  procedure  as  a  sub¬ 
routine.  We  first  describe  an  algorithm  for  the  ‘fixed- 
size’  problem  that,  given  a  parameter  6,  determines 
whether  d{£,£')  <  S,  d{£,£')  =  6,  or  d(£,  £')  >  6. 
Next,  we  apply  the  parametric  search  technique  to 
this  algorithm,  with  an  additional  twist,  to  compute 
d{£,£'). 

Consider  the  fixed-size  problem.  If  m  >  the 
problem  can  be  solved,  in  0(mlog^  n)  time,  by  com¬ 
puting  (5*  =  d{£,£'),  using  Lemma  5.1,  and  by  com¬ 
paring  6*  with  8.  So  assume  that  m  <n^. 

We  now  map  each  line  £'  G  £'  to  the  point  V’(£), 
as  defined  above,  and  each  line  ^  G  £  to  a  surface 
<f{£)  which  is  the  locus  of  all  points  such  that 
d{£,£')  =  5  and  £'  lies  below  1.  (This  is  a  dual  rep¬ 
resentation  of  the  problem,  where  the  roles  of  £  and 
of  £'  are  interchanged.)  The  open  region  </>“  lying 
below  (^(f)  consists  of  points  corresponding  to  lines 
£'  that  lie  below  £  and  satisfy  d{£,£')  >  8.  Again, 
each  surface  (j){£)  is  a;ia;2a;3-monotone,  i.e.,  (l){£)  is  the 
graph  of  a  function  x^  =  gi{x\,X2,xf)-  Let 

G(a;i,a;2,a;3)  =  mmgi{xi,X2.,X2) 

be  the  lower  envelope  of  these  functions.  By  the  same 
argument  as  above,  d{£,£')  <  5  if  and  only  if  there 
is  a  point  t^(£),  for  some  £'  G  £',  that  lies  above  the 
lower  envelope  G,  and  d(£,£')  =  5  if  and  only  if  no 
point  corresponding  to  the  lines  of  £'  lies  above  G 
and  at  least  one  such  point  lies  on  G.  To  detect  these 
two  conditions,  we  proceed  as  follows. 

(i)  Fix  a  sufficiently  large  constant  r.  Choose  a  ran¬ 
dom  subset  i?  C  £  of  size  t  =  cr  log  r,  where  c  is 
some  appropriate  constant  independent  of  r. 
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(ii)  Let  Fij  =  I  f  G  i?}  be  the  set  of  t  tri- variate 
functions  corresponding  to  the  lines  of  R,  and  let 
Gr  be  the  lower  envelope  of  F/j.  Decompose  the 
region  of  lying  below  Gr  into  a  collection  E  = 
{ti,  ...  ,Tfc}  of  A:  =  0{t‘^f3{t))  =  0{r^(i{r)  log^r) 
semialgebraic  cells  of  constant  description  com¬ 
plexity  each;  here  /3(r)  is  a  slowly  growing  func¬ 
tion  whose  exact  form  is  determined  by  the  alge¬ 
braic  degree  of  the  surfaces  in  F.  Such  a  decom¬ 
position  can  be  obtained  using  the  algorithms 
described  in  [4,  11].  It  is  based  on  a  decompo¬ 
sition  of  into  a  family  E'  of  0[r^ (3{r)\og^  r) 
cells  of  constant  description  complexity,  so  that, 
within  each  cell,  the  same  set  of  function  graphs 
appear  on  the  lower  envelope  Gr.  The  decompo¬ 
sition  E  is  then  defined  as  the  following  collection 
of  semi-unbounded  ‘prisms’: 

^  I  X  e  A,  .2;  <  Gfl(x)}  I  A  e  S'}  . 

(iii)  For  each  prism  t  £  E,  let  Cr  =  {(■  £  C  \  n 

r  ^  0},  and  £;  =  {f  £  C  \  )  G  r). 

(iv)  If  UrgH^r  there  is  a  line  f  £  £' 

such  that  the  point  ^(£')  lies  above  Gr),  then 
return  6  >  5* .  If  there  is  a  r  G  E  such  that 
|£t-1  >  rn/r  -f  1,  then  go  back  to  Step  (i). 

(v)  For  each  prism  r  G  S,  if  both  are 

nonempty,  then  solve  the  problem  recursively  for 
£t  and  £:}.  If  there  is  a  subproblem  for  which 
6  >  d{CT,C'r),  then  return  6  >  6* .  If  there  is  no 
such  subproblem,  but  there  is  one  subproblem 
for  which  6  =  8*,  then  return  6  =  6*.  Finally,  if 
none  of  these  two  cases  occur,  then  return  6  <  6* . 

The  correctness  of  the  algorithm  follows  from  the 
above  observations.  As  for  the  running  time,  the  well- 
known  results  on  random  sampling  [15,  20]  imply  that 
if  c  is  chosen  sufficiently  large  then,  with  high  proba¬ 
bility,  \Ct\  <  mlr+  1  for  every  r  G  E.  Hence,  the  ex¬ 
pected  number  of  times  we  have  to  go  back  to  Step  (i) 
from  Step  (iv)  is  only  a  constant.  Let  T(m,  n)  denote 
the  expected  running  time  of  the  algorithm.  Then 

k 

T{m,n)  +  I, +  0{m  +  n) , 

2  =  1 

for  m  <  where  X],  =  n.  The  solu¬ 

tion  of  the  above  recurrence  is  easily  seen  to  be 
Q(^8/ii-i-e'^9/ii+e'  _|_  _j_  for  any  e'  >  e. 

Next,  we  describe  how  to  apply  the  parametric 
search  technique  to  this  algorithm.  As  mentioned 
earlier,  we  cannot  use  this  algorithm  directly  in  the 
parametric  search  paradigm,  because  we  do  not  know 


how  to  parallelize  the  first  algorithm.  We  therefore 
simulate  the  above  sequential  algorithm  at  (5  =  6*, 
with  the  additional  twist  that,  instead  of  just  sim¬ 
ulating  the  solution  of  the  fixed-size  problem  at  6*, 
the  algorithm  will  attempt  to  compute  6*  explicitly. 
Since  r  is  chosen  to  be  constant,  the  set  E  can  be 
computed  by  making  only  implicit  “sign  tests” 
involving  8*,  thus  only  a  constant  number,  of 

solutions  of  the  fixed-size  problem  are  needed;  see 
[4,  11]  for  details.  It  can  be  checked  that  the  prob¬ 
lem  of  determining  whether  a  surface  (f>{i)  intersects  a 
prism  T  G  E  or  whether  a  point  ip{R)  lies  in  t  can  be 
reduced  to  computing  the  signs  of  a  constant  number 
of  univariate  polynomials,  each  of  constant  degree,  at 
<5*.  Let  n  =  {pi((5),  ...  ,Ps((5)},  s  =  0(m-|-n),  be 
the  set  of  these  polynomials,  over  all  lines  of  £,£' 
and  over  all  prisms  of  E.  Let  5i  <  ■  •  •  <  be  the 
real  roots  of  these  polynomials.  By  a  binary  search 
over  these  roots  we  compute  the  largest  root  8a  such 
that  6a  <  d(£,  £').  Each  step  of  the  binary  search 
involves  comparing  6*  with  some  6i,  which  in  turn 
involves  solving  an  instance  of  the  fixed  size  problem 
for  some  6^.  If  6a  =6*,  we  have  found  the  value  of 
6*  so  we  stop  right  away.  Thus,  assume  that  8a  <8*. 
We  can  now  easily  resolve  the  signs  of  all  polynomi¬ 
als  of  n  by  evaluating  them  at  ((5q  -t-  i5q+i)/2.  Once 
we  have  computed  £T,£r  for  all  t  G  E,  we  compute 
d{Cr,  £(-)  recursively  and  return  min.rgE  d{Cr,CT.)  as 
d{C,  £').  (Note  that  UrgH  since  we  are  sim¬ 

ulating  the  algorithm  at  8* .)  The  correctness  of  the 
algorithm  is  established  by  the  following  lemma. 

Lemma  5.2  If  none  of  the  calls  to  the  fixed  size  prob¬ 
lem  in  Steps  (i)-(iv)  returns  the  value  of  d{C,,C'), 
then  d{C,  £')  =  mmygH  rf(£r,  £})• 

Proof:  Let  d{£,C')  =  8*,  and  let  i  £  C,  i'  £  C  be 
a  pair  of  lines  such  that  d{£,£')  =  8*.  Since  we  are 
simulating  the  fixed-size  problem  at  ^  =  <5*,  no  point 
corresponding  to  the  lines  of  £'  lies  above  the  lower 
envelope  of  the  surfaces  defined  by  £  (for  6  =  8*), 
i.e.,  there  exists  a  prism  r  G  E  such  that  £'  £  £(. 
(i.e.,  '0(^0  iios  in  r).  If  £  does  not  belong  to  then 
0(0)  lies  below  0(f),  which  implies  that  d{£,£')  >  8*, 
a  contradiction.  Hence  the  pair  {£,  £')  appears  in  the 
subproblem  involving  £,-  and  £0  □ 

Let  T'{m,n)  denote  the  maximum  expected  run¬ 
ning  time  of  the  algorithm.  Then  we  obtain  the  fol¬ 
lowing  recurrence 

k 

T'{m,  n)  <'^T'  +  1,  Ui'j  +  ■  T{m,  n) , 

i  =  l 

for  m  <  The  solution  of  the  above  recurrence  is 

also  0(m®Ai+e  ^9/ii+e  _|_^i+e  fQj.  differ- 
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ent  but  still  arbitrarily  small  constant  e'  >  e.  Hence, 
we  can  conclude 

Lemma  5.3  Given  a  set  C  of  m  lines  and  another 
set  CJ  of  n  lines  such  that  all  lines  in  C  lie  above  all 
lines  of  C ,  the  closest  pair  of  lines  in  C  x  C'  can  be 
computed  in  time  +m^+^  +  n^+^), 

for  any  e  >  0. 

Finally,  we  apply  Lemma  5.3  to  all  subsets 
{€i,S-)  of  T  (cf.  (1)),  and  output  the  mini¬ 
mum  of  the  distances  obtained  for  each  subprob¬ 
lem.  This  is  equal  to  the  minimum  distance  be¬ 
tween  any  pair  of  parallel  planes,  each  supporting 
an  edge  of  V.  The  total  expected  running  time 

is  ELi  O  +  I'?;!' +")  = 

0(n}'^/^^'^^'),  where  e'  is  yet  another  but  still  arbi¬ 
trarily  small  positive  constant.  Putting  everything 
together,  we  can  conclude: 

Theorem  5.4  The  width  of  a  set  of  n  points  in 
can  be  computed  in  randomized  expected  time 
0{n^’^ ,  for  any  e  >  0. 

5.2  Biggest  stick  in  a  simple  polygon 

Let  P  be  a  simple  polygon  in  the  plane  having  n 
edges.  We  wish  to  find  the  longest  line  segment  e 
that  is  contained  in  (the  closed  set)  P.  Chazelle  and 
Sharir  presented  in  [13]  an  0(n^  ®®)-time  algorithm 
for  this  problem,  which  was  later  improved  by  Agar- 
wal  et  al.  [4]  to  0(n®/®+^),  for  any  £  >  0.  The  latter 
paper  reduces  this  problem,  just  as  in  the  computa¬ 
tion  of  the  width  of  a  point  set  in  3-space,  to  that  of 
finding  the  extreme  value  of  a  certain  function  of  two 
parameters,  optimized  over  all  intersection  points  be¬ 
tween  the  edges  of  two  planar  maps.  This  problem,  in 
turn,  reduces  to  the  problem  of  optimizing  the  func¬ 
tion  over  all  intersection  points  of  pairs  of  lines  in 
Cl  X  C2,  where  £1,  T2  are  two  appropriate  families  of 
lines.  By  regarding  the  lines  of  £1  as  data  points,  and 
each  line  of  £2  as  inducing  a  certain  function  over  the 
lines  of  £1,  and  by  using  an  appropriate  parametriza- 
tion  of  these  points  and  functions,  the  problem  can 
be  reduced  to  that  of  testing  whether  any  point  in  a 
certain  set  of  points  in  4-space  lies  above  the  lower 
envelope  of  a  certain  collection  of  tri-variate  func¬ 
tions.  Combining  the  approach  of  [4]  with  the  one 
described  in  the  previous  subsection,  we  can  compute 
the  longest  segment  e  in  expected  time 
for  any  e  >  0.  Omitting  all  the  details,  which  can  be 
found  in  [4],  we  conclude: 

Theorem  5.5  One  can  compute  the  biggest  stick 
that  fits  inside  a  simple  polygon  with  n  edges,  in  ran¬ 
domized  expected  time  for  any  e  >  0. 


5.3  Minimum-width  annulus 

Let  S'  be  a  set  of  n  points  in  the  plane.  We  wish 
to  find  the  (closed)  annulus  of  smallest  width  that 
contains  S,  i.e.,  we  want  to  compute  two  concentric 
disks  Di  and  D2  of  radii  ri  and  r2,  such  that  all 
points  of  S  lie  in  the  (closure  of)  £>2  -  £^i  and  r2  —  ri 
is  minimized.  This  problem  has  been  studied  in  [4].  It 
is  known  [4,  17]  that  the  center  of  such  an  annulus  is 
either  a  vertex  of  the  closest-point  Voronoi  diagram, 
Vorc(S),  of  S,  a  vertex  of  the  farthest-point  Voronoi 
diagram,  Vor/(S),  of  S,  or  the  intersection  point  of 
an  edge  of  Vorc(S)  and  an  edge  of  YoTf{S).  The 
difficult  part  is  testing  the  intersection  points  of  edges 
of  the  two  diagrams,  because  there  can  be  0(n^)  such 
points  in  the  worst  case.  Following  the  same  idea  as 
in  [4],  which  reduces  the  problem  to  that  of  batched 
searching  of  points  relative  to  an  envelope  of  functions 
in  4  dimensions,  but  using  the  technique  described 
above  for  computing  the  width  in  3-space,  we  obtain: 

Theorem  5.6  The  smallest-width  annulus  contain¬ 
ing  a  set  of  n  points  in  the  plane  can  be  computed  in 
randomized  expected  time  for  any  e  >  0. 
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Query- Sensitive  Ray  Shooting 


Joseph  S.  B.  Mitchell*  David  M.  Mounfl  Subhash  Suri^ 


1  Introduction 

Ray  shooting  is  the  problem  of  determining  the  first  in¬ 
tersection  of  a  ray  with  a  set  of  obstacles.  We  assume 
that  obstacles  are  modeled  as  polygons  in  the  plane,  or 
generally  polyhedra  in  higher  dimensions.  Since  many 
ray  shooting  queries  may  be  applied  to  a  given  set  of  ob¬ 
stacles,  it  is  desirable  to  preprocess  the  obstacles  so  that 
query  processing  is  as  efficient  as  possible.  The  problem 
of  ray  shooting  in  a  simple  polygon  with  n  vertices  has 
been  studied  by  [7,  6,  10].  In  all  cases,  with  linear 
space,  ray  shooting  queries  can  be  answered  in  O(logn) 
time.  Ray  shooting  amidst  a  set  of  polygonal  obstacles 
has  been  considered  by  [1,  6,  8,  10,  15].  In  all  these 
cases,  space  is  within  a  polylogarithmic  factor  of  linear, 
and  the  best  query  time  is  0(-y/nlogn)  query  time. 

One  of  the  shortcomings  of  these  approaches  is  that 
they  fail  to  account  for  the  geometric  structure  of  the 
obstacle  set  and  query  ray.  Analyses  based  solely  upon 
input  size  cannot  account  for  the  fact  that,  for  a  fixed  in- 
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put  size,  there  exist  configurations  of  obstacles  and  rays 
for  which  solving  ray  shooting  queries  is  intuitively  very 
easy,  and  others  for  which  it  is  quite  a  bit  more  com¬ 
plicated.  Practical  experience  suggests  that  worst-case 
scenarios  are  rare  for  many  applications.  By  concen¬ 
trating  on  worst-case  asymptotic  performance,  algorithm 
designers  may  overlook  simple  approaches  that  perform 
very  well  in  the  vast  majority  of  practical  situations,  even 
though  their  worst-case  performance  can  be  quite  poor. 
One  approach  for  dealing  with  this  phenomenon  is  to  de¬ 
sign  algorithms  for  special-purpose  classes  of  inputs,  e.g. 
for  convex,  star-shaped,  or  monotone  polygons.  How¬ 
ever,  these  approaches  may  not  be  useful  if  inputs  do 
not  satisfy  these  conditions.  An  alternative  approach 
is  to  design  algorithms  with  good  expected-case  perfor¬ 
mance.  However,  there  does  not  seem  to  be  a  useful 
notion  of  “random”  polygon  or  “random”  polyhedron 
that  captures  the  structure  existing  in  many  geometric 
applications. 

In  this  paper  we  suggest  a  different  approach.  We 
present  a  fully  general  algorithm  for  the  ray  shooting 
problem,  whose  performance  is  not  described  as  a  func¬ 
tion  of  input,  but  rather  as  a  function  of  the  complexity 
of  a  query  with  respect  to  an  intrinsic  parameter  of  the 
input.  This  parameter  is  called  the  simple  cover  com¬ 
plexity,  and  will  be  defined  shortly.  We  feel  that  this 
parameter  intuitively  captures  the  geometric  complexity 
of  a  given  query.  As  in  [6,  10],  we  construct  a  carefully 
designed  subdivision  of  free  space  that  has  low  “stabbing 
number”  with  respect  to  that  portion  of  a  query  segment 
that  lies  within  free  space.  This  allows  query  processing 
to  proceed  by  a  simple  “walk”  through  the  subdivision. 

We  focus  on  a  generalization  of  ray  shooting,  called 
directed  segment  shooting  (or  just  segment  shooting,  for 
short) .  Given  a  set  of  obstacles  and  a  directed  line  seg¬ 
ment  pg,  determine  the  first  intersection  (if  any)  with  an 
obstacle  for  a  point  traveling  along  this  segment  from  p 
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to  q.  If  there  is  no  intersection,  the  point  q  is  returned. 
The  reason  for  this  formulation  is  that  there  are  applica¬ 
tions  of  ray  shooting  arising  in  motion  simulation  where 
a  number  of  short  segment  queries  are  used  to  approx¬ 
imate  more  complex  curved  motion.  Furthermore,  in 
some  interference-detection  algorithms  for  virtual  reality 
applications,  one  tracks  the  motion  of  an  object  through 
free-space  by  performing  segment-shooting  queries  for 
each  boundary  edge  of  the  moving  object  [12]. 

Before  defining  the  complexity  parameter  referred  to 
above,  we  give  some  definitions.  Let  P  denote  a  fi¬ 
nite  collection  of  pairwise-disjoint  polygonal  obstacles  in 
the  plane,  or,  more  generally,  polyhedral  obstacles  in  d- 
space.  Because  our  algorithms  are  based  on  repeated 
subdivision  of  space  into  sufficiently  simple  regions,  we 
need  to  make  the  bounded  incidence  assumption  that 
each  point  is  incident  to  at  most  a  constant  number, 
/?,  of  faces  of  dimensions  <  d  -  1.  Here  face  refers  to 
boundary  flats  of  all  dimensions  <  d  -  1.  Define  a  ball 
of  radius  r  to  be  the  (open)  set  of  points  that  are  less 
than  distance  r  from  a  given  center  point.  In  the  plane, 
a  ball  is  simple  if  it  intersects  at  most  2  edges  of  P.  In 
dimension  d  >  3,  the  value  2  can  be  replaced  with  any 
constant  >  /?.  Given  any  e  >  0,  we  say  that  a  ball  of  ra¬ 
dius  r  is  e-strongly  simple  if  the  ball  with  the  same  center 
and  radius  (1  -b  e)r  is  simple.  Given  e,  a  strongly  simple 
cover  of  a  domain  D,  is  a  collection  of  e-strongly  sim¬ 
ple  balls  whose  union  contains  D.  (Note  that  the  balls 
themselves  cover  D,  but  their  expansion  by  (1  -b  e)  is 
simple.)  Finally,  the  simple  cover  complexity  of  D  (with 
respect  to  e),  scc(D),  is  defined  to  be  the  cardinality  of 
the  smallest  strongly  simple  cover  of  D.  We  will  use  the 
term  complexity  for  short. 

For  example,  in  Fig  1,  we  show  a  strongly  simple  cover 
of  size  4  for  a  line  segment  (shown  as  a  dashed  line).  The 
e  value  is  1/4.  Solid  circles  are  the  balls  of  the  cover,  and 
dotted  circles  are  balls  of  radius  (1  -b  e)r. 


Figure  1:  Strongly  simple  cover. 

Intuitively,  it  is  easy  to  see  that  a  ray  that  is  short 
and  geometrically  well  separated  from  all  obstacles  will 


have  a  smaller  complexity  value  than  a  ray  that  grazes 
the  boundary  of  the  obstacles.  In  this  sense,  we  feel 
that  this  complexity  measure  is  a  reasonable  reflection 
of  the  intrinsic  difficulty  of  solving  a  ray  shooting  query. 
Observe  that  complexity  is  invariant  under  orthogonal 
transformations  of  the  plane;  however,  it  is  not  invariant 
under  general  affine  transformations,  since  balls  are  not 
preserved. 

We  present  two  results.  For  ray  shooting  queries  in 
arbitrary  dimensions,  we  analyze  the  performance  of  a 
simple  variant  of  an  existing  data  structure,  called  a  PM 
k-d  tree  [16],  with  respect  to  simple  cover  complexity. 
Samet  [16],  gives  an  analysis  of  the  maximum  depth  of 
this  tree  in  terms  of  the  minimum  separation  distance 
between  nonadjacent  vertices  and  edges.  We  show  that 
the  size  of  the  data  structure  and  the  time  complexity 
of  segment  shooting  queries  can  be  related  to  the  sim¬ 
ple  cover  complexity  of  the  obstacle  set  and  the  query 
segment,  respectively.  In  the  plane,  we  strengthen  these 
results.  Given  a  set  of  polygonal  obstacles  with  a  total 
of  n  vertices,  we  show  how  to  to  build  a  subdivision  of 
size  0{n)  in  0(n\ogn)  time,  such  that  segment  shooting 
queries  can  be  answered  in  time  0(logn-bscc(s)),  where 
scc(s)  is  the  simple  cover  complexity  of  the  obstacle-free 
portion  of  the  query  segment. 

Because  we  terminate  our  search  along  a  query  ray  as 
soon  as  it  hits  an  obstacle,  it  is  possible  to  make  a  further 
improvement  in  the  planar  analysis.  We  say  that  two 
points  can  reach  one  another  if  there  is  a  path  between 
them  that  does  not  intersect  any  obstacle.  A  region  of 
space  is  connected  if  all  points  within  this  region  are  mu¬ 
tually  reachable,  and  a  connected  component  of  the  re¬ 
gion  is  a  maximally  connected  subset  of  the  region.  We 
can  modify  the  definition  of  simplicity,  by  saying  that  a 
ball  that  contains  some  point  p  is  C-simple  (where  “C” 
stands  for  connected)  if  p’s  connected  component  in  the 
ball  is  bounded  by  at  most  two  obstacle  edges.  We  can 
modify  the  definition  of  complexity  using  C-simplicity, 
to  derive  a  notion  of  C-complexity.  It  is  easy  to  see  that 
any  ball  that  is  simple  is  also  C-simple,  and  thus  the 
C-complexity  of  an  object  is  not  greater  than  its  com¬ 
plexity.  Consider,  for  example,  the  line  segment  shown 
as  a  dashed  line  in  Fig.  2.  Its  standard  complexity  would 
be  quite  high,  but  because  the  connected  components  of 
the  two  covering  balls  containing  this  segment  are  sim¬ 
ple,  its  C-complexity  is  only  2.  Our  results  in  the  plane 
hold  under  this  connected  notion  of  simple  cover  com¬ 
plexity. 

Before  describing  our  algorithms  we  mention  one  tech¬ 
nical  fact.  The  definition  of  simplicity  is  based  on  a  pa¬ 
rameter  e  >  0.  In  the  following  theorem  we  observe  that 
the  choice  of  e  affects  the  complexity  of  a  domain  by 
only  a  constant  factor,  and  so  henceforth  we  exclude  ref¬ 
erences  to  e  when  deriving  asymptotic  bounds.  Its  proof 
is  straightforward,  and  is  omitted  from  this  version. 
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Lemma  1.1  Let  Q  <  e  <  e' ,  and  let  D  be  some  do¬ 
main.  Let  scc{D)  denote  the  simple  cover  complexity  of 
D  with  respect  to  e,  and  let  scc'{D)  denote  the  simple 
cover  complexity  with  respect  to  e' .  Then  there  is  a  con¬ 
stant  c  (depending  on  e,  e'  and  dimension)  such  that 

scc{D)  <  scc'{D)  <  c  ■  scc{D). 


To.  support  segment  shooting  queries,  we  make  one 
modification,  which  we  call  smoothing.  (Smoothing  is  a 
well  known  operation  on  quad  tree  data  structures  and 
is  better  known  under  the  names  of  “restriction”  [11] 
or  “balancing”  [14,  4,  13].  We  prefer  the  term  “smooth¬ 
ing”  because  it  avoids  confusion  with  balancing  in  binary- 
trees.)  The  idea  of  smoothing  is  quite  simple.  We  say 
two  cells  are  neighbors  if  they  share  a  common  bound¬ 
ary  of  dimension  d  —  1.  Define  the  size  of  a  cell  to  be 
the  length  of  its  longest  dimension.  If  there  are  two 
neighboring  cells  whose  sizes  differ  by  more  than  some 
constant  factor  (at  least  2),  then  the  larger  cell  is  split. 
Splitting  is  repeated  until  all  neighboring  cells  satisfy 
the  size  threshold.  It  has  been  shown  by  Moore[14]  that 
smoothing  does  not  alter  the  size  of  the  subdivision  ex¬ 
cept  by  a  constant  factor  (in  general  depending  on  di¬ 
mension). 

To  analyze  the  time  for  query  processing,  we  first  need 
to  provide  a  relationship  between  the  size  of  cells  in 
the  subdivision  and  strongly  simple  balls.  For  these  re¬ 
sults,  we  use  the  standard  notion  of  simplicity  (not  C- 
simplicity). 


2  A  Simple  Approach 

In  this  section  we  give  a  query-sensitive  analysis  of  a  sim¬ 
ple,  practical  decomposition  strategy,  which  illustrates 
the  essential  features  of  our  method.  The  decomposition 
strategy  is  similar  to  one  described  by  Samet  [16],  called 
a  PM  k-d  tree. 

We  begin  by  assuming  that  we  are  given  a  set  of  poly¬ 
hedral  obstacles  in  d-dimensional  space.  As  mentioned 
in  the  introduction,  we  assume  that  faces  bounding  these 
obstacles  are  simple  in  the  sense  that  each  point  is  in¬ 
cident  to  at  most  a  constant  number,  /?,  of  faces  of  di¬ 
mension  <  d  —  1 .  We  also  assume  that  the  obstacles  are 
contained  within  a  minimal  bounding  hypercube.  (Such 
a  hypercube  is  easy  to  construct  in  linear  time.)  This  hy¬ 
percube  is  our  initial  box.  Let  R  denote  the  side  length 
of  this  hypercube.  A  box  is  said  to  be  crowded  if  it  in¬ 
tersects  more  faces  than  some  given  threshold  (at  least 
as  large  as  the  incidence  bound,  /?).  If  a  box  is  crowded, 
then  it  is  split  by  a  hyperplane  passing  through  the  center 
of  the  box  and  perpendicular  to  its  longest  dimension. 
(Ties  for  the  longest  dimension  can  be  broken  arbitrar¬ 
ily,  e.g.,  taking  the  lowest  numbered  axis.)  The  splitting 
process  naturally  defines  a  binary  tree.  The  original  box 
is  the  parent  of  the  resulting  boxes  in  this  tree.  The 
splitting  process  terminates  when  no  remaining  boxes 
are  crowded.  The  resulting  boxes,  called  cells,  form  a 
subdivision  of  space,  each  of  which  contains  a  constant 
number  of  obstacle  boundary  elements. 

Observe  that  splitting  will  eventually  terminate,  be¬ 
cause  of  the  bounded  incidence  assumption.  However, 
the  number  of  cells  in  the  subdivision  cannot  be  bounded 
simply  as  a  function  of  n. 


Lemma  2.1  Let  e>0,  and  let  b  be  an  e-strongly  simple 
ball  of  radius  r.  Ifb  intersects  a  cell  of  the  decomposition 
of  size  s,  then  s  >  ar  for  some  constant  a  ( depending 
on  e  and  dimension). 

Proof:  Let  a  =  e/(6\/d).  For  concreteness,  assume 
that  in  smoothing,  we  split  a  cell  if  a  neighbor  is  of  size 
less  than  1/2  of  its  size,  and  that  e  <  1.  Suppose  to 
the  contrary  that  there  exists  a  cell  that  violates  the 
conditions  of  this  lemma.  Consider  the  smallest  such 
cell  c,  let  s  denote  the  size  of  c,  and  let  p  denote  the 
parent  box  of  c.  Box  p  may  have  split  for  one  of  two 
reasons.  Either  (1)  p  itself  is  not  simple,  or  (2)  p  is  the 
neighbor  of  a  cell  c'  whose  size  is  less  than  1/2  the  size 
of  p,  and  we  split  p  for  smoothing.  The  size  of  p  is  at 
most  2s,  and  hence  the  diameter  of  p  is  at  most  2s^/d. 

In  case  (1),  since  c  intersects  b,  then  within  distance 
2sy/d  of  the  boundary  of  b  there  is  a  crowded  cell.  But, 


2s'/d  <  2ary/d  ■■ 


2er\/d 

6\/d 


<  er. 


Therefore,  the  (1  -|-  e)  expansion  of  b  is  not  simple,  a 
contradiction. 

In  case  (2),  let  s'  denote  the  size  of  c'.  Since  s'  < 
2s/2  =  s,  and  since  sizes  vary  by  factors  of  2,  s'  <  s/2. 
We  claim  that  there  is  an  e-strongly  simple  ball  b'  of 
radius  r/2  that  contains  c'  (see  Fig.  3).  This  will  provide 
us  with  the  desired  contradiction,  because  c'  satisfies  the 
conditions  of  the  lemma  (since  c  was  the  smallest  cell  not 
satisfying  the  lemma  and  s'  <  s),  and  hence 
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implying  s  >  ar,  contradicting  our  earlier  hypothesis. 

To  prove  the  existence  of  b',  first  observe  that  since 
the  size  of  c'  is  at  most  s/2,  its  diameter  is  at  most 
s\/d/2.  Since  c'  is  a  neighbor  of  p,  and  since  p  intersects 
b,  c'  is  contained  within  a  ball  centered  within  6,  whose 
radius  is  the  sum  of  the  diameters  of  p  and  c',  which  is 
less  than  ?,s-\/d.  (This  is  the  smallest  circle  in  Fig.  3.) 
Since  s  <  ar  it  is  easy  to  verify  that  the  radius  of  this 
ball  is  less  than  re/2.  Assuming  e  <  1,  it  is  easy  to  see 
that  there  is  a  ball  b'  of  radius  r/2  that  contains  this 
ball,  and  is  no  further  from  boundary  of  b.  Because  b  is 
within  distance  3s\/d  of  the  boundary  of  b,  the  (1  +  e) 
expansion  of  b'  is  at  most  a  distance 

7*  — 

f  n  +  Ssvd  <  e—  +  Zar^/d  —  er 

^  z 

from  the  boundary  of  6.  Thus,  b'  lies  within  the  (1  +  e) 
expansion  of  b,  and  so  is  simple.  □ 


Lemma  2.2  Given  an  e-strongly  simple  ball  b  of  radius 
r,  the  number  of  cells  that  intersect  b  in  the  subdivision 
is  bounded  above  by  a  constant  (depending  on  e  and  di¬ 
mension). 

Proof:  By  Lemma  2.1,  the  size  of  a  cell  intersecting 
b  is  bounded  from  below  by  ar.  Thus,  the  number  of 
such  cells  is  bounded  above  by  the  number  of  squares 
of  this  side  length  that  can  be  packed  around  (inside  or 
touching)  b,  which  is  on  the  order  of  l/a'^.  □ 

Lemma  2.3  Let  q  be  a  point  in  the  enclosing  hypercube. 
We  can  locate  the  cell  containing  q  in  time  0{log{R/r)), 
where  r  is  the  radius  of  the  largest  strongly  simple  ball 
containing  q,  and  R  is  the  side  length  of  the  bounding 
hypercube. 

Proof:  The  search  for  g  is  a  simple  descent  through 
the  PM  k-d  tree.  With  every  d  levels  of  descent,  the  size 
of  the  associated  regions  decreases  by  a  factor  of  1/2. 
Thus,  if  the  search  continues  for  k  steps,  then  the  search 


terminates  at  a  region  that  contains  q  and  whose  size  s 

R 

obeys  s  <  But  Lemma  2.1  implies  that  s  >  ar, 

for  a  constant  a,  so  that  k  —  0{log{R/r)).  □ 

Segment  shooting  queries  are  processed  by  first  locat¬ 
ing  the  cell  containing  the  origin  of  the  segment,  and 
then  walking  along  the  segment,  cell  by  cell,  through  the 
subdivision.  Because  neighboring  cells  differ  in  size  by 
a  constant  amount,  the  number  of  neighbors  of  a  given 
cell  is  at  most  a  constant  (dependent  on  d).  By  the 
fact  that  each  cell  contains  a  constant  number  of  bound¬ 
ary  elements,  it  follows  that  the  time  to  perform  this 
walking  is  proportional  to  the  number  of  cells  visited 
(with  constant  factors  depending  on  d).  Let  scc(s)  de¬ 
note  the  simple  cover  complexity  of  a  segment  s  (more 
precisely,  the  complexity  of  the  obstacle-free  initial  sub- 
segment  of  s),  and  let  S  be  a  strongly  simple  cover  for  s. 
By  Lemma  2.2,  the  number  of  subdivision  cells  intersect¬ 
ing  each  ball  in  B  is  a  constant,  and  hence,  since  these 
balls  cover  s,  the  number  of  subdivision  cells  traversed 
by  s  is  0(1B|)  =  0(scc(s)).  By  combining  this  with  the 
previous  lemma,  it  follows  that  segment  shooting  queries 
can  be  answered  in  0((logB/7’)-j-scc(5))  time.  Observe 
that,  by  a  similar  argument,  the  number  of  cells  in  the 
subdivision  is  0(scc{P)),  where  scc(P)  denotes  the  sim¬ 
ple  cover  complexity  of  the  obstacle-free  space.  Since 
each  cell  contains  only  a  constant  number  of  boundary 
elements,  the  size  of  the  final  subdivision  is  0{scc(P)). 
From  this  we  have  the  following  analysis  of  the  smoothed 
PM  k-d  tree  subdivision. 

Theorem  2.1  Let  P  be  a  collection  of  polyhedral  obsta¬ 
cles  satisfying  the  bounded  incidence  assumption.  There 
exists  a  data  structure  of  size  0{scc{P))  from  which  seg¬ 
ment  shooting  queries  can  be  answered  in  0{{logR/r) -\- 
scc(s))  time. 

At  this  point  we  remark  that  there  is  another  way  to 
achieve  the  same  result  without  the  use  of  smoothing. 
The  purpose  of  smoothing  the  subdivision  is  to  guaran¬ 
tee  that  the  number  of  cells  that  are  neighbors  to  any 
given  cell  is  a  constant.  However,  it  is  possible  to  avoid 
smoothing,  and  instead  traverse  the  tree  structure  itself 
to  locate  neighboring  cells  (see  [2],  for  example).  It  can 
be  shown  that  finding  neighboring  cells  by  searching  the 
tree  does  not  increase  the  asymptotic  running  time  of 
segment  shooting  queries.  We  have  presented  here  the 
approach  based  on  smoothing,  since  smoothing  will  be 
needed  for  the  approach  described  in  the  next  section. 

This  PM  k-d  tree-based  data  structure  has  the  advan¬ 
tages  of  simplicity  and  ease  of  generalization  to  higher  di¬ 
mensions.  Its  principle  disadvantages  are  that  the  space 
and,  hence,  preprocessing  time  bounds  are  dependent  on 
the  simple  cover  complexity  of  the  obstacle  set,  rather 
than  a  function  of  n. 
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3  The  Planar  Case 

In  this  section  we  modify  the  simple  PM  k-d  tree  con¬ 
struction  given  in  the  previous  section  to  handle  segment 
shooting  for  a  set  of  polygonal  obstacles,  P,  in  the  plane. 
The  principle  improvements  are  the  following. 

•  Query  time  is  based  on  C-complexity  rather  than 
standard  simple  cover  complexity. 

•  Preprocessing  time  is  O(nlogn). 

•  Space  is  0{n). 

Interestingly,  the  subdivision  presented  here  has  a 
number  of  similarities  to  the  quality  mesh  triangulations 
of  [4,  13].  Since  there  are  no  restrictions  on  the  geomet¬ 
ric  structure  of  our  subdivision,  we  can  provide  the  space 
and  preprocessing  time  improvements  listed  above.  (In 
mesh  generation,  the  size  of  the  triangulation  depends 
on  the  geometric  structure  of  the  input.) 

3.1  Box  Decomposition 

Preprocessing  is  based  on  a  hierarchical  subdivision  sim¬ 
ilar  to  the  PM  k-d  tree  decomposition  described  in  the 
last  section,  but  in  which  a  new  operation  called  shrink¬ 
ing  may  be  applied.  The  resulting  structure  is  called  a 
box-decomposition  tree.  This  is  a  standard  decomposi¬ 
tion  procedure  for  point  sets,  which  can  be  found,  for 
example,  in  [2,  3,  5,  9,  17].  As  in  the  previous  section, 
the  subdivision  partitions  an  enclosing  square  contain¬ 
ing  the  obstacle  set  into  a  collection  of  cells,  with  each 
cell  being  bounded  by  a  constant  number  of  edges,  which 
may  be  either  segments  of  obstacle  edges  or  axis  aligned 
segments.  We  do  not  require  that  our  subdivision  be  a 
cell  complex,  since  there  may  be  vertices  lying  in  the  in¬ 
terior  of  edges;  however,  the  number  of  vertices  in  the 
interior  of  any  edge  will  be  a  constant.  Thus,  we  can  eas¬ 
ily  augment  such  a  subdivision  into  a  cell  complex  or  a 
triangulation  with  a  linear  number  of  additional  Steiner 
points. 

Each  node  of  the  box-decomposition  tree  corresponds 
to  one  of  two  types  of  objects  called  enclosures:  (1)  boxes, 
which  are  rectangles  with  the  property  that  the  ratio  of 
the  longest  to  shortest  side  is  at  most  2,  and  (2)  dough¬ 
nuts,  which  consist  of  the  set  theoretic  difference  of  two 
boxes,  an  inner  box  contained  within  an  outer  box.  The 
size  of  a  box  is  defined  to  be  the  length  of  its  longest 
side,  and  the  size  of  a  doughnut  is  the  size  of  its  outer 
box.  As  mentioned  earlier,  the  subdivision  construction 
begins  by  assuming  that  the  set  of  obstacles  are  con¬ 
tained  within  a  unit  square.  We  begin  by  constructing  a 
box-decomposition  tree  for  the  vertex  set  of  the  obstacles 
(ignoring  obstacle  edge).  For  completeness,  we  outline 
the  procedure  here. 

Inductively,  we  assume  that  the  vertex  set  to  be  de¬ 
composed  is  contained  within  a  box.  If  there  is  only  one 


vertex,  then  the  decomposition  terminates  here.  Other¬ 
wise,  as  with  k-d  trees,  we  split  the  box  into  two  identical 
boxes  by  paissing  a  horizontal  or  vertical  line  through  its 
longest  side,  provided  that  there  exists  at  least  one  vertex 
on  either  side  of  this  splitting  line.  If  so,  we  partition  the 
vertices  into  these  two  boxes,  and  recursively  subdivide 
each.  This  process  is  called  splitting. 


(b) 


1 

J  • 

I 

1 

(c) 

Figure  4:  Box  decomposition. 


Without  the  provision  above  that  there  exists  a  ver¬ 
tex  on  each  side  of  the  splitting  line,  the  splitting  process 
could  result  in'an  arbitrarily  long  series  of  trivial  splits, 
in  which  no  vertices  are  separated  from  any  other,  and 
this  in  turn  could  result  in  a  subdivision  of  size  greater 
than  0{n).  To  avoid  this,  if  there  is  not  a  vertex  on 
either  side  of  the  splitting  line,  we  perform  a  different 
decomposition  step  called  shrinking.  Shrinking  consists 
of  finding  the  smallest  k-d  tree  box  that  contains  all  the 
vertices.  By  a  k-d  box,  we  mean  any  box  that  could  be 
generated  by  some  repeated  application  of  the  k-d  split¬ 
ting  rule.  This  box  can  be  computed  in  constant  time  as¬ 
suming  a  model  of  computation  where  integer  logarithm 
and  floors  can  be  computed  in  constant  time.  (This  as¬ 
sumption  can  be  overcome  with  a  somewhat  more  flex- 
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ible  definition  of  boxes  and  shrinking.)  This  produces 
two  enclosures.  One  is  the  bounding  box  containing  the 
vertices,  and  the  other  is  the  surrounding  doughnut  that 
contains  no  vertices.  We  recursively  subdivide  the  region 
lying  within  the  bounding  box.  Note  that  after  applying 
a  shrinking  step,  the  next  step  will  be  a  splitting  step. 
See  Fig.  4(a)  for  an  illustration  of  the  decomposition. 

By  associating  enclosures  by  containment,  we  natu¬ 
rally  have  a  tree-structure,  the  box-decomposition  tree, 
where  each  internal  node  has  two  children  (either  the  left 
and  right  side  from  a  split,  or  the  doughnut  and  inner 
box  from  a  shrink).  The  leaves  of  this  tree  define  a  sub¬ 
division  of  the  plane  into  boxes  and  doughnuts.  These 
are  called  basic  enclosures. 

3.2  Smoothing 

To  handle  segment  shooting  by  walking  the  ray  through 
the  subdivision,  we  will  need  to  smooth  the  subdivision 
as  in  the  previous  section.  The  presence  of  shrinking  re¬ 
quires  that  this  be  done  more  carefully  than  in  the  pre¬ 
vious  section,  in  order  to  keep  the  size  of  the  resulting 
data  structure  from  growing  by  more  than  a  constant 
factor.  The  problem  is  that  shrinking,  by  its  very  na¬ 
ture,  introduces  neighboring  enclosures  of  significantly 
different  sizes.  Thus,  the  operations  of  shrinking  and 
smoothing  are  incompatible  in  some  sense.  We  over¬ 
come  this  by  performing  smoothing  in  two  phases;  the 
first  phase  handles  inner  boxes,  and  the  second  is  the 
standard  smoothing  process. 

The  first  phase  of  smoothing  is  to  surround  the  inner 
box  of  each  doughnut  with  a  “buffer  zone”  of  eight  iden¬ 
tical  boxes  (see  Fig.  4(b)).  If  the  inner  box  touches  the 
boundary  of  the  outer  box,  then  these  boxes  may  spill 
over  into  neighboring  enclosures.  In  the  case  of  spill-over, 
these  boxes  may  already  be  a  part  of  the  decomposition, 
and  no  further  action  is  needed.  If  spill-over  occurs,  and 
these  boxes  are  not  part  of  the  decomposition,  then  this 
neighbor  is  decomposed  by  applying  either  splitting  or 
shrinking  appropriately,  until  we  have  produced  a  legal 
box-decomposition.  This  can  be  implemented  by  imag¬ 
ining  that  each  of  these  newly  created  boxes  contains  a 
vertex  of  the  obstacle  set  at  its  center,  and  then  reap¬ 
plying  the  box-decomposition  algorithm  described  above 
(see  Fig.  4(c)). 

As  before,  we  define  a  neighbor  of  a  basic  enclosure  to 
be  any  other  basic  enclosure  such  that  the  two  share  a 
common  line  segment  on  their  boundaries.  However,  for 
the  purpose  of  the  second  phase  of  smoothing  we  make 
the  following  exception.  Any  enclosure  which  was  cre¬ 
ated  as  part  of  the  buffer  zone  for  an  inner  box  (and 
hence  is  empty)  is  not  considered  to  have  any  larger  en¬ 
closures  as  a  neighbor.  This  exception  is  made  for  a 
couple  of  technical  reasons.  The  most  important  is  that 
we  do  not  want  the  buffer  zone  around  a  small  inner 
box  to  induce  a  much  larger  surrounding  doughnut  to 


split  (or  else  we  will  undo  the  space  savings  achieved  by 
shrinking).  Henceforth,  we  assume  the  algorithm  main¬ 
tains,  for  each  enclosure,  a  list  of  pointers  to  all  larger 
neighboring  enclosures.  Note  that  there  can  be  at  most 
a  constant  number  of  such  enclosures. 

The  second  phase  of  smoothing  is  to  determine  when¬ 
ever  two  neighboring  enclosures  differ  in  size  by  more 
than  some  constant  factor  (at  least  2);  in  such  a  case, 
the  larger  enclosure  is  repeatedly  split  (never  shrunk), 
until  the  difference  in  size  falls  below  this  factor.  When 
splitting  is  applied  to  a  shrinking  node,  the  doughnut  is 
split.  Because  we  have  chosen  inner  boxes  to  be  k-d  en¬ 
closures,  it  is  not  possible  that  the  splitting  line  will  pass 
through  the  inner  box.  Thus,  after  splitting  a  doughnut, 
there  will  be  a  new  smaller  doughnut  with  the  same  inner 
box,  and  a  regular  box  (which  will  contain  no  vertices). 
If  the  doughnut  shrinks  so  far  that  the  outer  box  and 
inner  box  become  equal,  this  trivial  shrinking  node  can 
be  deleted  from  the  box-decomposition  tree. 


(b) 


Figure  5:  Smoothing:  Second  step. 

This  phase  of  the  smoothing  process  is  implemented 
in  a  bottom-up  manner,  working  from  small  enclosures 
to  larger  enclosures.  When  a  small  enclosure  identifies 
a  sufficiently  large  neighboring  enclosure,  it  splits  the 
neighbor,  creating  a  constant  number  of  new  enclosures. 
In  Fig.  5(a)  we  show  the  output  of  the  first  smoothing 
phase,  and  in  (b)  we  show  the  final  output  of  the  second 
smoothing  phase.  (Observe  that  the  shaded  enclosure  in 
(b)  did  not  induce  the  large  enclosure  to  its  left  to  be 
split,  since  the  shaded  enclosure  is  part  of  a  buffer  zone, 
and  hence  is  not  a  neighbor  of  the  larger  enclosure.) 

Lemma  3.1  The  smoothed  box- decomposition  tree  can 
be  constructed  in  O(nlogn)  time  and  0{n)  space. 
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Proof:  The  proof  that  the  initial  box-decomposition 
construction  can  be  performed  in  O(nlogn)  time  and 
0(n)  space  is  well  known  (see,  e.g.  [2]  or  [5].)  The  first 
phase  of  smoothing  is  effectively  equivalent  to  running 
a  box-decomposition  with  at  most  8n  additional  points, 
and  so  can  be  accomplished  within  the  same  time  bound. 
We  claim:  (1)  that  the  second  smoothing  phase  can  be 
performed  in  time  proportional  to  the  size  of  the  final 
decomposition,  and  (2)  that  the  final  decomposition  is 
of  size  0{n).  To  prove  (1)  it  suffices  to  observe  that  each 
splitting  operation  can  be  performed  in  0(1)  time  and 
results  in  the  creation  of  at  least  one  new  enclosure.  The 
algorithm  maintains  the  box-decomposition  tree  and,  for 
each  enclosure,  it  maintains  the  enclosure’s  size  and  a 
list  of  pointers  to  all  larger  neighboring  enclosures  (of 
which  there  can  be  only  a  constant  number).  With  this 
information,  each  splitting  operation  can  be  performed 
in  0(1)  time. 

To  establish  (2)  we  apply  a  charging  argument.  As 
noted,  the  size  of  the  box-decomposition  tree  for  a  set 
of  n  points  is  0(n).  Thus  the  number  of  nodes  in  the 
box-decomposition  tree  after  the  first  phase  is  0(n).  Let 
Ni  denote  this  set  of  nodes,  and  let  N2  denote  the  nodes 
of  the  tree  after  the  second  phase.  For  each  node  in  N2 
that  is  split  we  will  charge  this  node  to  a  node  of  Ni  in 
such  a  way  that  no  node  in  Ni  is  charged  more  than  a 
constant  number  of  times.  This  will  complete  the  proof 
by  establishing  that  the  size  of  N2  is  0{n). 
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Figure  6:  Charging  scheme. 

Let  c  be  a  node  of  N2,  and  let  s  be  the  size  of  c.  If  the 
box  which  c  represents  is  split  as  a  result  of  smoothing, 
we  claim  that  either  c  or  at  least  one  of  the  8  identical 
boxes  that  surround  c  corresponds  to  a  node  which  exists 
in  Ni-  We  prove  this  by  induction  on  the  size  of  the 
enclosures.  The  reason  that  c  split  is  either  because  c  was 
already  split  in  Ni  (in  which  case  the  claim  is  trivially 
true),  or  because  there  is  a  neighboring  enclosure  of  size 
no  greater  than  s/4.  This  neighbor  cannot  be  part  of  a 
buffer  zone,  for  otherwise  it  could  not  induce  c  to  split. 
Therefore,  the  parent  of  this  neighbor,  denoted  c',  is  an 
enclosure  of  size  at  most  s/2  that  is  also  a  neighbor  of  c. 
By  the  induction  hypothesis,  the  parent  of  c'  or  one  of 
its  8  surrounding  enclosures  of  size  s/2  corresponds  to  a 
node  in  Ni  that  was  split.  Call  this  node  c".  Observe 


that  since  c"  is  in  a  surrounding  cell  of  the  c',  c"  must 
be  contained  within  one  of  the  8  enclosures  surrounding 
c  (see  Fig  6).  We  claim  that  this  enclosure  must  be  part 
of  Ni-  This  can  only  fail  to  be  true  if  c"  is  enclosed 
directly  within  a  much  larger  shrinking  node.  But  this 
would  imply  that  c"  is  part  of  a  buffer  zone,  and  hence 
could  not  have  induced  c'  to  split. 

For  our  charging  scheme,  we  charge  each  node  in  N2 
that  is  split  to  a  node  of  equal  size  in  N^.  We  have  just 
established  the  existence  of  this  node.  Since  each  node 
in  Ni  can  be  assessed  charges  only  by  itself  or  the  nodes 
corresponding  to  one  of  its  8  surrounding  enclosures,  it 
follows  that  each  node  of  Ni  is  charged  at  most  9  times, 
and  hence  IA2I  <  9\Ni\.  (For  tighter  bounds  on  the  size 
of  smoothed  subdivisions,  see  [14].)  □ 


Lemma  3.2  After  smoothing,  the  number  of  neighbors 
of  any  basic  enclosure  is  0(1). 

Proof:  Smoothness  guarantees  that,  except  for  the  case 
of  the  enclosures  of  a  buffer  zone,  neighboring  enclosures 
are  of  roughly  equal  size.  This  implies  that  along  any 
edge  of  any  enclosure,  there  can  be  at  most  a  constant 
number  of  neighboring  enclosures.  In  the  case  of  the  en¬ 
closures  of  a  buffer  zone,  observe  that  no  matter  how 
densely  points  are  packed  into  the  inner  box,  the  prop¬ 
agation  of  splitting  can  result  in  the  outer  edges  of  the 
buffer  zone  being  split  at  most  once.  This  is  illustrated 
in  Fig.  7. 


Figure  7:  Propagation  of  smoothing. 

Due  to  the  doughnuts,  basic  enclosures  may  not  be 
convex;  indeed,  the  vertices  of  an  inner  box  of  a  dough¬ 
nut  are  reflex  vertices.  We  can  make  the  overall  subdi¬ 
vision  convex  by  extending  the  left  and  right  (vertical) 
sides  of  each  inner  box  until  they  contact  the  boundary 
of  the  outer  box.  This  subdivides  each  doughnut  into 
four  rectangles.  Observe  that  this  increases  the  stabbing 
number  of  any  directed  segment  by  at  most  a  constant 
factor,  and  so  has  no  significant  effect  on  the  complexity 
arguments  to  be  made  later. 
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3.3  Adding  Obstacle  Edges 

To  complete  the  description  of  the  decomposition  al¬ 
gorithm,  we  introduce  the  obstacle  edges.  The  prob¬ 
lem  with  simply  adding  the  obstacle  edges  to  the  box- 
decomposition  is  that  the  number  of  intersections  be¬ 
tween  obstacle  edges  and  box-decomposition  edges  may 
be  much  larger  than  0{n).  To  guarantee  that  the  num¬ 
ber  of  intersections  is  linear  in  n,  we  apply  a  simple  trim¬ 
ming  procedure,  which  trims  each  edge  of  the  smoothed 
box-decomposition  subdivision  at  its  first  and  last  inter¬ 
section  with  an  obstacle  boundary.  Let  us  assume  that 
both  the  obstacle  set  and  box-decomposition  have  been 
preprocessed  (by  standard  means — e.g.,  trapezoidiza- 
tion,  and  a  point  location  data  structure)  to  support 
horizontal  and  vertical  ray-shooting  queries  in  O(logn) 
time.  Intuitively,  one  can  imagine  each  vertex  of  the 
box-decomposition  subdivision  firing  a  bullet  along  each 
of  its  incident  horizontal  and  vertical  edges,  until  either 
reaching  the  end  of  this  edge,  or  until  hitting  an  obstacle. 
The  edges  are  then  trimmed  so  only  these  bullet-path 
subedges  remain. 


Figure  8:  The  problem  with  simple  trimming. 

Directly  applying  the  simple  trimming  procedure  de¬ 
scribed  above  can  result  in  regions  of  complexity  Q(n) 
(see  Fig.  8).  To  deal  with  this,  we  isolate  obstacle  ver¬ 
tices  within  the  subdivision.  Before  doing  any  trim- 
n^ing)  for  each  obstacle  vertex  v  we  determine  its  asso¬ 
ciated  enclosure,  c,  in  the  smoothed  box  decomposition 
(this  is  given  to  us  by  the  decomposition).  We  join  v 
to  the  boundary  of  c  by  at  most  4  vertical/horizontal 
segments,  omitting  any  such  connection  that  penetrates 
the  obstacle  containing  u;  this  results  in  at  most  4  new 
(subdivision)  vertices.  This  modification  of  the  box- 
decomposition  subdivision  can  be  performed  for  all  n 
vertices  in  total  time  0{n),  while  increasing  the  size  of 
the  subdivision  by  only  a  constant  factor. 

Now  we  apply  the  above-mentioned  trimming  proce¬ 
dure  for  each  edge  of  the  modified  box-decomposition 
subdivision.  Since  this  involves  shooting  0{n)  directed 
(horizontal/vertical)  segments,  the  entire  process  takes 
0(n  log  n)  time.  Observe  that  the  resulting  subdi¬ 
vision  has  total  size  0(ra),  since  we  have  created  at 
most  two  new  edges  for  each  of  the  0{n)  edges  in 
the  box-decomposition  subdivision.  Within  this  same 
time  bound  we  can  convert  this  collection  of  segments 


(which  includes  box-decomposition  edges  as  well  as  ob¬ 
stacle  edges)  into  a  planar  subdivision  data  structure. 
The  2-faces  of  the  resulting  subdivision,  which  are  nec¬ 
essarily  convex,  are  called  cells.  We  say  that  two 
cells  are  neighbors  if  they  share  a  common  nonobsta¬ 
cle  (box-decomposition)  edge.  An  example  of  this  is 
shown  in  Fig.  9,  where  (a)  shows  the  initial  smooth  box- 
decomposition  of  the  point  set,  and  (b)  shows  the  re¬ 
sult  of  trimming.  (In  connecting  obstacle  vertices  to  the 
sides  of  the  subdivision,  we  have  used  as  few  segments 
as  needed  to  resolve  the  reflex  angles.) 

Notice  that  the  resulting  subdivision  inherits  one 
essential  feature  of  the  smoothed  box-decomposition, 
namely,  that  each  cell  in  the  subdivision  has  at  most 
a  constant  number  of  neighbors.  Also  observe  that  each 
cell  of  the  resulting  subdivision  is  convex  and  is  bounded 
by  at  most  6  sides  (two  obstacle  edges  and  the  four  sides 
of  a  bounding  rectangle). 


Figure  9;  The  final  trimming  procedure. 
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3.4  Merging 

If  we  stop  at  this  point,  we  claim  that  the  subdivision 
constructed  so  far  has  the  property  that  any  segment 
s  stabs  as  many  cells  of  the  subdivision  as  the  simple 
cover  complexity  of  s.  However,  we  want  to  strengthen 
this  to  C-simple  cover  complexity.  This  is  generally  not 
true,  since  it  may  contain  cells  that  are  smaller  than 
they  should  be.  It  is  important  that,  for  every  point  p 
in  the  plane,  the  size  of  the  cell  containing  p  is  at  least 
a  constant  factor  of  the  radius  of  the  largest  C-simple 
ball  that  contains  p.  (This  is  the  C-simple  version  of 
Lemma  2.1.)  This  may  be  violated  because  the  place¬ 
ment  of  box-decomposition  vertices  was  made  without 
knowledge  of  the  underlying  structure  of  obstacle  edges. 
An  example  is  shown  in  Fig.  10(a).  A  large  number  of 
small  subdivision  cells  lie  to  the  right  of  edge  e  in  this 
figure.  The  existence  of  these  cells  is  justified  from  the 
vertices  lying  to  the  left  of  e.  However,  a  ray  lying  just 
to  the  right  of  e  and  parallel  to  e  will  have  a  very  low  C- 
simple  cover  complexity  because  these  other  vertices  are 
not  reachable  from  the  ray,  and  hence  we  cannot  afford 
to  traverse  these  small  cells. 


We  apply  the  following  merging  process  to  eliminate 
these  unnecessary  regions.  We  traverse  the  enclosures 
of  the  box- decomposition  subdivision,  in  a  bottom-up 
fashion  (starting  at  the  bottom  of  the  box-decomposition 
tree)  visiting  enclosures  in  increasing  order  of  size.  Con¬ 
sider  a  pair  of  neighboring  cells  that  were  siblings  in  the 
decomposition  process  (in  the  sense  that  they  were  once 
part  of  a  parent  node  that  was  split  or  shrunken).  If 
their  union  satisfies  the  following  properties: 

(1)  it  has  at  most  two  obstacle  edges  on  its  boundary, 

(2)  it  does  not  have  more  than  a  constant  number  c 
neighboring  cells  (where  c  >  8),  and 

(3)  it  is  convex, 

then  merge  these  two  cells  into  one  cell  (effectively  undo¬ 
ing  the  decomposition  performed  by  the  decomposition 
algorithm).  Repeat  this  until  no  sibling  cells  remain  that 
can  be  merged.  After  this  merging  process  has  been  fin¬ 
ished,  observe  that  cells  still  satisfy  the  properties  of 


bounded  complexity,  bounded  number  of  neighbors,  and 
convexity.  The  outcome  of  this  merging  procedure  is 
shown  in  Fig.  10(b). 

Summarizing  the  discussion  of  this  section,  we  have 
the  following  result. 

Lemma  3.3  Given  a  collection  of  polygonal  obstacles  P 
in  the  plane  with  n  total  vertices,  the  size  of  the  final  sub¬ 
division  is  0{n),  and  the  subdivision  can  be  constructed 
in  0(n  log  n)  time.  The  cells  of  this  subdivision  satisfy 
properties  (l)-(3)  above. 

At  this  point  we  can  present  the  result  analogous  to 
Lemma  2.1.  Recall  that  in  order  to  make  cells  convex, 
we  introduced  new  edges  incident  to  the  vertices  of  in¬ 
ner  boxes  (in  Section  3.2)  and  to  obstacle  vertices  (in 
Section  3.3).  The  effect  of  adding  these  edges  was  to 
subdivide  each  enclosure  of  the  smoothed  box  decompo¬ 
sition  into  a  constant  number  of  convex  subcells.  Since 
each  enclosure  of  the  box  decomposition  can  contain  at 
most  a  constant  number  of  nonrefiex  vertices  (as  a  con¬ 
sequence  of  box  decomposition),  the  creation  of  these 
subcells  does  not  affect  the  number  of  cells  traversed  in 
query  processing  by  more  than  a  constant  factor.  Thus, 
we  can  define  the  size  of  such  subcells  to  be  the  size  of 
original  enclosures. 

Lemma  3.4  For  every  point  p  in  the  plane,  the  size  of 
the  cell  containing  p  in  the  subdivision  is  at  least  a  con¬ 
stant  factor  times  the  radius  of  the  largest  C-simple  ball 
that  contains  p. 

Proof:  Let  r  denote  the  radius  of  the  largest  C-simple 
ball  that  contains  p.  Let  s  be  the  size  of  the  cell  c  con¬ 
taining  p.  Let  c'  be  the  sibling  of  c  in  the  subdivision  (or 
in  general,  the  next  neighboring  cell  that  would  be  con¬ 
sidered  for  merging  with  c).  The  cells  c  and  c'  were  not 
merged  because  of  a  violation  of  one  of  the  conditions 
(1),  (2),  or  (3)  above. 

In  the  case  of  (1),  the  merger  of  c  and  c'  would  result  in 
a  cell  having  3  or  more  reachable  vertices  within  distance 
s2\/2.  Thus  the  ball  of  radius  s2\/2  centered  at  p  is 
not  C-simple.  This  implies  that  any  C-simple  ball  that 
covers  p  can  be  of  radius  at  most  s2-v/2,  and  hence  s  > 

r/{2V2). 

In  the  case  of  (2),  the  proof  is  by  induction  on  the  size 
of  cells.  If  c'  was  separated  from  c  by  splitting  (as  op¬ 
posed  to  shrinking),  then  c  and  c'  are  of  equal  size.  The 
fact  that  condition  (2)  is  violated  implies  that  there  is 
a  reachable  cell  c"  neighboring  either  c  or  c"  that  is  no 
larger  than  s/2  (since  otherwise  there  could  be  at  most 
8  neighbors  of  their  union).  Applying  the  induction  hy¬ 
pothesis  to  the  closest  point  p'  to  p  in  c" ,  implies  that 
the  largest  C-simple  ball  containing  p'  is  of  size  as/2,  for 
some  constant  a.  Note  that  p'  cannot  be  further  than 
s2y/2  from  p,  and  so  any  C-simple  ball  that  covers  p 
can  be  of  radius  at  most  s(2a/2  +  a/ 2).  Choosing  any 
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a  >  4\/5  will  satisfy  the  hypothesis.  If  c  and  c'  were  sep¬ 
arated  by  shrinking,  then  one  of  these  cells  is  a  doughnut 
and  the  other  is  an  inner  box.  If  c  is  the  doughnut,  then 
c'  is  smaller  than  c,  and  the  same  induction  argument 
can  be  applied.  We  claim  that  c  cannot  be  the  inner  box, 
since  any  neighbor  of  the  neighbors  of  an  inner  box  are  a 
subset  of  the  neighbors  of  the  doughnut.  Thus,  merging 
c  and  c'  could  not  violate  condition  (2). 

Finally,  in  the  case  of  condition  (3),  the  edge  separat¬ 
ing  c  and  c'  has  been  added  simply  to  resolve  a  reflex 
vertex.  We  can  simply  apply  the  arguments  given  in 
cases  (1)  or  (2)  to  the  larger  enclosure  of  size  s  in  the 
box  decomposition  that  contains  this  vertex.  □ 

The  analogue  to  Lemma  2.2  follows  from  exactly  the 
same  argument  as  before. 

Lemma  3.5  Given  a  C-simple  ball  b  of  radius  r,  the 
number  of  cells  that  intersect  any  connected  component 
of  b  in  the  subdivision  is  bounded  above  by  a  constant. 

3.5  Query  Processing 

As  we  mentioned  earlier,  query  processing  involves  two 
steps.  Given  a  directed  segment  pq,  we  first  apply  any 
point  location  algorithm  to  determine  the  location  of  the 
starting  point  p  in  O(logn)  time.  Once  the  cell  contain¬ 
ing  p  has  been  located,  the  algorithm  simply  walks  from 
one  cell  to  the  next.  This  can  be  done  in  time  propor¬ 
tional  to  the  number  of  cells  traversed,  since  each  cell 
is  bounded  by  a  constant  number  of  edges  (at  most  two 
obstacle  edges  and  at  most  four  horizontal  and  vertical 
edges  of  the  associated  (trimmed)  enclosure),  and  each 
cell  has  a  constant  number  of  neighbors. 

To  analyze  the  running  time  of  query  processing  it 
suffices  to  count  the  number  of  cells  visited.  Let  scc(s) 
denote  the  C-simple  cover  complexity  of  a  segment  s  (or 
actually  the  complexity  of  the  obstacle  free  initial  sub- 
segment  of  s),  and  let  S  be  a  strongly  C-simple  cover 
for  s.  By  Lemma  3.5,  the  number  of  subdivision  cells  in¬ 
tersecting  each  ball  in  if  is  a  constant,  and  hence,  since 
these  balls  cover  s,  the  number  of  subdivision  cells  tra¬ 
versed  by  s  is  0(|B|)  =  0(scc(s)). 

Theorem  3.1  Let  P  be  a  collection  of  polygonal  obsta¬ 
cles  in  the  plane  (satisfying  the  bounded  incidence  as¬ 
sumption).  Let  n  denote  the  number  of  vertices  in  P. 
In  time  O(nlogn)  a  data  structure  of  size  0{n)  can  be 
constructed,  from  which  segment  shooting  queries  can  be 
answered  in  0{logn  scc{s))  time. 
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Abstract 

Generalized  intersection  searching  problems  are  a 
class  of  geometric  query-retrieval  problems  where 
the  questions  of  interest  concern  the  intersection 
of  a  query  object  with  aggregates  of  geometric  ob¬ 
jects  (rather  than  with  individual  objects.)  This 
class  contains,  as  a  special  case,  the  well-studied 
class  of  standard  intersection  searching  problems 
and  is  rich  in  applications.  Unfortunately,  the  solu¬ 
tions  known  for  the  standard  problems  do  not  yield 
efficient  solutions  to  the  generalized  problems.  Re¬ 
cently,  efficient  solutions  have  been  given  for  gen¬ 
eralized  problems  where  the  input  and  query  ob¬ 
jects  are  iso-oriented  (i.e.,  axes-paraUel)  or  where 
the  aggregates  satisfy  additional  properties  (e.g., 
connectedness).  In  this  paper,  efficient  algorithms 
are  given  for  several  generalized  problems  involv¬ 
ing  non-iso-oriented  objects.  These  problems  in¬ 
clude:  generalized  halfspace  range  searching,  seg¬ 
ment  intersection  searching,  triangle  stabbing,  and 
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triangle  range  searching.  The  techniques  used  in¬ 
clude:  computing  suitable  sparse  representations  of 
the  input,  persistent  data  structures,  and  filtering 
search. 

Keywords:  Computational  geometry,  data  struc¬ 
tures,  filtering  search,  geometric  duality,  intersec¬ 
tion  searching,  persistence. 

1  Introduction 

Consider  the  following  generic  searching  problem: 
Suppose  that  we  are  given  a  set  5  of  n  geomet¬ 
ric  objects  in  Moreover  assume  that  the  ob¬ 
jects  come  aggregated  in  disjoint  groups,  where  the 
grouping  is  dictated  by  the  underlying  application. 
(The  number  of  groups  can  range  from  1  to  0(n).) 
Our  goal  is  to  preprocess  S  into  a  data  structure 
so  that  given  any  query  object  q,  we  can  report 
or  count  efficiently  the  groups  that  are  intersected 
by  q.  (We  say  that  q  intersects  a  group  iff  q  in¬ 
tersects  some  object  in  the  group.)  Notice  that  we 
are  not  interested  in  reporting  or  counting  th^in- 
dividual  objects  intersected  by  q  as  is  the  case  in 
a  standard  intersection  searching  problem.  Indeed 
the  standard  problem  is  a  special  case  of  the  above 
formulation,  where  each  group  has  cardinality  1. 
For  this  reason,  we  call  our  version  a  generalized 
intersection  searching  problem. 

For  our  purposes,  it  wiU  be  convenient  to  asso¬ 
ciate  with  each  group  a  different  color  and  imagine 
that  aU  the  objects  in  the  group  have  that  color. 
Suppose  that  q  intersects  i  groups.  Then,  we  can 
restate  our  problem  as:  ^^Preprocess  a  set  S  of  n 
colored  geometric  objects  so  that  for  any  query  ob¬ 
ject  q,  the  i  distinct  colors  of  the  objects  that  are 
intersected  by  q  can  be  reported  or  counted  effi- 


ciently.^^  This  is  the  version  that  we  will  consider 
throughout  the  paper. 

Before  going  further,  let  us  illustrate  the  useful¬ 
ness  of  our  generalized  formulation  with  some  ap¬ 
plications;  (1)  In  designing  a  VLSI  chip,  the  wires 
(line  segments)  can  be  grouped  naturally  accord¬ 
ing  to  the  circuits  they  belong  to.  A  problem  of 
interest  to  the  designer  is  determining  which  cir¬ 
cuits  (rather  than  wires)  become  connected  when 
a  new  wire  is  to  be  added.  This  is  an  instance 
of  the  generalized  segment  intersection  searching 
problem.  (2)  Consider  a  collection  of  supply  points 
(e.g.,  warehouses)  of  different  types  in  Ti? .  We 
would  like  to  preprocess  these  points  so  that  given 
a  demand  point  q  and  a  radius  r,  we  can  determine 
the  types  of  supply  points  that  are  within  distance 
r  from  q.  By  using  a  well-known  “lifting”  trans¬ 
formation,  this  problem  can  be  transformed  to  an 
instance  of  generalized  halfspace  range  searching  in 
7^3. 

1.1  Potential  approaches 

One  approach  to  solving  a  generalized  problem  is 
to  take  advantage  of  known  solutions  for  the  corre¬ 
sponding  standard  problem.  For  example,  to  solve 
a  generalized  reporting  problem,  we  can  determine 
aU  the  objects  intersected  by  q  (a  standard  prob¬ 
lem)  and  then  read  off  the  distinct  colors  among 
these.  However,  with  this  approach  the  query  time 
can  be  very  high  since  q  could  intersect  f2(n)  ob¬ 
jects  but  only  0(1)  distinct  colors.  Thus,  the  chal¬ 
lenge  in  the  generalized  reporting  problem  is  to 
attain  a  query  time  that  is  sensitive  to  the  out¬ 
put  size  i,  typically  of  the  form  0{f{n)  -|-  i)  or 
0(/(n)  +  Tpolylog(n)),  where  /(n)  is  “small”  (e.g., 
polylog  (n)  or  n^,  where  0  <p  <  1).  For  a  general¬ 
ized  counting  problem,  it  is  not  even  clear  how  one 
can  use  the  solution  for  the  corresponding  standard 
problem  (a  mere  connt)  to  determine  how  many 
distinct  colors  are  intersected.  Nevertheless,  we 
seek  here  query  times  of  the  form  0(/(n)).  Of 
course,  in  both  cases,  we  seek  solutions  that  are 
also  as  space-efficient  as  possible. 

1.2  Previous  work 

While  the  standard  problems  have  been  investi¬ 
gated  extensively,  their  generalized  counterparts 


have  been  less  studied.  The  generalized  problems 
were  first  considered  in  [JL93],  where  efficient  so¬ 
lutions  were  given  for  several  problems  defined  on 
iso-oriented  objects  (i.e.,  the  input  and  the  query 
objects  are  axes-parallel).  In  [GJS93a],  efficient  so¬ 
lutions  were  given  for  the  counting,  reporting,  and 
dynamic  versions  of  several  iso-oriented  problems. 
In  [GJS93b],  solutions  were  given  for  generalized 
problems  involving  circular  and  circle-like  objects 
(among  other  results).  In  [AvK93],  Agarwal  and 
van  Kreveld  consider  the  problem  of  reporting  the 
intersections  of  a  query  line  segment  with  color 
classes  consisting  of  line  segments  and  satisfying 
the  property  that  each  color  class  is  a  simple  poly¬ 
gon  or  a  connected  component. 

1.3  Summary  of  results  and  techniques 

In  this  paper,  we  present  efficient  solntions  to 
several  generalized  intersection  searching  prob¬ 
lems  that  are  defined  on  non-iso-oriented  objects. 
Specifically,  we  consider  the  following  problems: 
generalized  halfspace  range  searching  in  TZ'^,  for 
any  fixed  d  >  2,  generalized  segment  intersection 
searching,  triangle  stabbing,  and  triangle  range 
searching  in  'R?.  Our  main  results  are  summa¬ 
rized  in  Table  1.  No  results  were  known  pre¬ 
viously  for  any  of  these  problems,  with  the  ex¬ 
ception  of  generalized  segment  intersection  search¬ 
ing:  For  this  problem,  the  following  results  were 
known:  (a)  0(n^+')  space  and  -f  i) 

query  time  for  color  classes  consisting  of  simple 
polygons  or  connected  components  [AvK93];  (b) 
0{{n  x)^logn)  (resp.  0((n  +  x)^))  space  and 
0(logn  -f  i)  (resp.  0(log^n  -f  i))  query  time  for 
general  color  classes  [JL93];  and  (c)  0{n^+^)  space 
and  0{{i  +  l)-y/nlog'^^^)  n)  query  time  for  general 
color  classes  [AvK93].  (x  is  the  number  of  pairwise- 
intersecting  segments.) 

Our  results  are  based  on  a  combination  of  sev¬ 
eral  techniques:  (1)  Computing  for  each  color  class 
a  sparse  representation  which,  captures  essential  in¬ 
formation  about  the  color  class  and  allows  us  to 
reduce  the  generalized  problem  at  hand  to  a  stan¬ 
dard  problem.  (2)  The  persistence-addition  tech¬ 
nique  of  Driscoll  et  al.  [DSST89],  which  allows  us 
to  reduce  a  generalized  problem  to  a  generahzed 
dynamic  problem  one  dimension  lower.  (3)  A  ver¬ 
sion  of  filtering  search  which,  in  combination  with 
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0 

Input  objects 

Query  object 

Space 

Query  time 

1 

Points 
in  Te*' 

Halfspace 
in  72*^ 

d  =  2  n  log  n 

log^  n  -i-  i 

n  log"*  n 

d=3 

nlogn 

ni/2+f  +  t 

log^  n  -1- 1 
„2/3+^ 

d>2 

log  n  -1-  i  log"^  n 

2 

Line  segs. 
in  'R? 

Halfplane 

nlogn 

iog^TT+l 

ni/2 

Vertical  ray 

nQ'(n)logn 

log''  n  -1-  i 

(na(n))i/2 

3 

Line  segs.  of 
length  >  a 
constant  in 
unit  square 

Line 

nlogn 

log^  n  -1- 1 

4 

Lines 
in  72.^ 

Vertical  line 
segment 

^z.5-M}ogn 
n^  log  n 

n^  -1-  i 
log  n  -f  1 

5 

Line  segs. 
in  72^ 

Vertical  line 
segment 

(n  +  x)log« 

log  n  -1-  i 

Arbitrary  line 
segment 

log^  n  d-  i  log  n 

6 

Triangles 

Point 

n^l^  logn 

log^  n  -t-  i 

Fat-Wedges 

nlogn 

log'  n  -t-  i 

7 

Points 

Fat-Triangle 

n  log^  n 

log"*  n  -f-  i  log'  n 

Table  1;  Summary  of  main  results  for  generalized  intersection  searching  problems;  additional  results 
can  be  found  in  the  text.  Wherever  the  output  size,  i,  is  missing  in  a  query  time  bound,  it  is  a 
counting  problem.  A  fat-wedge  or  fat-triangle  is  one  where  each  interior  angle  is  greater  than  or 

equal  to  a  fixed  constant.  Here: 
n  input  size 

i  output  size  (number  of  distinct  colors  intersected) 

€  arbitrarily  small  constant  >  0 

^  tunable  parameter,  0.5  <  /i  <  1 

X  number  of  pairwise-intersecting  segments,  0  <  x  <  (2) 

a(n)  slow-growing  inverse  of  Ackermann’s  function 


persistence,  yields  a  space-query  time  tradeoff. 
Moreover,  when  the  input  objects  or  query  objects 
satisfy  certain  reasonable  conditions  (e.g.,  fatness), 
then  we  use  further  ideas  to  obtain  very  efficient  so¬ 
lutions.  Due  to  space  limitations,  we  will  describe 
only  a  subset  of  our  results  here  and  will  omit  all 
proofs  and  many  details.  The  full  paper  is  avail¬ 
able  as  [GJS93c]. 


2  Generalized  halfspace  range 
searching  in 

Let  5  be  a  set  of  n  colored  points  in  TZ'^,  for  any 
fixed  d>2.  We  show  how  to  preprocess  S  so  that 
for  any  query  hyperplane  Q,  the  i  distinct  colors 
of  the  points  lying  in  the  halfspace  Q~  (i.e.,  be¬ 
low  Q)  can  be  reported  or  counted  efficiently.  We 
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first  give  solutions  for  Ti?  and  TZ^.  Let  T  denote 
the  well-known  point-hyperplane  duality  transform 
[Ede87].  Using  T  we  map  5  to  a  set  S'  of  hyper¬ 
planes  (lines  in  V?  and  planes  in  V?')  and  map  Q 
to  the  point  q  =  T{Q').  Our  problem  is  now  equiv¬ 
alent  to:  “Report  or  count  the  i  distinct  colors  of 
the  hyperplanes  lying  on  or  above  i.e.,  the  hy¬ 
perplanes  that  are  intersected  by  the  vertical  ray  r 
emanating  upwards  from  g.” 

Let  Sc  be  the  set  of  hyperplanes  of  color  c.  For 
each  color  c,  we  compute  the  upper  envelope  Ec  of 
the  hyperplanes  in  Sc-  In  TZ^,  Ec  is  an  unbounded 
convex  chain,  and  in  TZ^,  Ec  is  an  unbounded  con¬ 
vex  polytope  whose  facets  are  convex  polygons.  It 
is  clear  that  (i)  r  intersects  a  c-colbred  hyperplane 
iff  r  intersects  Ec  and,  moreover,  (ii)  if  r  intersects 
Ec,  then  r  intersects  the  interior  of  a  unique  facet 
of  Ec-  (For  this  version  of  the  paper,  we  assume 
that  r  does  not  intersect  two  or  more  facets  of  Ec 
at  a  common  boundary;  in  the  fuU  paper  [GJS93c], 
we  show  how  to  remove  this  assumption.)  Let  S 
be  the  collection  of  the  envelopes  of  the  different 
colors.  By  the  above  discussion,  our  problem  is 
equivalent  to:  “Report  or  count  the  facets  of  S 
that  are  intersected  by  r”,  which  is  a  standard  in¬ 
tersection  searching  problem!  Since  we  use  differ¬ 
ent  techniques  to  solve  this  problem  in  E?  and  in 
we  wiU  discuss  each  case  separately. 

2.1  Solving  the  ray— envelope  intersec¬ 
tion  problem  in 

We  store  the  ^-projections  of  the  line  segments 
of  5  in  a  segment  tree  T.  Let  v  be  any  node  of 
T.  Associated  with  v  is  an  x-interval  I{v).  Let 
Strip(v)  be  the  vertical  strip  defined  by  I{v).  We 
say  that  a  segment  s  G  £”  is  allocated  to  a  node 
u  G  T  iff  I(v)  ^  0  and  s  crosses  Strip(v)  but 
not  Strip (parent(v)).  Let  £(v)  be  the  set  of  seg¬ 
ments  allocated  to  v.  Within  Strip(v),  the  seg¬ 
ments  of  S{v)  can  be  viewed  as  lines  since  they 
cross  Strip(v)  completely.  Let  £'(v)  be  the  set  of 
points  dual  to  these  lines.  We  store  £'{v)  in  an 
instance  H(v)  of  the  standard  halfplane  reporting 
(resp.  counting)  structure  for  E^  given  in  [CGL85] 
(resp.  [Mat92b]).  This  structure  has  size  0(m) 
and  a  query  time  of  O(logm-|-A;„)  (resp. 
where  m  =  |^(u)|  and  ky  is  the  output  size  at  v. 

To  answer  a  query,  we  search  in  T  using  g’s  x- 


coordinate.  At  each  node  v  visited,  we  need  to 
report  or  count  the  lines  intersected  by  r.  But,  by 
duality,  this  is  equivalent  to  answering,  in  E^,  a 
halfplane  query  at  v  using  the  query  T{q)~  =  Q~ , 
which  we  do  using  H[v). 

Theorem  2.1  A  set  S  of  n  colored  points  in  E^ 
can  be  stored  in  a  data  structure  of  size  0{nlogn) 
so  that  the  i  distinct  colors  of  the  points  lying  in 
any  query  halfplane  can  be  reported  (resp.  counted) 
in  time  O(log^n-f  i)  (resp.  □ 

Using  a  similar  approach,  we  can  also  solve  Prob¬ 
lem  2  in  Table  1. 

2.2  Solving  the  ray-envelope  intersec¬ 
tion  problem  in 

We  triangulate  the  facets  of  Ec  for  aU  colors  c.  For 
any  triangle  t,  let  h{t)  be  its  supporting  plane.  Let 
t'  and  q'  be  the  projections  of  t  and  q  (the  origin 
of  ray  r)  on  the  xj/-plane  respectively.  Clearly,  t  is 
intersected  by  r  iff  (a)  P’s  interior  contains  q'  and 
(b)  h(t)  is  on  or  above  q. 

Wlog  assume  that  t'  has  a  vertical  side;  other¬ 
wise  decompose  it  into  two  such  triangles.  To  find 
the  triangles  satisfying  condition  (a),  we  store  each 
t'  in  a  segment  tree  T  according  to  its  x-span.  Let 
V  be  any  node  of  T  and  let  A{v)  be  the  set  of  tri¬ 
angles  allocated  to  v.  Let  m  =  |A(u)|.  Note  that  if 
t'  G  A{v)  then  both  its  non-vertical  sides,  caU  the 
upper  one  t'^  and  the  lower  one  cross  Strip{v).  If 
q'  G  Strip{v),  then  q'  is  in  P’s  interior  iff  q'  is  above 
t'l  and  below  t'^.  Since  t'j  and  behave  Hke  fines 
within  Strip{v),  by  duality  we  have  that  q'  G  P  iff 
in  E^  one  endpoint  of  T(t'f)T(t'y)  lies  in  the  open 
halfplane  J-{q')~  and  the  other  lies  in  the  open  half¬ 
plane  jr(g')+.  Next,  consider  condition  (b).  By 
duality,  h{t)  is  on  or  above  q  iff  in  E^  the  point 
E{h{t))  is  in  the  halfspace  T{q)~. 

So,  our  problem  at  v  is  to  report  or  count  the 
iy  triangles  of  A{v)  that  satisfy  the  above  half¬ 
plane  and  halfspace  queries.  We  can  do  this  by 
augmenting  v  with  a  3-level  data  structure  V{v), 
using  partition  trees  [Mat92a]  in  the  outer  two  lev¬ 
els  of  V{v)  and  the  data  structure  of  [AHL90]  for 
halfspace  range  reporting  at  the  innermost  level. 
For  the  counting  problem,  we  use  partition  trees 
at  aU  three  levels.  V{v)  can  be  built  in  space 
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0(m log  m)  (resp.  0(m))  so  that  all  the  desired 
triangles  can  be  reported  (resp.  counted)  in  time 
+  it,)  (resp.  For  the  re¬ 

porting  problem,  instead  of  partition  trees,  we  can 
also  use  2-dimensional  cutting  trees  [Mat91a]  for 
the  two  outer  levels  of  V{v).  Then  'D(v)  has  size 
0(m^+')  and  query  time  0(logm  -f  i„).  Plugging 
these  bounds  into  the  segment  tree,  we  get: 

Theorem  2.2  The  reporting  version  of  the  gener¬ 
alized  halfspace  range  searching  problem  for  a  set  of 
n  colored  points  in  can  be  solved  in  O(nlog^n) 
(resp.  space  and  +  i)  (resp. 

0(log^n  -f  i)^  query  time,  where  i  is  the  output 
size  and  c  >  0  is  an  arbitrarily  small  constant.  The 
counting  version  is  solvable  in  O(nlogn)  space  and 
query  time.  □ 

2.3  Generalized  halfspace  range  report¬ 
ing  in  d  >  2  dimensions 

The  preceding  approach  extends  to  d  >  3,  but 
the  space  bound  is  high,  namely,  We 

now  give  an  approach  for  the  reporting  problem  in 
TZ'^  (d  >  2)  that  needs  much  less  space.  In  pre¬ 
processing,  we  store  the  distinct  colors  in  the  input 
point-set  5  at  the  leaves  of  a  balanced  binary  tree 
CT  (in  no  particular  order).  For  any  node  v  of 
CT,  let  C{v)  be  the  colors  stored  in  the  leaves  of 
n’s  subtree  and  let  S{v)  be  the  points  of  S  colored 
with  the  colors  in  C(v).  At  v,  we  store  a  data 
structure  HSE(v)  to  solve  the  halfspace  emptiness 
problem  on  S{v),  i.e.,  “does  a  query  halfspace  con¬ 
tain  any  points  of  S{v)V'  HSE{v)  returns  “true” 
iff  the  query  halfspace  is  empty.  If  |5v|  =  Uv,  then 
nSE{v)  uses  space  and  has  query  time 

O(logn„)  [Mul93,  page  290]. 

To  answer  a  generalized  halfspace  reporting 
query  we  do  a  depth-first  search  in  CT  and  query 
HSE{v)  at  each  node  v  visited.  If  u  is  a  non-leaf 
then  we  continue  searching  below  v  iff  the  query 
returns  “false”;  if  u  is  a  leaf,  then  we  output  the 
color  stored  there  iff  the  query  returns  “false”. 

Theorem  2.3  For  any  fixed  d  >  2,  a  set  S  of  n 
colored  points  in  TZ^  can  be  stored  in  a  structure  of 
size  such  that  the  i  distinct  colors  of 

the  points  contained  in  a  query  half  space  Q~  can 
be  reported  in  time  O(logn  -f-  ilog^n).  Here  e  >  0 
is  an  arbitrarily  small  constant.  □ 


We  note  that  the  reporting  (resp.  counting)  prob¬ 
lem  can  also  be  solved  in  0(n'^+^)  space  and 
0(logn  -f  i)  (resp.  O(logn))  time,  using  constant- 
depth  cutting  trees. 

3  Generalized  intersection  re¬ 
porting  on  lines  and  line  seg¬ 
ments 

We  consider  several  versions  of  the  generalized  in¬ 
tersection  searching  problem  for  a  set  S  oin  colored 
lines  or  line  segments;  the  query  g  is  a  line  or  a  line 
segment. 

3.1  Querying  colored  line  segments  with 
a  line 

We  dualize  the  colored  line  segments  of  5  to  a  set 
S'  of  colored  doublewedges  and  map  the  query  hne 
g  to  a  point  q' .  Thus,  our  problem  reduces  to  re¬ 
porting  the  i  distinct  colors  that  are  stabbed  by  q' . 
In  Section  4,  we  solve  this  problem  in  O(n^/^logn) 
space  with  a  query  time  of  0(log^  n  -j-  i).  In  the 
rest  of  this  section,  we  consider  the  case  where  the 
segments  of  S  aU  lie  in  the  unit  square  U  and  each 
segment  has  length  at  least  A,  where  A  >  0  is  a  con¬ 
stant.  These  assumptions  are  reasonable  for  prac¬ 
tical  applications  and  they  allow  a  very  efficient 
solution. 

For  now  assume  that  aU  the  segments  intersect 
the  y-axis  Y.  Thus,  one  endpoint  of  s  has  neg¬ 
ative  x-coordinate  and  the  other-  has  positive  x- 
coordinate.  Thus  in  the  corresponding  dual  dou¬ 
blewedge  one  of  the  bounding  lines  has  positive 
slope  and  the  other  has  negative  slope.  We  split 
each  doublewedge  into  a  left-wedge  and  a  right- 
wedge.  Note  that  each  wedge  is  y-monotone.  Con¬ 
sider  the  right- wedges.  Because  of  y- monotonicity, 
q'  is  contained  in  a  right-wedge  w  iff  the  horizon¬ 
tal,  leftward-directed  ray  r  emanating  from  g'  in¬ 
tersects  the  boundary  of  w.  Thus  for  each  color  c, 
we  compute  the  left-envelope  of  the  boundaries  of 
all  c-colored  right  wedges.  If  there  are  Uc  c-colored 
right  wedges,  then  the  c-colored  left-envelope  has 
size  0{nc)  (see  [Ede87,  page  357,  Problem  15.6]). 
In  this  way,  we  obtain  a  collection  S"  of  colored  line 
segments.  Note  that  (i)  r  intersects  the  boundary 
of  a  c-colored  right-wedge  iff  r  intersects  a  c-colored 
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left-envelope  and  (ii)  if  r  intersects  a  c-colored  left- 
envelope  then  it  intersects  a  unique  line  segment 
of  this  envelope.  Thus  we  have  a  standard  prob¬ 
lem  which  we  can  solve  in  O(nlogn)  space  and 
O(log^  n  -\-  i)  query  time  by  using  a  segment  tree 
as  in  Section  2.1. 

What  if  the  segments  of  S  do  not  all  intersect  Yl 
Suppose  that  there  is  a  constant  K  such  that  each 
segment  intersects  one  of  K  fixed  lines  Yi, . .  .,Yk- 
Let  Si  C  5  be  the  set  of  segments  intersecting  Y'. 
(If  a  segment  intersects  more  than  one  Y,  we  put 
it  in  any  one  of  the  5,  ’s;  thus  the  5,’s  partition  S.) 
For  1  <  i  <  Ff,  we  create  a  coordinate  system  Ci, 
where  Y  is  the  y-axis  and  any  line  perpendicular 
to  Y  is  taken  as  the  i-axis.  We  give  the  segments 
of  Si  coordinates  in  Ci  and  store  them  in  the  data 
structure  described  above.  To  answer  a  query,  we 
query  each  of  the  K  structures  separately.  The 
space  and  query  time  are  as  above. 

We  can  now  solve  the  problem  stated  at  the 
beginning.  Wlog  assume  that  the  origin  is  at 
the  bottom-left  corner  of  Li.  Consider  the  K  = 
2  -|-  2|’\/2/A]  lines  x  =  i  •  A/v/2  and  y  =  i  •  X/V^, 
where  0  <  t  <  \\/2jX\.  Since  each  segment  has 
length  at  least  A,  either  its  x-span  or  its  y-span  is 
at  least  A/\/2.  Thus  each  segment  intersects  one 
of  the  K  lines.  We  now  use  the  above  structure. 

Theorem  3.1  Let  X>  Q  be  a  constant  and  letU  be 
the  unit  square.  A  set  S  of  n  colored  line  segments 
in  TZ^ ,  where  each  segment  has  length  at  least  X  and 
all  segments  lie  in  U,  can  be  stored  in  a  structure 
of  size  O(Tilogn)  such  that  the  i  distinct  colors  of 
the  segments  that  are  intersected  by  a  query  line  q 
can  be  reported  in  time  0(log^  n  -|-  i).  □ 

3.2  Querying  colored  lines  with  a  verti¬ 
cal  line  segment 

We  give  a  simple  approach  based  on  persistence 
[DSST89],  and  then  show  how  to  incorporate  fil¬ 
tering  search  to  get  a  space-query  time  tradeoff. 

We  divide  the  plane  into  t  —  O(n^)  strips 
by  drawing  vertical  lines  through  the  intersection 
points  of  the  n  lines.  Within  any  strip,  Yt,  the 
lines  are  totally  ordered  from  top  to  bottom,  as 
Ek  :  ^1,^2, . .  .,£n-  Suppose  that  the  vertical  query 
segment  q  is  in  Yb  and  let  £a  and  £b  be  the  highest 
and  lowest  lines  of  Ek  intersected  by  q.  Our  prob¬ 


lem  is  now  equivalent  to  the  following  generalized 
1-dimensional  range  searching  problem:  Given  col¬ 
ored  integers  1,2,  ...,n  (where  integer  j  gets  the 
color  of  £j),  report  the  distinct  colors  in  the  query 
interval  [a,  6].  In  [GJS93a],  this  problem  is  solved 
using  a  structure  Dk  of  size  0(n)  and  a  query  time 
O(logn-f  z).  Dk  supports  updates  in  O(logn)  time 
with  O(logn)  memory  modifications.  By  sweeping 
over  the  strips  Vk,  1  <  k  <  t,  and  making  the  asso¬ 
ciated  Ujt’s  partially  persistent,  we  get  a  solution 
with  space  O(n^logn)  and  query  time  C>(log 7i-|-z). 

In  a  nutshell,  the  idea  for  the  space-query-time 
tradeoff  is  as  follows:  We  extract  from  the  se¬ 
quence  E  =  (Fli, . . . ,  Flt+i)  a  smaller  subsequence 
E'  -  {E{, . . . ,  E'^)  such  that  (i)  Ej  and  differ 
in  just  two  (not  necessarily  adjacent)  positions,  (ii) 
for  each  Ek  6  E  there  is  an  Ej  G  E'  which  “approx¬ 
imates”  Ei  in  a  sense  that  we  wiU  elaborate  upon 
later,  and  (hi)  m  =  0(n^-®“^),  where  0.5  <  /i  <  1 
is  a  tunable  parameter.  Properties  (i)  and  (iii)  sug¬ 
gest  that  we  can  apply  persistence  to  E'  and  get  a 
scheme  with  space  bound  o(n^);  property  (ii)  sug¬ 
gests  that  instead  of  querying  Ek,  as  we  might  in 
the  simple  scheme,  we  can  query  Ej  in  the  new 
scheme  and  stiU  be  assured  of  correctness. 

Formally,  we  define  a  sequence  b\,b2, . .  .,hB  of 
distinguished  list  positions  called  borders,  where 
bi  =  {i  -  l)[n'^  4-  IJ  +  1  and  .B  =  ©(n^"^)  . 
We  construct  E'  by  scanning  E.  Let  Ei  be  the 
currently  scanned  list  of  E.  Let  E'j  be  the  most 
recently  constructed  list  of  E'  and  suppose  that 
we  constructed  E'j  when  we  reached  E^j  E  E.  If 
Eij^\  is  obtained  from  Ei  by  swapping  lines  across 
some  border  bk,  i.e.  by  swapping  the  ft/^th  line  a 
with  either  the  {bk  —  l)th  line  /?  or  the  {bk  -1-  l)th 
line  7,  then  we  create  from  E'j  by  swap¬ 

ping  a  with  /?  or  7,  as  appropriate;  we  also  set 
=  z  -)-  1.  It  can  be  shown  that  E'j  approx¬ 
imates  any  list  Ek  G  {Es^,  Es^+i, . . . ,  Es^^,-i}  in 
the  following  sense:  for  any  two  successive  borders 
bi  and  6;+i,  the  (unordered)  set  of  lines  at  positions 
6/  -f  1, . . . ,  in  E'j  is  the  same  as  the  (unordered) 
set  of  lines  in  Ek  at  these  same  positions.  More¬ 
over,  it  can  be  shown  that  the  border-lines  in  E'j 
and  Ek  are  the  same;  this  implies  that  the  border¬ 
lines  in  {Eg-,  Esj+i, . . .,  Es^_^_^-l}  are  the  same  and 
can  be  totally  ordered.  Finally,  using  A;-set  theory, 
it  can  be  proved  that  \E'\  =  0(n^'^“^). 

The  data  structure  consists  of  (1)  a  red-black 
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tree  T-  storing  the  total  order  of  the  border  lines 
of  E-,  (2)  an  instance  £>'  of  the  generalized  1- 
dimensional  range  searching  structure  built  on  the 
colored  integers  Ij  :  (l,2,...,n),  where  integer  p 
gets  the  color  of  the  pth  line  of  Ej,  and  (3)  a  red- 
black  tree  Ij  storing  Ij.  We  make  cdl  these  struc¬ 
tures  persistent. 

Given  q,  suppose  we  need  to  query  Ej.  We  find 
the  smallest  border  6,  on  or  above  q's  upper  end¬ 
point  and,  symmetrically  the  greatest  border  bg. 
We  query  the  structure  Dj  with  [6s  -|-  1 , 6^] .  Then 
we  scan  all  the  integers  6,,  6s  -  1, ,  6s-i  -|- 1  in  Ij 
and  report  the  distinct  colors  of  the  lines  of  Ej  at 
these  positions  that  are  intersected  by  g;  symmet¬ 
rically  for  bg. 

Theorem  3.2  A  set  S  of  n  colored  lines  in  'R?  can 
be  stored  in  a  data  structure  of  size  0{n'^'^~^logn) 
such  that  the  i  distinct  colors  of  the  lines  that  are 
intersected  by  a  vertical  query  line  segment  can  be 
reported  in  0{n^  -f  i)  time.  Here  p  is  a  tunable 
parameter  in  the  range  0.5  <  /i  <  1.  The  problem 
is  also  solvable  in  0{n'^  log  n)  space  and  O(log  n+i) 
query  time.  □ 

We  remark  that  our  method  is  based  on  an  idea 
used  in  [AvK093]  for  a  different  (standard)  prob¬ 
lem.  However,  there  are  several  crucial  differences: 
(i)  because  our  problem  is  a  generalized  one,  our 
algorithm  for  constructing  S'  uses  a  different  swap 
criterion,  (ii)  because  our  query  is  a  vertical  line 
segment  rather  than  a  ray  (as  in  [AvK093]),  our 
choice  of  borders  needs  to  be  different  in  order  to 
get  a  sublinear  query  time.  Further  discussion  of 
this  appears  in  [GJS93c]. 

3.3  Querying  colored  line  segments  with 
a  line  segment 

3.3.1  Vertical  query  line  segments 

We  can  use  an  approach  similar  to  the  one  at 
the  beginning  of  Section  3.2.  However,  since  we 
are  now  dealing  with  line  segments  rather  than 
lines,  we  must  overcome  a  subtle  problem  that  can 
arise.  We  discuss  this  later.  We  draw  vertical  lines 
through  the  endpoints  and  the  intersection  points 
of  the  segments  of  S.  Within  any  strip,  the  seg¬ 
ments  that  cross  it  can  be  totally  ordered.  We 


sweep  over  the  strips  starting  at  the  leftmost  non¬ 
empty  strip.  Let  s\,S2,....,Sm  be  the  segments 
that  cross  this  strip,  sorted  from  bottom  to  top. 
For  1  <  i  <  m,  we  give  s,-  a  label  Z(s,)  =  i  and  give 
this  label  the  color  of  s,-.  We  store  the  segments 
si, . . . ,  in  this  order  in  a  partially  persistent  red- 
black  tree  Ts.  We  also  store  the  colored  labels  Z(si), 

1  <  i  <  m,  in  a  partially  persistent  version  Ti  of 
the  data  structure  of  [GJS93a]  for  the  generalized 
1-dimensional  range  reporting  problem. 

Suppose  we  sweep  from  the  zth  to  the  (i  +  l)th 
strip.  The  cases  where  we  encounter  a  right  end¬ 
point  of  a  segment  or  the  intersection  point  of  two 
segments  are  easy  to  handle  [GJS93c].  If  we  en¬ 
counter  the  left  endpoint  of  segment  s,  then  in  the 
current  version  of  Ts,  we  locate  s.  Let  t  and  u 
be  the  segments  that  are  immediately  below  and 
above  s  in  the  (i  -f  l)th  strip.  We  insert  s  into  the 
current  version  of  Ts  and  store  with  it  a  label  /(s) 
that  lies  between  l{t)  and  l(u).  Moreover,  we  give 
the  label  l{s)  the  same  color  as  s  and  insert  this 
colored  number  into  the  current  version  of  Ti. 

In  this  scheme,  we  need  to  assign  special  labels 
to  the  segments  as  we  encounter  them  because  all 
the  segments  are  not  present  in  each  strip.  How¬ 
ever,  we  must  be  careful  in  choosing  the  labels  since 
otherwise  we  may  end  up  getting  labels  consisting 
of  0(n)  bits.  Towards  this  end,  we  use  a  label¬ 
ing  scheme  due  to  Dietz  and  Sleator  [DS87].  Using 
their  approach  we  take  integer  labels  in  the  range 
[0..O(n^)],  i.e.,  labels  consisting  of  only  O(logn) 
bits.  We  need  to  give  segment  s  a  label  that  lies  in 
between  l{t)  and  l{u).  Using  the  scheme  of  [DS87], 
this  may  result  in  the  relabeling  of  other  segments. 
Dietz  and  Sleator  show  how  to  choose  the  labels 
such  that  only  0(1)  amortized  relabelings  are  nec¬ 
essary  per  update.  If  we  relabel  segment  s  from 
l{s)  to  Z'(s),  then  we  just  delete  the  colored  num¬ 
ber  l{s)  from  T;  and  insert  the  number  l'{s),  having 
the  same  color  as  l{s),  into  it. 

Now  let  g  be  a  vertical  query  segment.  We  locate 
the  strip  containing  g  and  then  search  in  the  version 
of  Ts  corresponding  to  this  strip  for  the  lowest  and 
highest  segments  s  and  t  that  intersect  g.  Finally, 
we  search  in  the  version  of  T;  corresponding  to  this 
strip  for  the  distinct  colors  of  aU  labels  that  are 
contained  in  the  interval  [Z(5),Z(t)]. 

Theorem  3.3  A  set  S  of  n  colored  line  segments 
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in  the  plane  can  be  preprocessed  into  a  data  struc¬ 
ture  of  size  0((n  +  x)logn)  such  that  the  i  dis¬ 
tinct  colors  of  the  segments  intersected  by  a  vertical 
query  line  segment  q  can  be  reported  in  O(log  n  +  z) 
time.  Here  X>  0  <  x  ^  Q);  number  of  pair¬ 

wise  intersections  among  the  segments  in  S.  □ 

3.3.2  Arbitrary  query  line  segments 

If  q  is  not  vertical,  then  we  use  a  different  approach 
which  we  sketch  briefly.  We  break  up  the  input  seg¬ 
ments  at  their  X  intersection  points  and  store  them 
in  a  segment  tree  T.  For  any  node  v  £  T,  the  set 
S{v)  of  segments  allocated  to  v  can  be  totally  or¬ 
dered  within  Strip{v)  as  from  bottom 

to  top.  Suppose  q  =  ab  is  such  that  a,  6  G  Strip(v), 
with  a  below  b.  Let  s^  be  the  lowest  segment 
of  S(v)  that  is  above  a;  define  s^j  symmetrically 
w.r.t.  b.  Clearly  it  suffices  to  report  the  distinct 
colors  in  {s„,  , . . . ,  which  we  can  solve  us¬ 

ing  a  structure  for  generalized  1-dimensional  range 
searching.  If  q  =  [a,  6]  is  not  as  above,  then  we  can 
identify  a  set  V  of  O(logn)  nodes  in  T  such  that 
for  each  v  £  V,  the  segment  q^  zz  qf)  Strip(v)  is  as 
above.  We  simply  query  at  each  such  v  with 

Theorem  3.4  A  set  S  of  n  colored  line  segments 
in  V?  can  be  preprocessed  into  a  data  structure  of 
size  0{{n  -|-  x)log^)  so  that  the  i  distinct  colors 
of  the  segments  that  are  intersected  by  a  query  seg¬ 
ment  q  can  be  reported  in  O(log^  n  -f  ilogn)  time. 
Here  X)  0  <  X  <  (2)  number  of  pairwise 

intersections  between  segments  in  S .  □ 

4  Generalized  triangle  stabbing 

We  consider  the  following  problem:  “Preprocess 
a  set  S  of  n  colored  triangles  in  TZ^,  so  that  the 
i  distinct  colors  of  the  triangles  stabbed  by  any 
query  point  q  can  be  reported  efficiently.” 

Wlog  assume  that  each  triangle  t  £  S  has  a  hor¬ 
izontal  side.  We  group  the  triangle  vertices  into 
0(n^/^)  vertical  strips  each  of  size  0(n^/^).  The 
triangles  that  intersect  any  strip  Vi  form  two  dis¬ 
joint  subsets  Ti  and  T/,  where  T,-  (resp.  T-)  consists 
of  triangles  having  no  (resp.  at  least  one)  vertex 
inside  Vj.  We  further  subdivide  Vj  into  vertical 
strips  by  taking  each  triangle  in  T-  and  drawing  a 
vertical  line  through  each  of  its  vertices  that  lies 


in  Vi.  Let  Wij  be  any  such  substrip  within  Vi  and 
let  Tij  consist  of  the  triangles  of  T-  that  cross  Wij. 
Note  that,  by  construction,  no  triangle  of  T,j  can 
have  a  vertex  inside  Wij.  (This  partitioning  tech¬ 
nique  is  reminiscent  of  a  method  used  in  [OY88]  for 
computing  the  measure  of  the  union  of  iso-oriented 
boxes  in  TZ'^.) 

Given  the  query  point  q,  suppose  that  q  £  Vi 
and  q  £  Wij.  Then  we  need  to  only  report  the 
distinct  colors  of  the  triangles  of  T,-  and  of  Tij  that 
are  stabbed  by  q.  We  discuss  how  to  do  this  for  F, 
and  Ti.  (The  discussion  for  Wij  and  Tij  is  similar.) 

For  any  triangle  t  £  Ti,  let  p(t)  be  the  vertex  of 
t  shared  by  the  horizontal  side  h{t)  of  t  and  the 
slanted  side  s(t)  of  t  which  crosses  F).  Let  w(t)  be 
the  wedge  defined  by  p(t)  and  (the  extensions  of) 
h{t)  and  s{t).  Consider  the  set  Ri  C  Ti  that  yields 
right-facing  wedges.  Clearly,  q  stabs  t  £  Ri  iff  the 
horizontal,  leftward  directed  ray  from  q  intersects 
w{t).  We  can  now  solve  the  resulting  generalized 
ray-rightwedge  intersection  problem  by  computing 
left  envelopes  and  using  a  segment  tree  (as  in  Sec¬ 
tion  3.1).  Symmetrically  for  left-facing  wedges. 

Theorem  4.1  A  set  S  of  n  colored  triangles  in 
TZ^  can  be  stored  in  a  data  structure  of  size 
0(n^/^logn)  such  that  the  i  distinct  colors  of  the 
triangles  that  are  stabbed  by  a  query  point  q  can  be 
reported  in  O(log^  n  +  i)  time.  □ 

5  Generalized  triangle  range 
searching 

We  wish  to  preprocess  a  set  S  of  n  colored  points 
in  TZ^  so  that  the  i  distinct  colors  of  the  points 
contained  in  a  query  triangle  q  can  be  reported 
efficiently.  For  arbitrary  q  we  note  that  the  problem 
can  be  solved  in  0(n^+^)  space  and  O(logn  -b  i) 
query  time  using  constant-depth  cutting  trees.  In 
the  rest  of  this  section,  we  give  an  efficient  solution 
for  query  triangles  that  are  fat.  We  define  a  fat- 
triangle  to  be  a  triangle  in  which  each  internal  angle 
is  at  least  7  for  some  constant  7  >  0. 

We  begin  with  a  solution  for  a  query  fat-wedge  q. 
i.e.,  a  wedge  where  the  internal  angle  at  the  vertex 
Vq  of  q  is  at  least  7.  For  now  assume  that  q  is  y- 
monotone.  We  store  the  points  of  S  at  the  leaves  of 
a  balanced  binary  search  tree  T  by  non-decreasing 
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^-coordinates  from  left  to  right.  We  augment  each 
node  u  of  T  with  an  instance  HP{y)  of  the  struc¬ 
ture  of  Theorem  2.1  for  generalized  halfplane  range 
reporting;  HP{v)  is  built  on  the  points  in  u’s  de¬ 
scendant  leaves. 

Given  q,  we  divide  it  into  an  upper  wedge  qa  and 
a  lower  wedge  qb,  by  drawing  a  horizontal  line  L 
through  Vg.  Consider  qa-  Let  la  be  the  line  con¬ 
taining  the  slanted  side  of  qa.  We  search  in  T  us¬ 
ing  the  y-coordinate  of  Vg  and  determine  a  set  Va 
of  nodes  that  lie  to  the  right  of  the  search  path. 
We  query  HP{v)  at  each  v  eVa  with  the  halfplane 
l~ .  Symmetrically  for'qt.  It  can  be  shown  that 
the  space  used  is  O(nlog^  n)  and  the  query  time  is 
C>(log^  n  -f  ilogn). 

What  if  q  is  not  y-monotone?  In  preprocess¬ 
ing,  we  select  t  =  [27r/7]  coordinate  systems  C,-  = 
(a;,y,),  where  all  the  C,-  share  the  same  origin  and 
Ci+i  is  offset  from  C,-  by  an  angle  7,0<i<t— 1 
(indices  are  modulo  t.)  Within  each  Ci  we  build 
an  instance  of  the  data  structure  for  y, -monotone 
fat-wedges.  Given  a  query  fat-wedge  q,  we  locate  a 
Ci  such  that  q  is  y, -monotone  and  then  query  the 
associated  structure.  (It  is  easily  seen  that  such  a 
Ci  exists.) 

Now  consider  the  case  of  a  query  fat-triangle 
q.  We  store  the  points  by  non- decreasing  x- 
coordinates  from  left  to  right  in  a  balanced  search 
tree  T'  and  augment  each  node  v  with  an  in¬ 
stance  FW{v)  of  the  structure  given  above  for  fat- 
wedges,  which  is  built  on  the  points  in  u’s  descen¬ 
dant  leaves.  Given  q,  we  divide  it  into  two  triangles 
qi  and  qr,  each  with  a  vertical  side  s,  with  qi  to  the 
left  of  s  and  qr  to  the  right.  We  search  in  T'  with 
the  x-coordinate  of  s  and  identify  a  set  VJ  of  nodes 
that  lie  to  the  left  of  the  search  path.  For  each 
node  V  €  Vi,  we  query  FW{v)  with  the  wedge  sup¬ 
porting  qi,  which  is  a  fat-wedge.  Symmetrically  for 
qr. 


Theorem  5.1  Let  j  >  0  be  a  constant.  A  set  S 
of  n  colored  points  in  P?'  can  be  stored  in  a  data 
structure  of  size  O(nlog^n)  such  that  the  i  distinct 
colors  of  the  points  that  are  contained  in  a  query 
triangle  q  each  of  whose  internal  angles  is  at  least 
7  can  be  reported  in  time  O(log^  n  -h  ilog^  n).  □ 


6  Conclusions  and  further  work 

We  have  presented  efficient  solutions  to  several 
generalized  intersection  problems  involving  non- 
iso-oriented  objects.  Our  methods  have  included 
sparse  representations,  persistence,  and  filtering 
search. 

Besides  improving  our  bounds,  three  problems 
are  of  particular  interest:  (i)  obtaining  dynamic 
data  structures  for  the  generalized  problems  con¬ 
sidered  here,  (ii)  obtaining  linear-space  or  near 
linear-space  solutions  with  output-sensitive  query 
times  (of  the  form  0{n^+i)  or  0{n^ -\-i-polylog{n))^ 
0  <  p  <  1)  for  the  generalized  halfspace  range 
searching  problem  in  d  >  4  dimensions  and  for 
the  generalized  simplex  range  searching  problem 
in  d  >  2  dimensions,  and  (iii)  solving  the  counting 
versions  of  Problems  3-7  in  Table  1. 
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3rd  Annual  Video  Review  of  Computational  Geometry 


Computational  geometry  concepts  are  often  easiest  to  understand  visually. 
Indeed,  most  papers  in  computational  geometry  rely  on  diagrams  to  communi¬ 
cate  the  intuition  behind  their  results.  However,  static  figures  are  not  always 
adequate  to  describe  geometric  algorithms,  since  algorithms  are  inherently  dy¬ 
namic.  The  accompanying  videotape  showcases  advances  in  the  use  of  algorithm 
animation,  visualization,  and  interactive  computing  in  the  study  of  computa¬ 
tional  geometry.  The  following  pages  contain  short  descriptions  of  the  eight 
segments  in  the  videotape. 

The  eight  video  segments  present  a  variety  of  geometric  algorithms  and  con¬ 
cepts,  organized  by  dimensionality.  The  first  three  segments  are  animations  of 
two-dimensional  algorithms:  a  polygonal  approximation  algorithm  from  cartog¬ 
raphy,  an  algorithm  for  computing  the  rectangle  discrepancy,  and  a  fixed-radius 
neighbor- searching  algorithm.  The  fourth  video  presents  the  GASP  system  for 
animating  geometric  algorithms  in  two  or  three  dimensions;  GASP  is  used  by 
two  other  segments  in  the  review.  The  last  four  segments  illustrate  algorithms 
and  concepts  in  three  or  more  dimensions:  The  fifth  segment  explores  the  rela¬ 
tionship  between  penumbral  shadows  in  three  dimensions  and  four-dimensional 
polytopes.  The  final  three  segments  show  three-dimensional  algorithms  for  in¬ 
teractive  collision  detection,  polyhedral  separators,  and  interactive  display  of 
alpha  hulls. 

We  thank  the  members  of  the  Video  Program  Committee  for  their  help  in 
evaluating  the  entries.  The  members  of  the  committee  were  Marshall  Bern 
(Xerox  PARC),  Marc  Brown  (DEC  SRC),  Leo  Guibas  (Stanford),  John  Hersh¬ 
berger  (Mentor  Graphics),  Jim  Ruppert  (NASA  Ames),  and  Jenny  Zhao  (Silicon 
Graphics). 

We  also  thank  Bob  Taylor  of  the  DEC  Systems  Research  Center  for  his 
support  in  publishing  the  1st  and  2nd  Annual  Video  Reviews.  Both  are  avail¬ 
able  free  of  charge  upon  request.  The  ’92  Review  is  SRC  Research  Report 
#87a  (a  written  guide)  and  #87b  (the  videotape),  and  the  ’93  Review  is  re¬ 
ports  #101a  and  #101b.  You  can  obtain  either  by  sending  electronic  mail  to 
src-report@src.dec.com,  or  physical  mail  to  Librarian,  DEC  Systems  Re¬ 
search  Center,  130  Lytton  Ave.,  Palo  Alto,  CA  94301.  You  can  also  get  the 
written  guides  by  anonymous  ftp  from  gatekeeper.dec.com  in  the  directory 
pub/DEC/SRC/research-reports. 
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An  O(nlogn)  Implementation  Of  The  Douglas-Peucker  Algorithm 

For  Line  Simplification 

John  Hershberger  Jack  Snoeyink 

Mentor  Graphics  Department  of  Computer  Science 

University  of  British  Columbia 


Introduction 

An  important  task  of  the  cartographer’s  art  is  to 
extract  features  from  detailed  data  and  represent 
them  on  a  simple  and  readable  map.  As  com¬ 
puters  become  increasingly  involved  in  automated 
cartography,  efficient  algorithms  are  needed  for  the 
tasks  of  extraction  and  simplification.  Cartogra¬ 
phers  [1,  6]  have  identified  the  line  simplificaiion 
problem  as  an  important  part  of  representing  linear 
features.  Given  a  polygonal  chain,  a  sequence  of 
vertices  V  =  {vo,  ui, . . . ,  u„},  the  problem  is  to 
find  a  subsequence  of  vertices  that  approximates 
the  chain  V.  We  assume  that  the  input  chain  V 
has  no  self-intersections  (but  not  the  output). 


In  1973,  Douglas  and  Peucker  (now  Poiker)  [3] 
published  a  simple  recursive  simplification  method 
in  the  cartography  literature  that  has  become  the 
favorite  of  many  [5,  10].  This  method  has  been 
independently  proposed  in  other  contexts  such  as 


vision  [8]  and  computational  geometry  [9].  Our 
video  animates  two  different  implementations  of 
this  simplification  method — the  straightforward  im¬ 
plementation  suggested  in  Douglas  and  Peucker  and 
the  implementation  of  Hershberger  and  Snoeyink 
that  improves  the  worst-case  running  time. 

We  describe  the  method,  the  implementations, 
and  the  animation. 

Method 

Given  a  polygonal  chain  V  =  {uq,  ui, . . . ,  u„},  the 
recursive  line  simplification  method  proceeds  as  fol¬ 
lows;  Initially,  approximate  V  by  the  line  segment 
vpiT.  Determine  the  farthest  vertex  Vf  from  the 
line  ^0^-  h  its  distance  d(vf,^J^)  is  at  most  a 
given  tolerance  £  >  0,  then  accept  the  segment  vov„ 
as  a  good  approximation  to  V.  Otherwise,  break 
V  at  Vf  and  recursively  approximate  the  subchain 
{uo,ui,...,u/}  and  {u/,...,u„}. 

Algorithms 

The  two  implementations  differ  in  the  way  they  find 
Vf,  the  farthest  vertex  from  io^n- 

The  straightforward  approach  measures  distance 
from  ^0^  for  each  vertex  and  takes  the  maxi¬ 
mum.  Thus,  the  running  time  of  this  algorithm 
will  satisfy  a  quicksort- like  recurrence,  with  best 
case  0(nlogn).  Since  geometry  determines  the  far¬ 
thest  vertex,  the  worst-case  running  time  is  0(n^). 
One  cannot  use  linear-time  median  finding  or  ran¬ 
domization  to  improve  the  running  time.  (If  one 
assumes  that  the  input  data  is  such  that  farthest 
vertices  are  found  near  the  middle  of  chains,  then 
one  expects  O(nlogn)  behavior.) 

The  second  implementation  uses  a  path  hull  data 
structure  [2,  4]  to  maintain  a  dynamic  convex  hull 
of  the  polygonal  chain  V.  Using  Melkman’s  convex 
hull  algorithm  [7],  we  compute  two  convex  hulls 
from  the  middle  of  the  chain  outward.  We  can  find 
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a  farthest  vertex  Vf  from  a  line  by  locating  two 
extreme  points  on  each  hull  using  binary  search. 
When  we  split  V  at  vertex  Vf,vfe  undo  one  of  the 
hull  computations  to  obtain  the  hull  from  the  “mid¬ 
dle”  to  Vf,  recursively  approximate  the  subchain 
containing  the  middle,  then  build  hulls  for  the  other 
subchain  and  recursively  approximate  it.  One  can 
use  a  simple  credit  argument  to  show  that  the  total 
time  taken  by  the  algorithm  is  0(n  logn).  The  best 
case  can  even  be  linear,  if  all  the  hulls  are  small  and 
splits  occur  at  the  ends. 

Unfortunately  for  us,  the  statistical  properties  of 
cartographic  data  usually  means  that  the  straight¬ 
forward  implementation  is  slightly  faster  than  the 
path  hull  implementation.  Since  the  variance  of 
the  running  time  of  the  path  hull  implementation 
is  smaller,  it  may  be  interesting  for  parallel  or  in¬ 
teractive  applications. 


Animation 

The  programs  were  run  on  a  Silicon  Graphics  Crim¬ 
son  in  the  GraFiC  lab  at  UBC  and  output  was 
recorded  to  video  in  real  time. 
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Computing  the  Rectangle  Discrepancy  * 
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This  is  the  animation  of  an  algorithm  that  computes  the 
rectangle  discrepancy  of  a  two-dimensional  point  set.  Our 
main  motivation  comes  from  sampling  problems  in  com¬ 
puter  graphics.  Supersampling  is  one  of  the  most  general 
ways  of  attacking  antialiasing.  In  this  approach  we  sample 
the  picture  at  a  very  high  rate,  and  then  we  resample  by 
averaging  supersamples  within  the  pixel  area  to  compute 
the  pixel  value.  Ray  tracing,  a  technique  used  to  produce 
realistic  images  of  computer  modeled  scenes,  is  another  in¬ 
stance  of  supersampling.  Here  we  sample  the  radiance  at  a 
set  of  points  in  each  pixel. 

The  discrepancy  theory  provides  a  measure  of  the  qual¬ 
ity  of  the  sampling  patterns.  The  rectangle  discrepancy 
in  particular  gives  a  good  measure  on  how  well  a  sample 
pattern,  when  applied  to  the  area  of  one  or  more  pixels, 
captures  small  details  in  the  picture.  This  algorithm  allows 
us  to  find  sampling  sets  with  very  low  rectangle  discrep¬ 
ancy.  Interestingly  the  problem  of  computing  the  rectangle 
discrepancy  also  arises  in  the  context  of  learning  theory  [4]. 

The  algorithm  is  presented  in  [2].  We  give  the  problem 
and  we  show  the  main  techniques  used  in  the  algorithm. 
Finally  we  run  the  algorithm  on  a  sample  point  set. 

Let  us  consider  a  point  set  X  in  the  unit  square.  Let  T 
be  the  set  of  all  rectangles  in  the  unit  square  with  edges 
parallel  to  the  two  axis.  For  an  axis  oriented  rectangle  A 
(A  G  we  define  the  function 

IniA)  =  \AnX\ 

The  rectangle  discrepancy  of  a  rectangle  A  £  T  \s 
V{A)  =  \In{A)/\X\  -  Area{A)\ 

The  maximum  rectangle  discrepancy  of  the  set  X  is 
MaxT>(X)  =  max  A^jr{p{A)) 

*This  work  supported  in  part  by  the  National  Science  Foundation  under  Grant 
Number  CCR93-01254  and  by  The  Geometry  Center,  University  of  Minnesota,  an 
STC  funded  by  NSF,  DOE,  and  Minnesota  Technology,  Inc. 


We  also  define  the  following  simpler  functions  for  ^4 
Vi{A)  =  IniA)l\X\ -  Area{A) 
and 

Vo{A)  =  Area(A)  -  In{A)/\X\ 

The  algorithm  we  present  in  this  animation  computes 

MaxViiX)  =  max  a&j^{T^]{A)) 

for  a  given  input  set  X,  in  time  0(\X\^\og  lA”!).  The  same 
techniques  however  can  be  used  to  maximize  Vo,  and  so 
this  problem  is  equivalent  to  finding  MaxV{X). 

We  explain  the  main  idea  of  the  algorithm  in  a  series  of 
simple  lemmata.  First  we  show  that  we  have  to  consider 
at  most  0{\X\‘')  rectangles,  more  specifically  the  ones  that 
have  all  their  edges  pass  through  a  point  in  X. 

This  allows  us  to  study  pairs  of  points  in  in  .T.  For  each 
such  pair  we  find  the  rectangle  that  maximizes  Vi  over  all 
rectangles  with  horizontal  edges  that  pass  through  the  points 
of  the  given  pair. 

This  problem  is  much  easier  now  because  we  know  the  y 
coordinates  of  the  rectangle.  Assuming  that  the  two  points 
are  pi>,pt  (Pbiy)  <  Ptiy)l  and  the  height  of  the  rectangle 
is  dy,  {dy  =  pt{y)  -  Pb{y))  we  have  to  find  the  following 
maximum 

maxABTi.idxA  dy  -  In{A)l\X\) 

where  Tij  is  the  set  of  rectangles  that  pass  through  points 
Pb  and  pu  and  dxA  is  the  width  of  A.  If  we  eliminate  the 
points  in  X  that  either  lie  above  the  top  y  coordinate  of 
the  rectangle  or  below  the  lower  y  coordinate,  this  becomes 
an  one-dimensional  problem.  We  project  the  points  on  the 
X  axis  and  obtain  a  new  set  X'  of  one-dimensional  points. 
We  can  equivalently  maximize  the  following  function 

maxAe[o,i],pb(x)€AiLength(A)  —  IniA)/\X'\) 

We  can  reduce  this  problem  to  the  problem  of  computing 
the  halfspace  discrepancy  of  one-dimensional  point  sets,  a 
simpler  problem  studied  in  [1].  Here  we  want  to  find  the 
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anchored  interval  that  maximizes 

maxA=[o,Zi]{zi  -  il\Z\) 

for  a  given  one-dimensional  set  Z  =  [zy. .  .z^}  with  0  < 
Zi  <  Zi+i  <  1.  This  is  a  linear  function  of  the  rank  and  the 
coordinate  of  the  point.  To  solve  it  we  find  the  convex  hull 
of  the  set  {(2^1 , 1). .  .(z„,  n)}  and  find  the  maximum  with  a 
binary  search.  From  this  maximum  we  obtain  the  solution 
to  the  original  maximization  problem. 

In  this  way  we  compute  the  maximum  rectangle  for  all 
pairs  of  points.  Clearly  the  rectangle  that  maximizes  X>;  is 
the  maximum  of  these  Oi\X\'^)  rectangles. 

Finally,  we  show  how  to  use  the  dynamic  data  structure 
of  [5]  to  solve  a  sequence  of  one-dimensional  problems 
efficiently.  With  the  use  of  this  technique  we  can  maintain 
the  convex  hull  at  a  cost  of  O(log  \X\)  time  per  insertion. 
This  in  turn  allows  us  to  find  the  maximum  rectangle  for 
each  pair  of  points  in  O(log  lA”!)  time,  and  yields  the  final 
running  time  of  0(|A'plog  |A'|). 

This  video  was  prepared  in  the  Computer  Science  de¬ 
partment  at  Princeton  University.  The  implementation  of 
the  algorithm  was  written  in  C,  to  run  under  UNIX  on  a 
Silicon  Graphics  Iris  workstation.  The  animation  was  cre¬ 
ated  with  the  animation  system  GASP  that  Ayellet  Tal  and 
David  Dobkin  are  developing  in  this  department  [3].  Figure 
1  shows  the  animation  running  on  a  sample  input  set.  The 
rectangle  that  maximizes  X>/  is  now  displayed  on  the  screen. 
Recording  was  done  at  the  Interactive  Computer  Graphics 
Lab  at  Princeton  and  editing  was  done  with  the  assistance 
of  the  Department  of  Media  Services  at  Eh-inceton. 
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Fig.  1:  The  rectangle  that  maximizes  X>;. 
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An  animation  of  a  fixed-radius  all-nearest-neighbors  algorithm* 


Hans-Peter  Lenhof  Michiel  Smid 


This  video  shows  an  animation  of  an  algorithm 
due  to  the  authors  [2]  for  solving  the  fixed-radius 
all-nearest-neighbors  problem.  In  this  problem,  we 
are  given  a  set  5  of  n  points  and  a  real  number 
5,  and  we  have  to  report  all  pairs  of  points  that 
are  at  distance  at  most  S.  The  algorithm  works  in 
two  stages.  In  the  first  stage,  a  grid  is  computed. 
Instead  of  a  standard  grid,  we  use  a  so-called  de¬ 
graded  grid  that  is  easier  to  construct  by  means  of 
a  simple  sweep  algorithm.  This  degraded  grid  con¬ 
sists  of  boxes  with  sides  of  length  at  least  S.  If  a 
box  contains  points  of  5,  then  its  sides  are  of  length 
exactly  S.  In  the  second  stage,  this  grid  is  used  for 
the  actual  enumeration.  For  each  non-empty  box, 
it  suffices  to  compare  each  of  its  points  with  aU 
points  in  the  same  box  or  in  one  of  the  neighbor¬ 
ing  boxes.  Although  the  algorithm  may  compare 
many  pairs  having  distance  more  than  6,  it  can  be 
shown  that  the  total  number  of  pairs  considered 
is  proportional  to  the  number  of  pairs  that  are  at 
most  S  apart.  As  a  residt,  the  total  running  time 
of  the  algorithm  is  proportional  to  nlogn  plus  the 
number  of  pairs  that  are  at  distance  at  most 
As  an  application,  we  give  an  animation  of  the 
algorithm  of  Heiden  at  al.[l]  for  triangulating  the 
contact  surface  of  a  molecule.  In  a  first  step,  points 
on  this  surface  are  computed.  Given  these  points. 


‘This  work  was  supported  by  the  ESPRIT  Basic  Re¬ 
search  Actions  Program,  under  contract  No.  7141  (project 
ALCOM  II).  Authors’  address:  Max-PIanck-Institut  fiir  In- 
formatik,  D-66123  Saarbriicken,  Germany. 


we  compute  aU  pairs  that  are  at  distance  at  most  6 
for  a  suitable  choice  of  8.  This  gives  for  each  point 
a  neighbor  list  containing  aU  points  that  are  at  dis¬ 
tance  at  most  8  to  it.  Given  these  lists,  we  can 
triangulate  the  surface  incrementally:  An  edge  of 
the  current  triangulation  is  called  an  outer  edge,  if 
it  belongs  to  only  one  triangle.  AU  outer  edges  are 
maintained  in  a  queue.  With  each  edge,  we  store 
a  so-caUed  common  neighbor  list,  which  is  the  in¬ 
tersection  of  the  two  neighbor  lists  of  the  points 
belonging  to  this  edge.  We  start  with  any  trian¬ 
gle.  Its  edges  and  their  common  neighbor  fists  are 
inserted  into  the  queue.  As  long  as  the  queue  is 
non-empty,  we  do  the  following:  We  take  any  edge 
from  the  queue  and  look  at  all  points  in  its  common 
neighbor  list.  Each  such  point  defines  a  triangle 
with  the  current  edge.  We  take  that  point  defin¬ 
ing  the  smallest  triangle.  This  triangle  becomes 
part  of  the  triangulation  and  we  update  the  queue 
appropriately. 

The  video  was  produced  at  the  Max-Planck- 
Institute  for  Computer  Science,  using  Silicon 
Graphics  Indy  and  Indigo^  machines,  and  using 
Showcase  and  Geomview-1.1  software.  We  thank 
Roland  Berber! ch,  Jack  Hazboun,  Dirk  Louis  and 
Peter  MiiUer  for  their  help  in  producing  this  video. 
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GASP  -  A  System  to  Facilitate  Animating  Geometric 

Algorithms  * 

Ayellet  Tal  David  Dobkin 

Department  of  Computer  Science 
Princeton  University 
Princeton,  NJ  08544 


The  GASP  system  helps  in  the  creation  and  viewing 
of  animations  for  geometric  algorithms.  The  user  need 
not  have  any  knowledge  of  computer  graphics  in  order  to 
quickly  generate  an  animation.  GASP  allows  the  fast  pro¬ 
totyping  of  algorithm  animations.  A  typical  animation  can 
be  produced  in  a  matter  of  days  or  even  hours.  Even  highly 
complex  geometric  algorithms  can  be  animated  with  ease. 
The  system  is  also  intended  to  facilitate  the  task  of  imple¬ 
menting  and  debugging  geometric  algorithms. 

The  application’s  code  specifies  only  the  structure  of  the 
animation  (which  building  blocks  are  included),  and  the 
structure  of  each  scene  (e.g.,  the  position  of  the  objects). 
The  user  need  not  be  concerned  with  the  way  the  animation 
appears  on  the  screen.  If  the  user  wishes  to  influence  the 
look  of  the  animation  and  not  only  its  structure,  he  can  edit 
an  ASCII  "Style  File”  which  controls  viewing  aspects  of 
the  animation.  Even  when  the  user  changes  the  style  file, 
the  user  needs  no  specific  knowledge  of  computer  graphics. 
The  animation  is  generated  automatically  by  the  system. 
But  a  different  animation  will  be  generated  if  the  style  file 
is  modified. 

The  GASP’S  environment  allows  the  viewer  to  control  the 
execution  of  the  algorithm  in  an  easy  way.  The  animation 
can  be  run  at  varying  speeds:  fast(>>),  slow(>)  or  step  by 
step  (>  I).  The  analogous  <,  <<  and  |  <  push  buttons  run 
the  algorithm  in  reverse.  The  viewer  can  PAUSE  at  any  time 
to  suspend  the  execution  of  the  algorithm  or  can  EJECT  the 
movie.  The  environment  is  invaluable  for  education  and 
debugging. 

This  video  shows  five  examples  of  GASP  in  action. 
These  examples  were  chosen  to  give  a  sense  of  the  range 
of  possible  applications  of  GASP.  In  addition,  two  other 
videos  [1],  [3]  that  use  our  system  appear  in  this  collection 
of  animations. 


This  work  supported  in  part  by  the  National  Science  Foundation  under  Grant 
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To  begin,  we  show  a  clip  from  [6].  This  is  the  hierarchi¬ 
cal  data  stmcture  for  representing  convex  polyhedra  [4,  5]. 
We  explain  a  single  pluck  and  then  show  how  the  hierarchy 
progresses  from  level  to  level.  To  create  the  animation  for 
explaining  a  single  pluck  the  user  need  only  write  the  fol¬ 
lowing  short  code.  The  animation  was  an  aid  in  debugging 
the  algorithm  for  detecting  plane-polyhedral  intersection. 

explaiii_pluck(pQly_ver£_iiQ,  poly _v:ert ices , 

pQly_face_iiQ.  poly.faces,  pQly_iiames, 
vert.nQ,  vertices,  face.nn,  faces) 

{ 

/*■  creafe  the  pelyhedren.  */ 
Begin_alQiiiic(“pQly“) ; 

Create.pQlyhedrQnC^EQ",  pDly_vert_iiQ, 

pQly_face_iiQ,  pQly_vertices ,  pely.faces); 
Bj2tate_HQrld.() ; 

Eiui_ atomic  0 ; 

/*  remove  vertices  and  cores  */ 
Begin_atomic(“pluck“) ; 
Split_polyhedron(poly_names ,  ‘ ‘EQ' ^ , 
vert_rQ,  vertices); 

Er£i_atomicO  ; 

/=t  add.  reH  faces  */ 

Begiii_atomic(“adii_facBs“) ; 
Add._faces(pQly_aamBs[Q3  ,  face_iio,  faces); 
Erd_atQmicO  ; 

/*  ufldo  plrckirg  */ 

L[ndQ(2)  ; 


The  second  animation,  based  on  [10]  is  a  remake  of  [9]. 
The  animation  shows  a  configuration  of  six  tetrahedra  that 
cannot  be  taken  apart  by  translation  with  two  hands.  Then, 
it  presents  a  configuration  of  thirty  objects  that  cannot  be 
taken  apart  by  applying  an  isometry  to  any  proper  sub¬ 
set.  The  purpose  of  the  animation  is  to  illustrate  the  use  of 
GASP  as  an  illustration  tool  for  geometric  configurations. 
It  took  us  far  less  than  a  day  to  generate  that  animation. 
The  animation  was  created  by  the  following  function. 


388 


haiids(){ 

lor  (i  =  Q;  i  ^  stick.im;  i±±){ 
stick  i  */ 

get_pQlyliedrQn(&pQiiils,  ftindicea,  toimax, 
fefmax,  aatQniic_iiame ,  astick_iianie) ; 

BegiiL_alQniic(atQniic_iiaiiie) ; 

Cteate_pQlyliedxQiL(stick_name,  maax,  fmax, 
paints,  indices); 

End_atQmic() ; 

> 

Begin_atQniic(“aatatB“) ; 

RQtate_HQrld() ; 

Eiid_atQmic() ; 

} 

The  following  is  part  of  the  style  file  for  the  above  anima¬ 
tion.  The  style  file  determines  the  following  aspects  of  the 
animation.  The  background  color  is  light  gray.  The  colors 
to  be  chosen  by  GASP  are  colors  which  fit  the  creation  of 
a  video  (rather  than  the  screen).  Each  atomic  unit  spans  10 
frames.  Rotation  of  the  world  is  done  380  degrees  around 
the  Y  axis,  and  it  is  executed  over  160  frames,  instead  of 
over  10.  The  color  of  the  stick  which  fades  in  during  atomic 
unit  stickQ  is  red.  (Similarly,  the  style  file  includes  colors 
for  the  other  sticks.) 

tiegin_glQhal_5£yle 

background  =  Q.9  Q.S  Q.3; 

CQlQi:  =  ItIDEQ; 
frames  =  IQ; 
end_glQhal_  style 

hegin_unit_style  Rotate 
frames  =  16Q; 
rotatiQn_HQrld  =  I  380. Q; 
end_unit_style 

begin_unit_style  stickQ 

color  Q.625  Q.2Z34Z5  Q.2Z34Z5; 
end_unit_  style 

Next  we  show  an  animation  of  a  motion  planning  algo¬ 
rithm  [7].  It  is  a  repeat  of  the  first  part  of  [8].  The  object 
to  be  moved  is  a  disc  and  the  obstacles  are  simple  disjoint 
polygons.  The  goal  of  the  animation  is  to  show  the  use  of 
GASP  in  creating  two-dimensional  animations. 

The  forth  animation,  which  is  based  on  [2],  shows  a  line 
segment  intersection  algorithm  in  action  and  illustrates  its 
most  important  features.  The  animation  consists  of  three 
phases.  The  first  phase  presents  the  initial  line  segments  and 
the  visibility  map  that  needs  to  be  built.  The  second  phase 
demonstrates  that  the  visibility  map  is  being  constmcted 
by  operating  in  a  sweepline  fashion,  scanning  the  segments 
from  left  to  right,  and  maintaining  the  vertical  visibility 
map  of  the  region  swept  along  the  way.  The  third  phase 
demonstrates  that  the  cross  section  along  the  sweepline  is 
maintained  in  a  lazy  fashion.  The  animation  is  used  as 
an  aid  in  explaining  a  highly  complex  algorithm  and  to 


allow  students  to  interact  with  the  algorithm.  A  student  can 
choose  the  initial  line  segments  by  editing  an  ASCII  file 
and  view  how  the  animation  is  changing. 

The  last  clip  animates  heapsort.  The  colored  rods  have 
lengths  to  be  sorted.  We  first  display  the  initial  creation 
of  the  heap.  Next,  simple  motions  are  used  to  control  the 
repeated  operations  of  removing  the  largest  element,  and 
reheaping.  Heapsort  is  given  as  an  example  for  using  GASP 
to  facilitate  the  animation  of  any  algorithm  that  involves  the 
display  of  three  dimensional  geometries. 

This  video  was  prepared  in  the  computer  science  depart¬ 
ment  at  Princeton  University.  The  programs  were  written 
in  C  to  run  under  UNIX  on  a  Silicon  Graphics  Iris.  Record¬ 
ing  was  done  at  the  Interactive  Computer  Graphics  Lab  at 
Princeton  and  editing  was  done  with  the  assistance  of  the 
Princeton  Department  of  Media  Services. 

We  thank  Kirk  Alexander  and  Mike  Mills  for  their  help 
in  producing  the  final  video. 
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Penumbral  Shadows 


Adrian  Mariano*  Linus  Upson'*' 


The  shadows  cast  by  polygonal  occluding  objects  illu¬ 
minated  by  polygonal  light  sources  are  complicated.  Un¬ 
like  shadows  cast  by  point  sources,  they  are  divided  into 
regions  where  the  shadow  intensity  changes  at  different 
rates.  The  boundaries  between  these  different  regions 
form  the  shadow  diagram. 

The  video  tape  depicts  shadows  cast  by  a  square  light 
source  with  a  square  occluding  object.  We  have  included 
two  types  of  images:  flat  shadow  scenes  and  perspective 
views.  The  flat  shadow  images  were  calculated  using 
the  exact  formula  for  radiative  energy  transfer.  This 
calculation  depends  only  on  the  boundary  of  the  visible 
light  source.  For  polygons,  it  can  be  reduced  to  a  simple 
summation.  [1] 

Finding  the  boundary  of  the  visible  portion  of  the  illu¬ 
minating  polygon  when  it  is  occluded  by  a  second  poly¬ 
gon  requires  computing  the  intersection  of  two  polygons. 
General  purpose  algorithms  for  computing  intersections 
of  arbitrary  polygons  in  the  plane  are  quite  complicated, 
so  we  used  the  Sutherland-Hodgman  algorithm  [2]  which 
cannot  handle  concave  polygons,  but  which  is  much  sim¬ 
pler  than  the  fully  general  methods  [3].  We  created  the 
three-dimensional  perspective  views  from  the  flat  images 
using  the  texture  mapping  capabilities  of  Rayshade. 

The  shadow  diagrams  can  be  obtained  by  projecting 

*adrian®cam. comell.edu,  Center  for  Applied  Math,  657  ETC, 
Cornell  University,  NY  14853-3801. 

^Lmus-Upson@NeXT.com,  NeXT,  900  Chesapeake  Drive,  Red¬ 
wood  City,  CA  94063. 


the  edges  or  vertices  of  the  light  source  through  vertices 
or  edges  of  the  occluder  onto  the  plane  of  the  shadow. 
The  video  illustrates  that  when  a  square  occluding  ob¬ 
ject  is  illuminated  by  a  square  light  source,  the  shadow 
diagram  is  a  projection  of  a  four-dimensional  hypercube. 

In  general,  the  shadow  diagrams  for  an  arbitrary  light 
source  and  occluder  can  be  seen  as  projections  of  four¬ 
dimensional  polytopes  by  considering  the  lines  which  in¬ 
tersect  the  plane  of  the  shadow.  Let  S  be  the  plane  of 
the  shadow  and  let  P  be  a  different  plane  parallel  to  S. 
If  a  line  intersects  S  at  the  point  {xg ,  yg)  and  it  intersects 
P  at  the  point  {xp,yp)  then  the  line  can  be  mapped  to 
a  point  {xg,yg,Xp,yp)  in  in  R^.  Now  let  D  be  the  set  of 
all  lines  which  intersect  both  the  light  source  and  the  oc¬ 
cluder.  When  D  is  mapped  into  the  result  is  a  four¬ 
dimensional  polytope.  If  this  polytope  is  projected  into 
R^  by  discarding  the  last  two  coordinates  of  each  point, 
the  shadow  diagram  results.  Because  P  can  be  changed, 
this  construction  describes  a  particular  shadow  diagram 
as  the  projections  of  any  one  of  a  family  of  polytopes. 
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Abstract: 

We  demonstrate  algorithms  for  coUision  detection  in 
large-scaled,  interactive  environments.  Such  environments 
are  characterized  by  the  number  of  objects  undergoing  rigid 
motion  and  the  complexity  of  the  models.  We  do  not  as¬ 
sume  that  the  motions  of  the  objects  are  expressible  as 
closed  form  functions  of  time,  nor  do  we  assume  any  upper 
bounds  on  their  velocities. 

Over  the  last  few  years,  a  great  deal  of  interest  has  been 
generated  in  virtual  or  synthetic  environments,  such  as  ar¬ 
chitectural  walkthroughs,  visual  simulation  systems  for  de¬ 
signing  the  shapes  of  mechanical  parts,  training  systems, 
etc.  Interactive,  large-scaled  virtual  environments  pose  new 
challenges  to  the  collision  detection  problem,  because  they 
require  performance  at  interactive  rates  for  thousands,  of 
pairwise  intersection  tests  [1]. 

Our  algorithms  use  a  two-level  hierarchical  detection 
test  for  each  model  to  selectively  compute  the  precise 
contact  between  objects,  achieving  real-time  performance 
without  compromising  accuracy.  At  the  top  level,  we  use 
a  dimension  reduction  approach  (based  on  a  technique 
termed  as  sweep  and  prune)  and  the  temporal  and  geomet¬ 
ric  coherence  that  exists  between  successive  frames  to  over¬ 
come  the  bottleneck  of  O(ra^)  pairwise  comparison  tests  in 
a  large  environment  of  n  moving  objects  [l].  After  we  elim¬ 
inate  the  objects  pairs  which  are  not  in  the  close  vicinity  of 
each  other,  we  employ  an  exact  collision  detection  scheme. 
The  exact  convex  polytope  collision  detection  algorithm  is 
based  upon  [2]:  locality  (captured  by  walking  from  one 
Voronoi  region  of  the  candidate  feature  to  one  of  its  neigh¬ 
boring  feature’s  Voronoi  region  according  to  the  constraints 
failed)  and  coherence  (used  to  reduce  computation  efforts 
in  a  dynamic  environment).  We  can  deal  with  non-convex 
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•  (Pa  (Ea ,  Vi,)  since  Vj  fails  the  applicabil¬ 

ity  test  imposed  by  the  constraint  plane  CP. 


Figure  1:  An  Example  of  collision  detection  for  convex 
polyhedra 

objects  or  articulated  bodies  by  using  a  sub-part  hierarchi¬ 
cal  tree  representation. 

•  Exact  Collision  Detection 

The  Voronoi  regions  form  a  partition  of  space  outside  the 
polytope  according  to  the  closest  feature.  The  collection  of 
Voronoi  regions  of  each  polytope  is  the  generahzed  Voronoi 
diagram  of  the  polytope.  Note  that  the  generalized  Voronoi 
diagram  of  a  convex  polytope  has  linear  size  and  consists 
of  polyhedral  regions.  Each  Voronoi  region  is  bounded  by 
a  set  of  constraint  planes  with  pointers  to  the  neighboring 
cells  (which  each  share  a  constraint  plane  with  it). 

If  a  point  lies  on  a  constraint  plane,  then  it  is  equi¬ 
distant  from  the  two  features  which  share  this  constraint 
plane  in  their  Voronoi  regions.  If  a  point  P  on  object  A 
lies  inside  the  Voronoi  region  of  the  feature  /s  on  object 
B,  then  fs  is  a  closest  feature  to  the  point  P. 

Our  method  is  straightforward.  We  start  with  a  candi¬ 
date  pair  of  features,  one  from  each  polytope,  and  check 
whether  the  closest  points  lie  on  these  features.  Since  the 
polytopes  and  their  faces  are  convex,  this  is  a  local  test 
involving  only  the  neighboring  features  of  the  current  can¬ 
didate  features.  If  either  feature  fails  the  test,  we  step  to 
a  neighboring  feature  of  one  or  both  candidates,  and  try 
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again.  With  some  simple  preprocessing,  we  can  guarantee 
that  every  feature  has  a  constant  number  of  neighboring 
features.  This  is  how  we  can  verify  or  update  the  closest 
feature  pair  in  expected  constant  time.  Fig.  1  shows  how 
this  algorithm  works  on  a  pair  of  simple  convex  polytopes. 

•  Sweep  and  Prune 

The  exact  polytope  collision  detection  algorithm  is  fast 
enough  to  perform  coUision  detection  at  real-time,  in  a 
multi-body  environment  of  moderate  size.  However,  the 
0[n^)  pairwise  detection  tests  become  a  bottleneck  for 
large  n.  By  exploiting  the  spatial  and  temporal  coherence, 
we  use  a  method  whose  running  time  is  a  linear  function  of 
the  number  of  objects  and  the  number  of  pairs  of  objects 
in  close  vicinity  of  each  other. 

For  each  object  in  the  environment,  we  compute  a  small¬ 
est  fitting,  axis-aligned  bounding  box,  which  is  dynamically 
resized  using  a  hiU-cUmbing  algorithm  based  on  the  prop¬ 
erties  of  convex  polytopes  and  the  principle  of  coherence. 

These  bounding  boxes  are  used  to  prune  down  the  num¬ 
ber  of  pairwise  collision  detection  tests.  We  take  the  or- 
thogoncJ  projections  of  the  bounding  boxes  onto  the  x,  y, 
and  z  axes  and  sort  the  intervals  in  three  separate  lists.  If 
the  projections  overlap  in  all  three  dimensions,  we  activate 
the  exact  collision  detection  algorithm.  At  each  frame,  we 
sweep  over  the  sorted  lists  of  intervals,  updating  them  and 
culling  out  the  pairs  of  non-overlapping  bounding  boxes. 

Each  polytope  in  the  environment  now  has  a  bounding 
box.  We  only  call  the  exact  coUision  detection  algorithm 
between  two  objects  when  their  bounding  boxes  overlap. 
Due  of  coherence,  the  sorted  lists  of  intervals,  constructed 
from  the  projections  of  the  bounding  boxes,  change  lit¬ 
tle  from  frame  to  frame,  so  we  use  a  bubble  or  insertion 
sort  to  update  these  Usts  in  expected  linear  time.  Without 
the  bounding  box  hierarchy,  we  would  have  to  perform  a 
quadratic  number  of  pairwise  exact  detection  tests. 

•  Video  Description 

We  have  successfuUy  demonstrated  the  algorithm  in  an 
architectural  walkthrough  environment  and  a  multi-body 
simulation  system.  For  simulations  consisting  of  thousands 
of  models,  the  overaU  algorithm  has  exhibited  linear  time 
performance  in  practice. 

We  first  demonstrate  how  the  polytope  collision  detec¬ 
tion  algorithm  [2]  tracks  the  closest  features  between  two 
convex  polytopes  in  real  time.  We  maintain  and  update 
the  closest  features  using  the  Voronoi  regions  of  the  poly¬ 
topes.  Since  the  polytopes  move  only  sUghtly  between  the 
frames,  the  temporal  and  geometric  coherence  is  weU  pre¬ 
served.  Thus,  the  expected  running  time  of  this  algorithm 
is  constant,  independent  of  the  complexity  of  the  polytopes. 

This  is  foUowed  by  demonstration  the  algorithm  on  an 
articulated  body,  namely  a  hand,  represented  by  the  union 
of  multiple  convex  polytopes.  To  demonstrate  the  efficiency 
of  the  algorithm  and  effectiveness  of  the  coherence,  we  keep 
track  of  all  pairs  of  closest  features  between  each  finger  and 
the  nearby  objects.  However,  using  a  subpart  hierarchical 
tree  representation  as  in  [2]  can  eliminate  the  unnecessary 
computation. 

The  over  all  collision  detection  algorithm  is  tested  on  a 


A»;hltectum  for  Multi>bocly 
CoHision  Detecllon 


Figure  2:  The  Architecture  for  multi-body  simulation 

walkthrough  and  a  multi-body  dynamic  simulation  environ¬ 
ment.  The  video  footage  of  walkthrough  was  taken  from  a 
user’s  actual  walkthrough  experience.  The  collision  de¬ 
tection  algorithm  performs  very  well  in  large  environments 
composed  of  hundreds  of  polytopes.  The  kitchen,  like  most 
realistic  environments,  is  fairly  static;  therefore,  we  can  sort 
the  bounding  boxes  of  the  polytopes  in  sub-linear  time  at 
each  instance  (this  requires  a  modified  sorting  algorithm 
which  sorts  only  the  moving  objects).  We  see  very  little 
degradation  of  frame  rate  when  we  add  the  capability  of 
collision  detection  to  our  walkthrough  environment. 

We  further  demonstrate  a  multi-body  simulation  with 
several  complex  objects  in  a  moderately  dense  environment. 
The  objects  are  placed  in  a  bounded  3-dimensional  simula¬ 
tion  volume  whose  walls  act  as  barriers.  (The  architecture 
of  multi-body  simulation  is  shown  is  Fig.  2.)  Unlike  the 
walkthrough  environment,  all  the  objects  move  indepen¬ 
dently.  The  dynamics  of  multi-body  simulation  assumes 
elastic  collision.  In  order  to  show  the  asymptotic  time 
bound  on  the  algorithm,  we  increase  the  number  of  pos¬ 
sible  interactions  quadratically,  but  the  frame  time  using 
our  collision  detection  algorithm  increases  only  linearly. 

•  Acknowledgements  We  are  grateful  to  Dr.  Fred 
Brooks  and  the  walkthrough  group  at  UNC  Chapel  HiU  for 
the  model  of  the  kitchen. 
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Abstract 

We  ^nirtiate  two  Heterministic,  poly [lotiiial  time  meth¬ 
ods  lor  hiidiiig  a  aepaxator  tor  two  ueated  c.ocivex.  poly- 
hedxa  iii  3d.  While  this  pmhiem  is  N.H-complete,  we 
show  a  reductioii  to  set  c.over.  We  thea  animate  the 
greedy  method  [d]  and  the  weighted  method  [1]. 

1  Introduction 

Let  Q  C  id  he  two  eouYex  polyhedxa  iu  IR^.  Xhe  prob¬ 
lem  o£  ti riding  a  separating  anivea  polyhedra  between 
id  and  Q  with  as  few  facets  as  possible  is  helieried  to 
be  Aid-hard  [2],  hnt  one  can  find  good  approximations 
for  it.  Mitchell  and  Suri  [4]  cast  the  problem  as  a  set 
coyer  problem.  Let  H  —  H{Q)  denote  the  set  of!  snp- 
porting  hyperplaues  of  Q.  Let  the  cautz  set  sj^stem  he 
(7t,  {Tip  ;  p  e  dA}),  where,  for  any  point  p.  Tip  consists 
of  those  hyperplanes  in  Ti  for  which  p  and  Q  lie  on  op¬ 
posite  sides.  They  show  that  a  hitting  set  for  the  coyer 
set  system  giyes  a  subset  of  facets  of  Q  whose  support¬ 
ing  halfspaces  defme  a  polytope  whose  boundary  lies 
between  Q  and  R. 

A  polytope  between  Q  and  R  such  that  each 
facet  of  is  supported  by  a  facet  of  Q  is  called  cauam- 
aiL  The  smallest  (with  respect  to  the  number  of  facets) 
canonical  poly  tope  contained  in  R  can  he  obtained  from 
a  minimal  hitting  set  of  this  set  system  and  is  within  3 
rimes  the  optimal  separating  polyhedron  (in  the  num¬ 
ber  of  faces).  'Therefore,  in  this  yideo,  we  just  show 
how  to  obtain  an  optimal  canonical  separator. 

*.Sii£>poTf.eH  ill  part  by  NSE  Grant  CCB.-93-Qi2S4,  ttu:  Ge- 
uiusUiy  Center,  University  q£  Minnesota,  an  SIC  funded  by 
NSE,  DQE,  and  Minnesota  Technology  Inn.,  and  Ecule  Nomiale 
Siiperieiire  of  Eads. 


2  Contents  of  the  yideo 

We  first,  familiarise  the  yiewer  with  the  two  polyhe¬ 
dra  Q  C  R]  Q  i&  seen  inside  by  means  of  transparency 
(Eig.  1).  We  explain  how  to  obtain  canonical  separa¬ 
tors,  and  how  to  c.onstrijct  the  set  system.  'The  portion 
of  the  arrangement  A  of  Tf((y)  on  the  boundary  of  A  is 
yisualAed  (Eig.  2),  and  some  rows  of  the  set  system  are 
then  shown  sequentially  (Eig.  3),  one  corresponding  to 
each  facet  of  this  arrangement. 

To  explain  the  greedy  method  on  this  set  system,  an 
approach  which  is  proposed  by  Mitcbell  and  Suri  [4] ,  we 
show  how  the  choice  of  a  facet  removes  a  portion  of  the 
boundary  of  A  (Eig.  4),  and  how  the  greedy  method 
proceeds  sequentially  by  choosing  the  facet  which  coy- 
ers  the  biggest  remaining  portion  (measured  by  the 
number  of  cells  of  A  it  separates  from  Q).  After  four 
iterations,  we  skip  to  the  final  greedy  separator,  which 
has  14  facets.  In  general,  the  greedy  separator  is  gnar- 
anteed  to  have  at  most  log  |(y|  facets,  where  Hapt 
the  sise  of  an  optimal  canonical  separator. 

Another  algorithm  is  then  animated,  it  is  the 
weighting  method  described  by  Goodrich  and  the  au¬ 
thor  in  [1] .  We  show  how  putting  a  weight  on  the  fkces 
of  Q  ill  duces  a  weight  for  the  cells  of  the  arrangement 
A,  and  how  keeping  only  the  heavy  cells  yields  a  sep¬ 
arator  which  is  nothing  else  than  a  (l/2c)-net  for  the 
planes  7{(Q)  supporting  faces  of  Q  (Eig.  5).  We  then 
explain  the  reweighting  strategy  in  this  context.  Eyen- 
tually,  a  separator  is  found  that  has  13  facets  (Eig.  6). 
In  general,  the  weighted  separator  is  guaranteed  to  haye 
0(ai;pj)  facets. 

3  ImplementaiiDn  notes 

Both  algorithms  are  implemented  and  animated  us¬ 
ing  GASH,  a  Geometric  Animation  System  created 
at  Hrinceton  Uniyersity  by  Ayellet  Tal  and  Dayid 
Dobkin  [3].  'The  implementation  is  written  in  G  and 
inns  on  a  Silicon  Graphics  iris  workstation.  'The  algo¬ 
rithm  is  fully  animated,  in  the  sense  that  it  produces  an 
animation  for  any  giyen  data  A  and  Q .  The  data  used 
for  the  yideo  were  created  with  a  random  generator  of 
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piaxLca  taiigeiit  ta  tile  sphere,  thee  iuteraected  ta  ahtaie 
Q,  ffihidi  li^as  hiowe  up  tu  yield  id. 
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Higure  3;  The  row  generated  by  a  particular  cell  con¬ 
sists  of  the  facets  of  Q  visible  from  that  cell  (highlighted 
iu  the  picture) . 


Higure  4;  The  greedy  method  chose  the  highlighted 
facet  on  Q,  thereby  chopping  off  a  few  ceils  on  id. 


Higure  5;  'The  set  of  heavy  ceils  on  id.  'I’he  cover  for 
those  cells  is  a  (l/2c)-uet  for  the  planes  with  Meight.  it 
is  shown  highlighted  on  th  boundary  of  Q. 


Higure  2;  'The  arraugeiueut  olTi^Q)  on  the  boundary  Higure  6:  The  weighted  separator  is  shown  in  traus- 

of  id.  Each  ceil  generates  a  row  of  the  cover  set  system.  pareucy  between  id  aud  Q. 


394 


Interactive  Visualization  of  Weighted  Three-dimensional  Alpha  Hulls 

Amitabh  Varshney  Frederick  P.  Brooks,  Jr.  William  V.  Wright 

Department  of  Computer  Science 
University  of  North  Carolina  at  Chapel  Hill 
Chapel  HiU,  NC  27599-3175 


Abstract 

An  interactive  visualization  of  weighted  three-dimensional 
a-huUs  is  presented  for  static  and  dynamic  spheres.  The 
a-huU  is  analytically  computed  and  represented  by  a  tri¬ 
angulated  mesh.  The  entire  surface  is  computed  and  dis¬ 
played  in  real-time  at  interactive  rates.  The  weighted  three- 
dimensional  a-huUs  are  equivalent  to  smooth  molecular  sur¬ 
faces  of  biochemistry.  Biochemistry  applications  of  interac¬ 
tive  computation  and  display  of  a-huUs  or  smooth  molecular 
surfaces  are  outlined. 

1  Introduction 

The  a-huU  has  been  defined  as  a  generalization  of  the  con¬ 
vex  hull  of  point-sets  by  Edelsbrunner,  Kirkpatrick,  and  Sei¬ 
del  [4,  3].  Given  a  set  of  points  P,  a  ball  b  of  radius  a  is 
defined  as  an  empty  a-ball  if  6  n  P  =  ifi.  For  0  <  a  <  oo, 
the  a-huU  of  P  is  defined  as  the  complement  of  the  union 
of  all  empty  a-balls  [3].  It  has  been  shown  that  it  is  pos¬ 
sible  to  compute  the  o-huU  of  a  set  of  points  P  from  the 
Voronoi  diagram  of  P .  For  a  —  oo  the  a-huU  over  the  set 
of  points  P  is  the  same  as  their  convex  hull.  Edelsbrunner 
and  Miicke  [5]  have  defined  the  a-shape  over  P  to  be  the 
polytope  that  approximates  the  a-huU  over  P  by  replacing 
circular  arcs  of  the  o-hull  by  straight  edges  and  spherical 
caps  by  triangles.  An  a-shape  of  a  set  of  points  P  is  a  sub¬ 
set  of  the  Delaunay  triangulation  of  P.  Edelsbrunner  [3], 
has  extended  the  concept  of  a-shapes  to  deal  with  weighted 
points  (i.e.  spheres  with  possibly  unequal  radii)  in  three  di¬ 
mensions.  An  a-shape  of  a  set  of  weighted  points  Pw  is  a 
subset  of  the  regular  triangulation  of  Pw  ■ 

The  smooth  molecular  surface  of  a  molecule  is  defined  as 
the  surface  which  an  exterior  probe-sphere  touches  as  it  is 
rolled  over  the  spherical  atoms  of  that  molecule.  This  defini¬ 
tion  of  a  molecular  surface  was  first  proposed  by  Richards  [8], 
This  surface  is  useful  in  studying  the  structure  and  inter¬ 
actions  of  proteins,  in  particular  for  attacking  the  protein- 
substrate  docking  problem.  The  analytic  computation  of 
the  molecular  surface  was  first  done  by  Connolly  [2]. 

Looking  at  the  above  definitions,  one  can  see  that  the 


weighted  a-huUs  for  three  dimensions  and  the  smooth  molec¬ 
ular  surface  of  a  molecule  with  a  probe-radius  a  have  the 
same  definitions.  In  this  video  we  present  the  visualization 
of  weighted  a-huUs  as  used  to  define  molecular  surfaces. 

2  Our  Approach 

Computation  of  a-huUs  and  a-shapes  has  traditionally  been 
done  by  first  constructing  power  diagrams  or  regular  tri¬ 
angulations.  Since  these  methods  involve  computing  the 
entire  diagram  or  triangulation  first  and  then  culling  away 
the  parts  that  are  not  required,  their  complexity  is  O(n^)  in 
time,  where  n  is  the  number  of  points.  This  is  worst-case 
optimal,  since  an  a-shape  in  three  dimensions  could  have  a 
complexity  of  Q.{n?). 

However,  when  a-huUs  are  used  as  molecular  surfaces, 
one  can  do  better.  Let  M  =  {Si , . . . ,  Sn},  be  a  set  of 
spheres,  where  each  sphere,  Si,  is  expressed  as  a  pair  {a,  r;), 
a  being  the  center  of  the  sphere  and  n  being  the  radius  of 
the  sphere.  CoUections  of  spheres  representing  molecules 
have  two  interesting  properties;  (i)  the  minimum  distance 
dij  between  any  two  centers  a  and  Cj  is  greater  than  or 
equal  to  a  positive  constant  Imin  —  the  smaUest  bond-length 
in  the  molecule  and  (ii)  the  values  of  aU  the  radii  can  be 
bounded  from  above  and  below  by  strictly  positive  values, 
0  <  Tmin  <T,<  Tmax-  We  take  advantage  of  the  first  prop¬ 
erty  to  arrive  at  better  running  times  for  our  algorithm. 
Stated  simply,  the  first  property  says  that  the  number  of 
neighboring  atoms  within  a  fixed  distance  from  any  atom 
i,  is  always  bounded  from  above  by  a  constant  kmax  that 
depends  on  the  minimum  spacing  between  any  two  atoms. 
We  refer  to  two  atoms  i  and  j  as  neighboring  ii  it  is  possible 
to  place  a  probe  sphere  such  that  it  is  contact  with  both  S{ 
and  Sj. 

A  power  ceU  (as  described  by  Aurenhammer  [1])  for  a 
given  sphere  Si  with  respect  to  M  can  be  computed  as  the 
intersection  of  n  —  1  halfspaces,  each  halfspace  contributed 
by  the  pair  {Si,  Sj),l  <  j  <  n,j  i.  If  the  average  number 
of  neighbors  for  an  atom  is  k,  then  for  our  purposes  it  is  suf¬ 
ficient  to  just  compute  an  approximation  to  the  power  ceU, 
which  we  call  a  feasible  cell,  as  an  intersection  of  k  halfspaces 
(one  halfspace  contributed  by  each  neighbor).  This  can  be 
done  in  deterministic  time  0(1:  log  k).  For  n  atoms,  this  task 
can  be  parallelized  over  n  processors,  each  processor  com¬ 
puting  the  feasible  ceU  for  one  atom.  To  check  if  a  feasible 
ceU  is  non-nuU,  we  use  a  randomized  Unear  programming 
algorithm  that  has  Unear  expected  time  complexity  and  is 
quite  fast  in  practice  [9].  For  details  of  our  approach  the 
interested  reader  can  refer  to  [10]. 
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We  have  used  ideas  from  the  theory  of  packing  and  cover¬ 
ing  of  spheres  to  estimate  the  value  for  k,  the  average  num¬ 
ber  of  neighboring  atoms,  for  an  atom  in  protein  molecules. 
We  can  prove  that  for  proteins,  k  <  140  for  a  probe-sphere 
radius  of  lAA  -  the  radius  of  a  water  molecule.  Details 
about  this  result  can  be  found  in  [11].  In  practice  we  have 
found  that  for  protein  molecules  and  for  a  probe-radius  of 
1.4.4,  k  is  around  45 

In  the  algorithms  for  computing  the  convex  hull  of  a  set 
of  points,  it  is  assumed  that  the  points  are  in  a  general  po¬ 
sition,  i.e.  no  more  than  d  points  lie  on  the  same  d  —  \ 
dimensional  hyperplane.  In  reality  this  assumption  often 
fciils  to  hold,  leading  to  problems.  For  example,  planar  ben¬ 
zene  rings  occur  often  in  proteins,  causing  six  carbon  and 
six  hydrogen  atoms  to  be  aU  coplanar.  One  of  the  recent 
approaches  to  solving  this  problem  has  been  to  perturb  the 
input  point  set  slightly  to  avoid  these  degeneracies.  The  ap¬ 
proach  of  Emiris  and  Canny  [6]  perturbs  the  dimension 
of  the  point  as: 

=  P‘,]  +  g)l  <  i  <  n,l  <  j  <  d  (1) 

where  c  is  a  symbolic  infinitesimal  and  q  is  the  smallest 
prime  greater  than  n.  Instead  of  performing  exact  integer 
arithmetic,  we  just  perturb  the  centers  of  the  spheres  by 
the  above  scheme  and  that  has  worked  quite  well  for  us  in 
practice. 

With  no  preprocessing,  we  can  compute  the  molecular 
surface  for  a  396  atom  Crambin  and  a  probe-radius  of  l.lA, 
using  40  Intel  i860  processors  in  0.2  seconds.  We  have  imple¬ 
mented  a-huUs  on  Pixel-Planes  5  [7],  though  the  method  is 
general  enough  to  be  easily  implemented  on  any  other  par¬ 
allel  architecture.  Our  approach  is  analytically  exact.  The 
only  approximation  is  the  degree  of  tessellation  of  spherical 
and  toroidal  patches. 

3  Applications  in  Biochemistry 

Interactive  computation  and  display  of  molecular  surfaces 
should  benefit  biochemists  in  three  important  ways.  First, 
the  ability  to  change  the  probe-radius  interactively  helps  one 
study  the  surface.  Second,  it  helps  in  visualizing  the  chang¬ 
ing  surface  of  a  molecule  as  its  atom  positions  are  changed. 
These  changes  in  atom  positions  could  be  due  to  user-defined 
forces  as  the  user  attempts  to  modify  a  molecular  model  on 
a  computer.  Third,  it  assists  in  incorporating  the  effects  of 
the  solvent  into  the  overall  potential  energy  computations 
during  the  interactive  modifications  of  a  molecule  on  a  com¬ 
puter. 

4  Scope  for  further  work 

At  present  we  are  not  using  any  incremental  temporal  in¬ 
formation  in  constructing  these  surfaces.  Thus,  if  the  atoms 
move  slightly  from  their  positions,  the  whole  surface  is  re¬ 
computed  from  the  beginning,  although  one  could  conceiv¬ 
ably  store  the  feasible  cell  description  with  each  atom.  As¬ 
suming  the  atoms  of  the  molecule  move  along  continuous 
trajectories,  it  should  be  possible  to  compute  such  surfaces 
(and  indeed  a-huUs  and  a-shapes)  incrementally  and  effi¬ 
ciently  by  using  the  information  from  previous  time  steps. 
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